Tag

data synthesis


DataFunSummit
Jun 6, 2025 · Artificial Intelligence

Automating High‑Quality NL2SQL Data Synthesis with Intermediate Representations

This work tackles the difficulty of incorporating extensive domain knowledge into in‑domain NL2SQL tasks. It proposes an intermediate‑representation‑based data synthesis method that decouples knowledge compliance from SQL generation, enabling automated creation of high‑quality training data at a reported 60× the efficiency of manual work and with over 97% accuracy.

NL2SQL · SQL generation · data synthesis
0 likes · 2 min read
DevOps
May 18, 2025 · Artificial Intelligence

Why the Focus Has Shifted from AI Agents to Agentic Workflows

Although large language models have enabled AI agents that mimic human digital interactions, their commercial accuracy remains far below production standards, prompting the industry to pivot toward agentic workflows and data synthesis, which promise more reliable task automation, reasoning, and observable, auditable processes for knowledge work.

AI Agents · agentic workflows · data synthesis
0 likes · 6 min read
DataFunSummit
Feb 10, 2025 · Artificial Intelligence

Intelligent Decision-Making Large Model ORLM: Research, Training Challenges, Commercialization, and Future Directions

This article presents the ORLM intelligent decision‑making large model, detailing how real‑world decision problems are formalized and solved, the training difficulties and data synthesis methods, the transition from academic research to commercial platforms, and future technical improvement plans.

Decision Modeling · AI · data synthesis
0 likes · 10 min read
Kuaishou Tech
Jul 23, 2024 · Artificial Intelligence

Parrot: Enhancing Multi-Turn Instruction Following for Large Language Models

This paper introduces Parrot, a system that improves the multi-turn instruction-following capabilities of large language models (LLMs) through context-aware preference optimization (CaPO) and synthetic data generation, achieving significant performance improvements with limited training data.

AI research · CaPO · Multi-turn Dialogue
0 likes · 9 min read
DataFunTalk
Aug 27, 2023 · Artificial Intelligence

AIGC and Causal Inference: Mutual Empowerment and Practical Applications

This article explores how generative AI (AIGC) can be used to synthesize structured data, how such synthetic data enhances causal inference tasks, and how agent‑based modeling and the YLearn framework together enable a two‑way synergy between AIGC and causal learning for enterprise AI solutions.

AIGC · Agent-Based Modeling · YLearn
0 likes · 15 min read
Shopee Tech Team
Nov 10, 2022 · Artificial Intelligence

ShopeeVideo OCR: Multi-language Text Recognition System for E-commerce Video

ShopeeVideo OCR is a multi‑language text‑recognition system for Southeast Asian e‑commerce videos that unifies detection, Transformer‑based recognition, layout analysis, and large‑scale synthetic data generation to handle Indonesian, Filipino, English, Vietnamese, Thai and Chinese scripts, delivering industry‑leading accuracy and winning thirteen ICDAR first‑place awards.

Deep Learning · Multi-language OCR · OCR
0 likes · 15 min read
Laiye Technology Team
Sep 28, 2022 · Artificial Intelligence

Checkbox Detection and State Classification Using YOLOv5

This article describes a comprehensive solution for detecting checkboxes in document images and determining their selected or unselected status by combining YOLOv5 object detection, synthetic and semi‑synthetic data generation, specialized post‑processing, and association logic to handle varied shapes, positions, and markings.

Post-processing · YOLOv5 · checkbox detection
0 likes · 13 min read
Architects Research Society
Oct 2, 2016 · Artificial Intelligence

Key Takeaways from Andrew Ng’s Deep Learning Talk at the Bay Area Deep Learning School

The article summarizes Andrew Ng’s presentation at BADLS, highlighting major deep‑learning trends such as the rise of big data, end‑to‑end models, the bias‑variance tradeoff, human‑level performance benchmarks, and practical advice for improving one’s AI skills.

AI Trends · Deep Learning · bias-variance
0 likes · 10 min read