Tag: instruction tuning

Instant Consumer Technology Team
Jun 17, 2025 · Artificial Intelligence

Mastering Fine‑Tuning Datasets: From Basics to Advanced LLM Techniques

This comprehensive guide explains the importance of fine‑tuning datasets for large language models, covering task classification, dataset formats, supervised and instruction tuning, domain adaptation, multimodal data, and practical code examples to help practitioners build effective training, validation, and test sets.

Fine-tuning · dataset preparation · instruction tuning
33 min read
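As a quick illustration of the dataset formats such guides cover, here is a minimal sketch of instruction-tuning samples in the widely used instruction/input/output JSONL layout, plus a train/validation/test split. The field names and split ratios are illustrative, not taken from the article.

```python
import json
import random

# Instruction-tuning samples in the common instruction / input / output
# layout (field names illustrative).
samples = [
    {"instruction": "Summarize the text.", "input": "LLMs are large neural networks...", "output": "A short summary."},
    {"instruction": "Translate to French.", "input": "Good morning", "output": "Bonjour"},
    {"instruction": "Classify the sentiment.", "input": "I love this phone", "output": "positive"},
    {"instruction": "Answer the question.", "input": "What is 2 + 2?", "output": "4"},
    {"instruction": "Extract the date.", "input": "The report is due on 2025-06-17.", "output": "2025-06-17"},
]

def split_dataset(data, train=0.8, val=0.1, seed=42):
    """Shuffle deterministically, then split into train/validation/test."""
    data = data[:]
    random.Random(seed).shuffle(data)
    n_train = round(len(data) * train)
    n_val = round(len(data) * val)
    return data[:n_train], data[n_train:n_train + n_val], data[n_train + n_val:]

train_set, val_set, test_set = split_dataset(samples, train=0.6, val=0.2)

# Serialize each split as JSONL, one sample per line.
for name, split in [("train", train_set), ("val", val_set), ("test", test_set)]:
    with open(f"{name}.jsonl", "w", encoding="utf-8") as f:
        for s in split:
            f.write(json.dumps(s, ensure_ascii=False) + "\n")
```

Fixing the shuffle seed keeps the split reproducible across runs, which matters when comparing fine-tuning experiments on the same data.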
Architect
Feb 11, 2025 · Artificial Intelligence

DeepSeek: Training Process, Working Principles, and Recent Innovations

The article explains DeepSeek's two‑stage training pipeline—including massive pre‑training on trillions of tokens and post‑training via instruction tuning and reinforcement learning from human feedback—describes the differences between its V3 instruction model and R1 reasoning model, and highlights performance optimizations and emerging research directions.

AI · DeepSeek · Pretraining
8 min read
Bilibili Tech
Jun 14, 2024 · Artificial Intelligence

Technical Report on the Index-1.9B Series: Model Variants, Pre‑training Optimizations, and Alignment Experiments

The report presents the open‑source Index‑1.9B family—base, pure, chat, and character variants—detailing benchmark results, pre‑training optimizations such as a normalized LM‑Head and deeper‑and‑slimmer architectures, the value of even modest amounts of instruction data, alignment via SFT/DPO, role‑play enhancements with RAG, and remaining safety and factual limitations.

LLM · Pretraining · alignment
15 min read
DataFunSummit
May 23, 2024 · Artificial Intelligence

GraphGPT: Enabling Large Language Models as Zero‑Shot Graph Learners

GraphGPT integrates large language models with graph neural networks by introducing graph tokens and instruction tuning, enabling zero‑shot graph learning for tasks such as node classification and link prediction, and demonstrates superior performance and generalization across supervised and zero‑shot benchmarks.

Graph Neural Networks · GraphGPT · Zero-shot Learning
15 min read
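GraphGPT's actual prompt and graph-token format is defined in the paper; the following is only a schematic sketch of the general idea the summary describes—pairing a natural-language instruction with a placeholder slot where GNN-encoded graph tokens would be injected. All names here are hypothetical.

```python
# Schematic sketch (not GraphGPT's actual format): a node-classification
# query expressed as an instruction sample. "<graph>" marks the slot that
# would be replaced by GNN-encoded graph-token embeddings at training time.
def build_graph_instruction(node_id, candidate_labels):
    """Build an instruction-style sample for zero-shot node classification."""
    return {
        "instruction": (
            "Given a citation network represented by the graph tokens <graph>, "
            f"classify the paper at node {node_id} into one of: "
            + ", ".join(candidate_labels) + "."
        ),
        "graph_token_slot": "<graph>",  # placeholder for injected graph embeddings
    }

sample = build_graph_instruction(7, ["AI", "Databases", "Systems"])
```

Because the label set lives in the instruction text rather than in a fixed classifier head, the same tuned model can be queried zero-shot on label sets it never saw during training.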
Sohu Tech Products
Mar 20, 2024 · Artificial Intelligence

Comparison of Base LLM and Instruction Tuned LLM

The diagram contrasts a Base LLM, which merely predicts the next word from training data and can continue stories or answer simple facts but may generate unsafe text, with an Instruction‑Tuned LLM that is fine‑tuned via RLHF to understand and follow commands, delivering more accurate, useful, and safe responses.

AI · AI applications · Base model
7 min read
Sohu Tech Products
Oct 11, 2023 · Artificial Intelligence

EcomGPT: Training an E-commerce Domain Large Language Model via Instruction Tuning

EcomGPT, an Alibaba‑trained e‑commerce large language model, uses a 1.5 million‑sample instruction dataset (EcomInstruct) to demonstrate that domain‑specific instruction tuning dramatically outperforms general‑purpose models on e‑commerce tasks, reducing hallucinations and improving task accuracy, with performance scaling as data diversity increases.

Alibaba NLP · Domain-Specific AI · E-commerce NLP
7 min read
Rare Earth Juejin Tech Community
Jul 30, 2023 · Artificial Intelligence

Understanding Codex: Training Framework, Evaluation Methodology, and Model Performance in ChatGPT’s Code Generation Ability

This article explains how Codex, built on the GPT‑3.5 architecture, is trained and fine‑tuned to give ChatGPT the ability to generate code, detailing the data collection, supervised fine‑tuning, evaluation using HumanEval and the pass@k metric, and presenting performance comparisons with GPT‑3 and Codex‑S.

AI model training · ChatGPT · Codex
11 min read
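The pass@k metric mentioned above has a standard unbiased estimator (introduced in the Codex paper): generate n samples per problem, count the c that pass the unit tests, and estimate the probability that at least one of k randomly drawn samples passes.

```python
from math import comb

def pass_at_k(n, c, k):
    """Unbiased pass@k estimator: 1 - C(n-c, k) / C(n, k),
    where n = samples generated and c = samples passing the tests."""
    if n - c < k:
        return 1.0  # every size-k draw must contain a passing sample
    return 1.0 - comb(n - c, k) / comb(n, k)

# e.g. 10 generations, 3 correct: pass@1 estimates the per-sample success rate
print(pass_at_k(10, 3, 1))  # ≈ 0.3
```

Naively running the model k times and checking whether any attempt passes gives a high-variance estimate; the combinatorial form above averages over all size-k subsets of the n generations, which is why HumanEval-style evaluations sample n much larger than k.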
IT Architects Alliance
Apr 20, 2023 · Artificial Intelligence

Overview of Prominent Large Language Models and Instruction‑Finetuned Variants

This article provides a comprehensive overview of major large language models—including GPT series, T5, LaMDA, LLaMA, BLOOM, and others—detailing their architectures, parameter scales, open‑source status, and the evolution of instruction‑fine‑tuning techniques that improve zero‑shot and few‑shot performance.

AI research · LLM comparison · instruction tuning
24 min read
Architect
Apr 14, 2023 · Artificial Intelligence

Overview of Prominent Large Language Models and Instruction Fine‑Tuning Techniques

The article surveys major large language models—including GPT‑3, T5, LaMDA, Jurassic‑1, MT‑NLG, Gopher, Chinchilla, PaLM, U‑PaLM, OPT, LLaMA, BLOOM, GLM‑130B, and ERNIE 3.0 Titan—explains their architectures, scaling trade‑offs, and then details instruction‑fine‑tuned variants such as T0, FLAN, GPT‑3.5, ChatGPT, GPT‑4, Alpaca and ChatGLM, providing references for further study.

AI · ChatGPT · GPT-3
27 min read
Architecture Digest
Feb 17, 2023 · Artificial Intelligence

Analyzing the Emergent Abilities of ChatGPT and the Technical Roadmap of GPT‑3.5

This article dissects how ChatGPT acquired its surprising capabilities by tracing the evolution from the original GPT‑3 model through instruction tuning, code‑based pre‑training, and reinforcement learning from human feedback, ultimately presenting a comprehensive technical roadmap for reproducing GPT‑3.5‑scale models.

ChatGPT · Emergent Abilities · GPT-3.5
26 min read
IT Architects Alliance
Feb 9, 2023 · Artificial Intelligence

Analyzing the Evolution and Emergent Abilities of GPT‑3.5 Models

This article examines how OpenAI's GPT‑3.5 series evolved from the original GPT‑3 through large‑scale pre‑training, instruction tuning, code training, and RLHF, detailing the origins of language generation, world knowledge, in‑context learning, code understanding, complex reasoning, and the trade‑offs introduced by alignment.

Code Training · Emergent Abilities · GPT-3.5
25 min read
Top Architect
Feb 8, 2023 · Artificial Intelligence

A Technical Roadmap of GPT‑3.5: From Pre‑training to RLHF and Emerging Capabilities

This article analyses how ChatGPT and the GPT‑3.5 series evolved from the original GPT‑3 through large‑scale pre‑training, code‑based training, instruction tuning, and reinforcement learning from human feedback, identifying the origins of their language generation, in‑context learning, world knowledge, code understanding, chain‑of‑thought reasoning, and alignment capabilities while also outlining current limitations.

ChatGPT · Emergent Abilities · GPT-3.5
27 min read
Architect's Guide
Dec 9, 2022 · Artificial Intelligence

Technical Principles and Training Process of ChatGPT

The article explains how ChatGPT builds on the GPT‑3.5 large language model, using human‑annotated data and Reinforcement Learning from Human Feedback (RLHF) across three training stages to improve instruction understanding, answer quality, and continual model enhancement, while also discussing its potential to complement or replace traditional search engines.

AI · ChatGPT · RLHF
15 min read