Tagged articles
7 articles
Page 1 of 1
Bilibili Tech
Bilibili Tech
Apr 14, 2026 · Artificial Intelligence

Can 10% of Instruction Data Match Full-Scale Fine-Tuning? The SPICE Solution

The SPICE method leverages Fisher Information Matrix submodularity and a novel gradient‑conflict penalty to select a small, high‑quality subset of instruction‑tuning data, achieving comparable or superior performance to full‑data fine‑tuning while dramatically reducing training cost.

Fisher informationGradient ConflictInstruction Tuning
0 likes · 13 min read
Can 10% of Instruction Data Match Full-Scale Fine-Tuning? The SPICE Solution
Baobao Algorithm Notes
Baobao Algorithm Notes
Nov 11, 2025 · Artificial Intelligence

Why Redesign the Training Stack? Inside Olmo‑Thinking’s Open‑Source RL Journey

This article provides a detailed technical analysis of the Olmo‑Thinking project, covering why a new open‑source LLM was built, the challenges of reinforcement learning at scale, data‑mix optimization, architectural bottlenecks such as missing GQA and QK‑Norm, and the post‑training techniques used to improve reasoning and long‑context capabilities.

Open-source modelsRLVRdata selection
0 likes · 20 min read
Why Redesign the Training Stack? Inside Olmo‑Thinking’s Open‑Source RL Journey
AntTech
AntTech
Sep 19, 2025 · Artificial Intelligence

How Reinforcement Learning Cuts Hallucinations in Large Language Models: Ant Insurance’s Proven Approach

Ant Insurance’s tech team leveraged reinforcement learning, focused data selection, and a multi‑dimensional reward system to dramatically reduce hallucinations in LLMs, achieving top‑rank performance on the HHEM leaderboard and robust improvements across instruction‑following and reasoning‑enhanced models.

Hallucination ControlLLMLLM-as-judge
0 likes · 6 min read
How Reinforcement Learning Cuts Hallucinations in Large Language Models: Ant Insurance’s Proven Approach
DataFunSummit
DataFunSummit
Feb 17, 2024 · Artificial Intelligence

When to Pre‑Train Graph Neural Networks: Data‑Active Pre‑Training and a Graph Generator Framework

This article examines the conditions under which graph neural network pre‑training is beneficial, proposes a data‑centric generator framework to assess transferability, introduces a data‑active pre‑training strategy that selects informative graphs, and presents experimental results showing that using less, well‑chosen data can outperform full‑scale pre‑training.

Pre‑trainingdata selectiongraph generator
0 likes · 16 min read
When to Pre‑Train Graph Neural Networks: Data‑Active Pre‑Training and a Graph Generator Framework
Baobao Algorithm Notes
Baobao Algorithm Notes
Jan 6, 2024 · Artificial Intelligence

How to Pick the Best Fine‑Tuning Data for LLMs with the Nuggets Method

This article explains the Nuggets approach for selecting a high‑quality subset of annotated instructions to fine‑tune large language models, describing its three inputs, the gold‑score computation based on perplexity improvement, empirical results on Alpaca, and practical considerations such as task‑set design.

LLMNuggetsdata selection
0 likes · 7 min read
How to Pick the Best Fine‑Tuning Data for LLMs with the Nuggets Method
Baobao Algorithm Notes
Baobao Algorithm Notes
Dec 11, 2023 · Artificial Intelligence

Boost Large‑Model Fine‑Tuning with Low‑Cost Data Selection and Construction

The article explains practical techniques for choosing and constructing fine‑tuning data for large language models, covering data diversity through similarity‑based clustering, semi‑supervised filtering with binary classifiers, and uncertainty‑driven sampling using perplexity or reward models to build an efficient, low‑cost pipeline.

Large ModelReward modelactive learning
0 likes · 9 min read
Boost Large‑Model Fine‑Tuning with Low‑Cost Data Selection and Construction
Python Crawling & Data Mining
Python Crawling & Data Mining
Apr 24, 2022 · Fundamentals

8 Powerful Pandas Tricks to Master Data Selection

This article presents eight practical pandas data‑selection techniques—including boolean indexing, loc/iloc, isin, str.contains, where/mask, query, filter, and any/all—illustrated with code examples and visual outputs to help Python users efficiently extract and analyze data.

data analysisdata selectionfiltering
0 likes · 9 min read
8 Powerful Pandas Tricks to Master Data Selection