Tagged articles
6 articles
Page 1 of 1
Machine Heart
Machine Heart
Apr 19, 2026 · Artificial Intelligence

Are Small Models the Core Component of Agent Systems?

The article analyzes how advancing small‑model capabilities are shifting agent system design from merely checking if a model can run under resource limits to evaluating its suitability for specific tasks, thereby redefining model selection logic and workflow partitioning.

Agent SystemsLLM scalingModel Selection
0 likes · 7 min read
Are Small Models the Core Component of Agent Systems?
Old Meng AI Explorer
Old Meng AI Explorer
Mar 24, 2026 · Industry Insights

Why 2026 Marks the Dawn of AI Agents and Embodied Intelligence

In 2026 the AI industry undergoes a paradigm shift as agents move from demos to large‑scale commercial use, embodied robots enter factories, and compact models reshape efficiency, signaling a new era of AI‑driven productivity across every sector.

AIEmbodied Intelligenceagents
0 likes · 11 min read
Why 2026 Marks the Dawn of AI Agents and Embodied Intelligence
Architect
Architect
Mar 9, 2025 · Artificial Intelligence

Experiments with Reinforcement Learning Fine‑Tuning of a 0.5B Qwen Model on the KK Dataset

The author reports a series of reinforcement‑learning‑based fine‑tuning experiments on a 0.5‑billion‑parameter Qwen‑0.5VB instruct model using the KK dataset, detailing reward design adjustments, curriculum‑style data scaling, observed convergence issues, and hypotheses about why small models fail to develop long reasoning chains.

LLM fine-tuningReinforcement Learningcurriculum learning
0 likes · 11 min read
Experiments with Reinforcement Learning Fine‑Tuning of a 0.5B Qwen Model on the KK Dataset
Infra Learning Club
Infra Learning Club
Jan 2, 2025 · Artificial Intelligence

Three Major LLM Trends in 2025: Ubiquitous Agents, Rising Small Models, and Multimodal Fusion

In 2025, large language models will see three key trends—agents becoming pervasive in daily life and industry, the emergence of efficient small models for edge and specialized tasks, and the integration of multimodal capabilities that combine text, images, and audio to enable more natural human‑machine interaction.

AI trendsLLMMultimodal
0 likes · 4 min read
Three Major LLM Trends in 2025: Ubiquitous Agents, Rising Small Models, and Multimodal Fusion
Baobao Algorithm Notes
Baobao Algorithm Notes
Sep 5, 2024 · Artificial Intelligence

Why Small LLMs Are the Secret Weapon for Scaling Large Model Research

The article explains how homologous small language models—trained on the same tokenizer and data as their large counterparts—serve as cheap, fast experimental platforms that can predict large‑model performance, guide pre‑training decisions, and support techniques like distillation and reward modeling.

AI researchLLM scalingQwen2
0 likes · 13 min read
Why Small LLMs Are the Secret Weapon for Scaling Large Model Research