Tagged articles

small models

8 articles · Page 1 of 1

Jun 26, 2026 · Artificial Intelligence

How Small Models, Edge Agents, and Multimodal Interaction Power the Next Wave of Computer Use

The article examines the 2025‑2026 surge of sub‑10B parameter models, outlines edge‑side AI agents, details multimodal interaction layers, and explains how Computer Use enables AI to operate computers, offering a roadmap for integrating these technologies across mobile, desktop, and IoT scenarios.

AI agentscomputer useedge AI

0 likes · 17 min read

How Small Models, Edge Agents, and Multimodal Interaction Power the Next Wave of Computer Use

AI Engineer Programming

Jun 8, 2026 · Artificial Intelligence

When to Use Small Models: A System Design Perspective

Small models are chosen based on deployment constraints rather than absolute parameter counts; the article outlines how resource limits, latency, cost, privacy, and task characteristics define their suitability, compares their strengths and weaknesses to large models, and offers system‑level design patterns for effective use.

LLM deploymentRAGinference optimization

0 likes · 20 min read

When to Use Small Models: A System Design Perspective

Machine Heart

Apr 19, 2026 · Artificial Intelligence

Are Small Models the Core Component of Agent Systems?

The article analyzes how advancing small‑model capabilities are shifting agent system design from merely checking if a model can run under resource limits to evaluating its suitability for specific tasks, thereby redefining model selection logic and workflow partitioning.

LLM scalingagent systemsmodel selection

0 likes · 7 min read

Are Small Models the Core Component of Agent Systems?

Old Meng AI Explorer

Mar 24, 2026 · Industry Insights

Why 2026 Marks the Dawn of AI Agents and Embodied Intelligence

In 2026 the AI industry undergoes a paradigm shift as agents move from demos to large‑scale commercial use, embodied robots enter factories, and compact models reshape efficiency, signaling a new era of AI‑driven productivity across every sector.

AIagentsembodied intelligence

0 likes · 11 min read

Why 2026 Marks the Dawn of AI Agents and Embodied Intelligence

Architect

Mar 9, 2025 · Artificial Intelligence

Experiments with Reinforcement Learning Fine‑Tuning of a 0.5B Qwen Model on the KK Dataset

The author reports a series of reinforcement‑learning‑based fine‑tuning experiments on a 0.5‑billion‑parameter Qwen‑0.5VB instruct model using the KK dataset, detailing reward design adjustments, curriculum‑style data scaling, observed convergence issues, and hypotheses about why small models fail to develop long reasoning chains.

LLM fine-tuningReinforcement Learningcurriculum learning

0 likes · 11 min read

Experiments with Reinforcement Learning Fine‑Tuning of a 0.5B Qwen Model on the KK Dataset

Infra Learning Club

Jan 2, 2025 · Artificial Intelligence

Three Major LLM Trends in 2025: Ubiquitous Agents, Rising Small Models, and Multimodal Fusion

In 2025, large language models will see three key trends—agents becoming pervasive in daily life and industry, the emergence of efficient small models for edge and specialized tasks, and the integration of multimodal capabilities that combine text, images, and audio to enable more natural human‑machine interaction.

AI trendsLLMagents

0 likes · 4 min read

Three Major LLM Trends in 2025: Ubiquitous Agents, Rising Small Models, and Multimodal Fusion

Baobao Algorithm Notes

Sep 5, 2024 · Artificial Intelligence

Why Small LLMs Are the Secret Weapon for Scaling Large Model Research

The article explains how homologous small language models—trained on the same tokenizer and data as their large counterparts—serve as cheap, fast experimental platforms that can predict large‑model performance, guide pre‑training decisions, and support techniques like distillation and reward modeling.

AI ResearchLLM scalingQwen2

0 likes · 13 min read

Why Small LLMs Are the Secret Weapon for Scaling Large Model Research

NewBeeNLP

Jul 15, 2024 · Artificial Intelligence

How to Build and Train Sub‑1B Language Models from Scratch: Resources & Tips

This guide compiles open‑source repositories, research papers, and practical tricks for training miniature large‑language models under 1 billion parameters, helping readers learn by reproducing models like nanoGPT, tinyLlama, Phi‑1.5, and more.

LLMTrainingnanoGPT

0 likes · 7 min read

How to Build and Train Sub‑1B Language Models from Scratch: Resources & Tips