Tagged articles

small language models

10 articles

Jun 27, 2026 · Artificial Intelligence

Large vs Small Language Models: An Apple‑Centric Technical Comparison

The article analyses how deployment targets, inference economics, and training budgets drive divergent design choices for large (LLM) and small (SLM) Transformer‑based language models, covering architecture tweaks, data‑centric training methods, quantisation, KV‑cache management, and hybrid routing strategies for production systems.

Inference Optimization · Quantization · Transformer architecture

0 likes · 16 min read

Machine Heart

May 6, 2026 · Artificial Intelligence

Can Adaptive Guidance Unlock Small Model Reasoning? Introducing G²RPO‑A

The paper identifies reward sparsity as the core obstacle for small language models in reinforcement‑learning‑based reasoning and proposes G²RPO‑A, which injects high‑quality thinking trajectories and dynamically adjusts guidance length. It demonstrates large accuracy gains on math and code benchmarks: for example, Qwen3‑1.7B improves from 50.96% to 67.21% on MATH500 and from 46.08% to 75.93% on HumanEval.

G²RPO‑A · adaptive guidance · code generation

0 likes · 10 min read

AI Engineering

Mar 3, 2026 · Artificial Intelligence

Alibaba Qwen‑3.5 Small Models: 0.8B Parameters Enable Video on Edge Devices

Alibaba released four Qwen‑3.5 models (0.8B‑9B) that use a Gated DeltaNet hybrid‑attention architecture and native multimodal training to achieve 262k‑token contexts, outperform larger rivals on visual, reasoning, and math benchmarks, and run video analysis on phones and laptops, though they still demand significant VRAM.

Gated DeltaNet · Multimodal AI · Qwen3.5

0 likes · 6 min read

SuanNi

Mar 2, 2026 · Artificial Intelligence

Can Small Language Models Match Big AI with the Skills Framework?

A recent study from top universities examines how the Skills framework enables small language models to reduce memory usage, improve accuracy, and handle complex industrial tasks, revealing performance gaps across model sizes, dataset challenges, and code‑specialized variants while highlighting cost‑effective deployment strategies.

AI · Model Efficiency · Skills Framework

0 likes · 8 min read

Architects Research Society

Jan 3, 2026 · Artificial Intelligence

2026: The Year AI Shifts from Scaling Hype to Practical, Small‑Model Innovation

The article forecasts that by 2026 AI will move away from sheer scale‑driven breakthroughs toward more usable, smaller models, world‑model learning, robust agents, and physical integration, emphasizing practical utility, augmentation of human work, and new job opportunities.

AI · Augmentation · physical AI

0 likes · 7 min read

PaperAgent

Dec 11, 2025 · Artificial Intelligence

Which Small Language Model Wins After Fine‑Tuning? A Data‑Driven Benchmark

A comprehensive benchmark fine‑tunes twelve small language models on eight diverse tasks, compares them against a 120B teacher model, and reveals which models excel overall, which are most "plastic" for improvement, and how small models can rival much larger ones.

AI · LLM · benchmark

0 likes · 11 min read

Data Party THU

Sep 8, 2025 · Artificial Intelligence

Why Small Language Models Will Dominate Agentic AI by 2025

In 2025, agentic AI is shifting from massive LLMs to cost‑effective small language models (SLMs), a move driven by SLMs' comparable performance, lower latency, and dramatically reduced inference and fine‑tuning costs. The article supports this with market data, model benchmarks, migration steps, and real‑world case studies.

AI · Agentic AI · LLM

0 likes · 6 min read

AI Frontier Lectures

May 9, 2025 · Artificial Intelligence

How Tiny Inference Model Tina Cuts Training Costs by 99.6% with LoRA‑RL

Researchers from ShanghaiTech and USC introduced the compact inference model Tina, which leverages low‑rank adaptation and reinforcement learning to achieve comparable or superior performance to large SOTA models while reducing post‑training and evaluation costs to just $9, a 99.6% savings over traditional approaches.

AI · Cost‑Efficient Inference · low-rank adaptation

0 likes · 12 min read

21CTO

Dec 18, 2024 · Artificial Intelligence

5 AI Engineering Trends Shaping 2024: Agents, Coding Tools, and the Rise of Small Models

The 2024 AI engineering landscape is defined by mature AI coding assistants, the surge of AI agent frameworks like LangChain and LlamaIndex, the emergence of small, locally‑hosted language models, the solidifying role of AI engineers, and heated debates over what truly counts as open‑source AI.

AI Engineering · AI coding tools · small language models

0 likes · 9 min read

Architect

May 5, 2024 · Artificial Intelligence

The Rise of Small Language Models (SLM) and Their Impact on AI Development

As the performance gap between large and small language models narrows, researchers highlight the efficiency, adaptability, and specialized advantages of small language models (SLMs), while also discussing the high costs, hallucinations, and security concerns that still challenge large‑scale LLMs.

AI efficiency · LLM · Model Scaling

0 likes · 9 min read
