Tagged articles
9 articles
Machine Heart
May 6, 2026 · Artificial Intelligence

Can Adaptive Guidance Unlock Small Model Reasoning? Introducing G²RPO‑A

The paper identifies reward sparsity as the core obstacle for small language models in reinforcement‑learning‑based reasoning. It proposes G²RPO‑A, which injects high‑quality thinking trajectories and dynamically adjusts guidance length, and reports large accuracy gains on math and code benchmarks: Qwen3‑1.7B improves from 50.96% to 67.21% on MATH500 and from 46.08% to 75.93% on HumanEval.

Code Generation · G²RPO‑A · adaptive guidance
10 min read
AI Engineering
Mar 3, 2026 · Artificial Intelligence

Alibaba Qwen‑3.5 Small Models: 0.8B Parameters Enable Video on Edge Devices

Alibaba released four Qwen‑3.5 models (0.8B‑9B) that use a Gated DeltaNet hybrid‑attention architecture and native multimodal training to achieve 262k‑token contexts, outperform larger rivals on visual, reasoning, and math benchmarks, and run video analysis on phones and laptops, though they still demand significant VRAM.

Benchmark · Gated DeltaNet · Multimodal AI
6 min read
SuanNi
Mar 2, 2026 · Artificial Intelligence

Can Small Language Models Match Big AI with the Skills Framework?

A recent study from top universities examines how the Skills framework enables small language models to reduce memory usage, improve accuracy, and handle complex industrial tasks, revealing performance gaps across model sizes, dataset challenges, and code‑specialized variants while highlighting cost‑effective deployment strategies.

AI · Industrial AI · Skills Framework
8 min read
Data Party THU
Sep 8, 2025 · Artificial Intelligence

Why Small Language Models Will Dominate Agentic AI by 2025

By 2025, Agentic AI is shifting from massive LLMs to cost‑effective Small Language Models (SLMs), driven by their comparable performance, lower latency, and dramatically reduced inference and fine‑tuning costs, as detailed through market data, model benchmarks, migration steps, and real‑world case studies.

AI · Agentic AI · LLM
6 min read
AI Frontier Lectures
May 9, 2025 · Artificial Intelligence

How Tiny Inference Model Tina Cuts Training Costs by 99.6% with LoRA‑RL

Researchers from ShanghaiTech and USC introduced the compact inference model Tina, which leverages low‑rank adaptation and reinforcement learning to achieve comparable or superior performance to large SOTA models while reducing post‑training and evaluation costs to just $9, a 99.6% savings over traditional approaches.

AI · Cost‑Efficient Inference · low-rank adaptation
12 min read
21CTO
Dec 18, 2024 · Artificial Intelligence

5 AI Engineering Trends Shaping 2024: Agents, Coding Tools, and the Rise of Small Models

The 2024 AI engineering landscape is defined by mature AI coding assistants, the surge of AI agents and agent frameworks such as LangChain and LlamaIndex, the emergence of small, locally‑hosted language models, the solidifying role of AI engineers, and heated debates over what truly counts as open‑source AI.

AI Engineering · AI coding tools · small language models
9 min read
Architect
May 5, 2024 · Artificial Intelligence

The Rise of Small Language Models (SLM) and Their Impact on AI Development

As the performance gap between large and small language models narrows, researchers highlight the efficiency, adaptability, and specialization advantages of small language models (SLMs), while also discussing the high costs, hallucinations, and security concerns that still challenge large language models.

AI efficiency · Edge Computing · LLM
9 min read