SuanNi
Author

A community for AI developers that aggregates large-model development services, models, and compute power.

142 articles · 0 likes · 19 views · 0 comments
Recent Articles

Latest from SuanNi

100 recent articles max
SuanNi
Apr 28, 2026 · Artificial Intelligence

Why Your AI Agent Fails and How Skills Can Fix It

The article argues that monolithic AI agents suffer from stability, extensibility, and knowledge‑retention problems, and proposes a modular "Skills" architecture—analogous to a microkernel OS—that turns expertise into reusable, version‑controlled assets, enabling cross‑platform deployment, better human‑AI collaboration, and reshaping the labor market.

AI agents · cross‑platform AI · human‑AI collaboration
0 likes · 8 min read
SuanNi
Apr 27, 2026 · Artificial Intelligence

How MIT’s RUBICON Cuts AI Agent Costs by 90% While Achieving 100% Accuracy

The paper shows that conventional LLM agents fail on real‑world enterprise data because of chaotic data sources, while the RUBICON architecture uses a minimal Agentic Query Language to let users direct data retrieval, achieving 100% accuracy with a much cheaper model and dramatically lower token and monetary costs.

Agentic Query Language · Data integration · Enterprise AI
0 likes · 11 min read
SuanNi
Apr 27, 2026 · Artificial Intelligence

Agent Skills Explained: Definition, Structure, and Engineering Practices

This article breaks down the official Anthropic definition of Agent Skills, shows how they are simple file‑system‑based, composable units stored in SKILL.md, scripts, references and assets, and explains the three‑layer progressive‑disclosure loading model, discovery, selection, execution, composition patterns, security, version‑control integration and evaluation practices.

AI · Agent Skills · Composable
0 likes · 14 min read
SuanNi
Apr 26, 2026 · Artificial Intelligence

Xiaomi’s MiMo‑V2.5: Halving Cost, Doubling Efficiency with a New Multimodal LLM

Xiaomi unveiled the MiMo‑V2.5 and MiMo‑V2.5‑Pro large language models, highlighting up to 50% lower API cost, multimodal perception, token‑efficiency gains, benchmark superiority over Claude Opus 4.6 and GPT‑5.4, and real‑world demos that built a full compiler in 4.3 hours and a video‑editing web app in 11.5 hours.

AI Agent · Large Language Model · MiMo-V2.5
0 likes · 6 min read
SuanNi
Apr 25, 2026 · Artificial Intelligence

Is Tencent’s Large Model Lagging? How Hy3‑preview Propels It Into the Top Tier

Tencent’s AI division rebuilt its Hunyuan model from the ground up, releasing the 295‑billion‑parameter Hy3‑preview with a fast‑slow hybrid expert architecture, extensive internal benchmarks, and strong performance on scientific, coding, and real‑world tasks, marking a decisive leap into the leading LLM tier.

Agent · Hy3-preview · Large Language Model
0 likes · 7 min read
SuanNi
Apr 25, 2026 · Artificial Intelligence

Mastering Google’s AI Agent Protocols to Build Industrial‑Grade Applications

The article walks through Google’s AI Agent Development Kit, explains six open communication protocols, demonstrates their integration in a restaurant supply‑chain use case, extracts five architecture rules from the Google Cloud AI Agent Bake‑Off, and shows how to build production‑ready, modular AI agents without custom glue code.

AI agents · Google AI · Microservices
0 likes · 15 min read
SuanNi
Apr 24, 2026 · Artificial Intelligence

Why GPT‑5.5 Beats Opus 4.7 and Sets a New Global SOTA

OpenAI’s newly released GPT‑5.5, marketed as a “next‑generation AI for real work,” outperforms competitors across coding, knowledge‑work, and scientific research benchmarks—achieving 82.7% accuracy on Terminal‑Bench 2.0, 58.6% on SWE‑Bench Pro, 84.9% on GDPval, and 98.0% on Tau2‑bench Telecom—while offering higher token efficiency and new pricing tiers.

AI Agent · GPT-5.5 · OpenAI
0 likes · 11 min read
SuanNi
Apr 24, 2026 · Artificial Intelligence

DeepSeek-V4 Launches: Million-Token Context Becomes Affordable for All

DeepSeek-V4 introduces a hybrid attention architecture, manifold‑constrained hyper‑connections, and the Muon optimizer to cut inference FLOPs and KV cache dramatically, enabling open‑source models to handle million‑token contexts at a fraction of the cost of leading closed‑source services while matching their performance.

DeepSeek V4 · Hybrid attention · Large Language Model
0 likes · 7 min read