Why Training Large Language Models Feels Like Alchemy—and How to Master It
This article breaks down the hardware bottlenecks of large‑scale LLM training, explains the Roofline performance model and arithmetic intensity, and shows how computation and communication costs interact on GPUs and TPUs, with concrete formulas and examples for efficient scaling.
