Tagged articles
3 articles
Page 1 of 1
AI Engineering
AI Engineering
Apr 13, 2026 · Artificial Intelligence

Why Your Tokens Burn Money Fast and How a Four‑Tier Model Stack Can Cut Costs

The article examines the rapid token consumption problem caused by popular LLM agents, proposes a four‑tier model hierarchy and concrete routing rules, and offers short‑term, long‑term, and budget‑friendly deployment recommendations to reduce expenses while maintaining performance.

LLMMulti‑model deploymentToken Cost
0 likes · 7 min read
Why Your Tokens Burn Money Fast and How a Four‑Tier Model Stack Can Cut Costs
Senior Tony
Senior Tony
Apr 5, 2026 · Artificial Intelligence

How to Impress Interviewers with Smart Token‑Optimization Strategies for LLMs

The article explains why simply switching to cheaper large language models fails in interviews and outlines five practical techniques—prompt simplification, context management, output control, model tiering, and caching—to reduce token consumption while preserving answer quality.

Interview TipsLLMToken Optimization
0 likes · 5 min read
How to Impress Interviewers with Smart Token‑Optimization Strategies for LLMs
Kuaishou Tech
Kuaishou Tech
Jul 15, 2021 · Artificial Intelligence

Kuaishou Y-tech AI SDK Framework: Secrets Behind Mass Production of Special Effects

The article details Kuaishou's Y-tech AI SDK (YKit) architecture, covering its design for computer vision capabilities, performance optimization strategies for mobile devices, and real-world case studies such as GAN-based effects and intelligent matting, outlining challenges and future directions.

AI SDKGAN EffectsKuaishou
0 likes · 14 min read
Kuaishou Y-tech AI SDK Framework: Secrets Behind Mass Production of Special Effects