Tagged articles

model tiering

3 articles

Apr 13, 2026 · Artificial Intelligence

Why Your Tokens Burn Money Fast and How a Four‑Tier Model Stack Can Cut Costs

The article examines the rapid token consumption problem caused by popular LLM agents, proposes a four‑tier model hierarchy and concrete routing rules, and offers short‑term, long‑term, and budget‑friendly deployment recommendations to reduce expenses while maintaining performance.

LLM · Multi‑model deployment · model tiering

0 likes · 7 min read

Senior Tony

Apr 5, 2026 · Artificial Intelligence

How to Impress Interviewers with Smart Token‑Optimization Strategies for LLMs

The article explains why simply switching to cheaper large language models fails in interviews and outlines five practical techniques—prompt simplification, context management, output control, model tiering, and caching—to reduce token consumption while preserving answer quality.

Caching · Interview Tips · LLM

0 likes · 5 min read

Kuaishou Tech

Jul 15, 2021 · Artificial Intelligence

Kuaishou Y-tech AI SDK Framework: Secrets Behind Mass Production of Special Effects

The article details the architecture of Kuaishou's Y-tech AI SDK (YKit): its design for delivering computer vision capabilities, its performance optimization strategies for mobile devices, and real-world case studies such as GAN-based effects and intelligent matting. It closes with the remaining challenges and future directions.

AI SDK · GAN Effects · Kuaishou

0 likes · 14 min read