Zhihu Tech Column
Author

Zhihu Tech Column

Sharing Zhihu tech posts and exploring community technology innovations.

17
Articles
0
Likes
50
Views
0
Comments
Recent Articles

Latest from Zhihu Tech Column

17 recent articles
Zhihu Tech Column
Zhihu Tech Column
Mar 14, 2025 · Artificial Intelligence

Insights from Zhihu’s ZhiLight Large Model Inference Framework: Architecture, Parallelism, and Performance Optimizations

The article summarizes Zhihu’s technical talk on the ZhiLight large‑model inference framework, detailing model execution mechanisms, GPU load analysis, multi‑GPU parallel strategies, open‑source engine comparisons, compute‑communication overlap, quantization techniques, benchmark results, and future directions for scalable LLM deployment.

GPU parallelismModel InferenceSGLang
0 likes · 11 min read
Insights from Zhihu’s ZhiLight Large Model Inference Framework: Architecture, Parallelism, and Performance Optimizations
Zhihu Tech Column
Zhihu Tech Column
Jan 17, 2025 · Artificial Intelligence

Zhihu Direct Answer: Product Overview and Technical Practices

This article summarizes the key technical insights from Zhihu Direct Answer, an AI-powered search product, covering its product overview, RAG framework, query understanding, retrieval strategies, chunking, reranking, generation techniques, evaluation methods, and engineering optimizations for cost and performance.

AI searchChunkingEngineering Optimization
0 likes · 13 min read
Zhihu Direct Answer: Product Overview and Technical Practices
Zhihu Tech Column
Zhihu Tech Column
Dec 31, 2024 · Cloud Native

Cloud Native Innovation Forum: AutoMQ Table Topic, OceanBase Integrated Database, and Observability Practices

The article recaps Zhihu's Cloud Native Innovation Forum where experts from AutoMQ, OceanBase, and Flashcat shared practical solutions on streaming data ingestion, unified database architectures, and AI‑driven observability, highlighting real‑world deployments, performance optimizations, and cost‑saving strategies.

AIAutoMQMessaging
0 likes · 10 min read
Cloud Native Innovation Forum: AutoMQ Table Topic, OceanBase Integrated Database, and Observability Practices
Zhihu Tech Column
Zhihu Tech Column
Dec 25, 2024 · Databases

Technical Challenges and Solutions for Migrating Zhihu's Self‑Managed MongoDB Cluster to Alibaba Cloud

The article analyzes the storage, sharding, backup, and operational pain points of Zhihu's self‑operated MongoDB cluster, proposes cloud‑based architectural and procedural solutions, and details a step‑by‑step migration plan that ensures zero‑downtime, improved stability, and cost efficiency.

Alibaba CloudDatabase MigrationMongoDB
0 likes · 11 min read
Technical Challenges and Solutions for Migrating Zhihu's Self‑Managed MongoDB Cluster to Alibaba Cloud
Zhihu Tech Column
Zhihu Tech Column
Oct 28, 2024 · Databases

TiDB Architecture, Deployment Choices, and Zhihu’s Database Platform Practices

This article explains TiDB’s distributed relational database architecture, compares Kubernetes‑based and physical‑machine deployments, and describes Zhihu’s platform‑driven approach—including TiDB Operator, TiUP, self‑service tooling, monitoring, backup, security, and FinOps—to enable efficient database management for both developers and DBAs.

Database OperationsFinOpsKubernetes
0 likes · 15 min read
TiDB Architecture, Deployment Choices, and Zhihu’s Database Platform Practices
Zhihu Tech Column
Zhihu Tech Column
Oct 10, 2024 · Artificial Intelligence

Massive Multi-Label Text Classification via Semantic Retrieval and Large AI Model

This article presents a method for massive multi-label text classification on Zhihu content by combining a semantic retrieval model with a proprietary large AI model, detailing the challenges of large label spaces, model architecture, loss optimization, and experimental results showing significant accuracy gains.

BGELarge Language ModelSemantic Retrieval
0 likes · 16 min read
Massive Multi-Label Text Classification via Semantic Retrieval and Large AI Model