Tagged articles
32 articles
Page 1 of 1
Java Tech Enthusiast
Java Tech Enthusiast
Feb 28, 2025 · Industry Insights

Why Zhihu Forced Login and What It Means for Users and AI Training

The article examines how Chinese regulations curbed forced app downloads, yet Zhihu still blocked full content for non‑logged‑in users, sparking user backlash, speculation about AI data protection, and ultimately leading the platform to lift the login barrier.

AI data scrapingChina tech policyUser experience
0 likes · 5 min read
Why Zhihu Forced Login and What It Means for Users and AI Training
Zhihu Tech Column
Zhihu Tech Column
Oct 10, 2024 · Artificial Intelligence

Massive Multi-Label Text Classification via Semantic Retrieval and Large AI Model

This article presents a method for massive multi-label text classification on Zhihu content by combining a semantic retrieval model with a proprietary large AI model, detailing the challenges of large label spaces, model architecture, loss optimization, and experimental results showing significant accuracy gains.

BGElarge language modelmulti-label classification
0 likes · 16 min read
Massive Multi-Label Text Classification via Semantic Retrieval and Large AI Model
DataFunTalk
DataFunTalk
Dec 8, 2023 · Big Data

Zhihu Bridge Platform: Architecture, Capabilities, and Future Trends of Content Operations

This article presents a comprehensive overview of Zhihu's Bridge platform, detailing its content‑operation architecture—including content pool, management, analysis, monitoring, and intervention modules—explaining the underlying streaming and batch technologies such as Flink, Doris, and Elasticsearch, and outlining future automation and AI‑driven workflow directions.

AIBig DataStreaming
0 likes · 17 min read
Zhihu Bridge Platform: Architecture, Capabilities, and Future Trends of Content Operations
Python Programming Learning Circle
Python Programming Learning Circle
May 23, 2022 · Backend Development

Simulating Zhihu Login with Python Using urllib and Fiddler

This article demonstrates how to automate Zhihu login on Windows by analyzing network traffic with Fiddler, extracting required parameters, and implementing a Python script that builds HTTP requests using urllib2, handles cookies, captcha retrieval, and logs the results, complete with sample code and execution screenshots.

FiddlerHTTPLogin Automation
0 likes · 8 min read
Simulating Zhihu Login with Python Using urllib and Fiddler
Baidu MEUX
Baidu MEUX
Feb 15, 2022 · Product Management

How Zhihu’s Design Team Turned Text Content into a Video‑Driven Business

This article examines Zhihu’s evolution from a text‑centric platform to a video‑enabled product, detailing the design team’s role in content flow, the creation of video‑answer and video‑entity formats, the development of rapid‑layout tools, the shift to a membership model, and the ethical considerations designers must balance between user experience and commercial goals.

Content MonetizationDesign ManagementVideo Strategy
0 likes · 10 min read
How Zhihu’s Design Team Turned Text Content into a Video‑Driven Business
DataFunSummit
DataFunSummit
Aug 29, 2021 · Artificial Intelligence

Zhihu Recommendation Page Ranking: Architecture, Feature Design, Model Evolution, and Practical Insights

This article presents a comprehensive overview of Zhihu's recommendation page ranking system, detailing the request flow, ranking evolution from time‑based to deep‑learning models, feature engineering strategies, model architectures such as DNN, DeepFM, DIN, multi‑task learning, and lessons learned for production deployment.

CTRfeature engineeringmachine learning
0 likes · 12 min read
Zhihu Recommendation Page Ranking: Architecture, Feature Design, Model Evolution, and Practical Insights
Architecture Digest
Architecture Digest
May 15, 2021 · Backend Development

Design and Migration of Zhihu's Read Service: High Availability, Performance, and TiDB Adoption

This article details Zhihu's read‑service architecture, covering its business requirements, high‑availability and high‑performance design goals, key components such as Proxy, Cache and Storage, extensive performance metrics, the migration from MySQL to TiDB, and the benefits brought by TiDB 3.0 features.

Backend ArchitectureScalabilityTiDB
0 likes · 18 min read
Design and Migration of Zhihu's Read Service: High Availability, Performance, and TiDB Adoption
Big Data Technology & Architecture
Big Data Technology & Architecture
Sep 11, 2019 · Big Data

Evolution of Zhihu's Real-Time Data Warehouse: From Spark Streaming 1.0 to Flink‑Based 2.0

This article details Zhihu's real‑time data warehouse evolution, describing the 1.0 Spark Streaming architecture, its limitations, and the 2.0 redesign that introduces Flink, layered data models, streaming and batch ETL, metric storage choices, and future roadmap for scalable, low‑latency analytics.

FlinkLambda architectureSpark Streaming
0 likes · 19 min read
Evolution of Zhihu's Real-Time Data Warehouse: From Spark Streaming 1.0 to Flink‑Based 2.0
58UXD
58UXD
Aug 2, 2019 · Product Management

How Zhihu’s Product Designers Turn Ideas into Impactful Features

This article explores Zhihu’s product design workflow, from requirement gathering and solution planning through design output and effect evaluation, highlighting the role of intuition, visual communication, metric-driven assessment, and how design uniquely adds business value and bridges user experience with product strategy.

Product DesignUser experiencedesign metrics
0 likes · 16 min read
How Zhihu’s Product Designers Turn Ideas into Impactful Features
21CTO
21CTO
Jul 6, 2019 · Mobile Development

How Zhihu Accelerated Mobile Ad Updates with the Morph DSL Native+ Solution

Zhihu’s Morph dynamic solution, built on a Flexbox‑based DSL and JSON, enabled rapid, cross‑platform updates of mobile ad cards, dramatically reducing rollout time from eight days to one and supporting over 70 styles with minimal impact on app size and performance.

Ad TechDSLDynamic UI
0 likes · 20 min read
How Zhihu Accelerated Mobile Ad Updates with the Morph DSL Native+ Solution
Youku Technology
Youku Technology
May 20, 2019 · Big Data

Data‑Driven Dating Guide: Analyzing Zhihu Answers to Identify Potential Partners

In a playful data‑driven experiment, the author scraped 27,664 Zhihu answers to “What are your dating criteria?”, filtered out short, outdated, high‑profile or already‑matched posts, applied follower‑and engagement‑thresholds to narrow the pool to 480 candidates, then ranked the top 30 by a like‑to‑comment ratio, sharing the code and dataset for reproducibility.

data analysisdatingfiltering
0 likes · 8 min read
Data‑Driven Dating Guide: Analyzing Zhihu Answers to Identify Potential Partners
360 Tech Engineering
360 Tech Engineering
May 20, 2019 · Fundamentals

A Data‑Driven Guide to Finding a Partner: From Crawling Zhihu Answers to Ranking Candidates

This article walks through a complete data‑analysis workflow—scraping Zhihu dating‑preference answers, cleaning and filtering the data, deriving gender and activity metrics, designing a four‑step screening process, and finally ranking candidates with a custom like‑to‑comment index—to help a single programmer create a concise, high‑quality list of potential partners.

MetricsWeb Scrapingdata analysis
0 likes · 9 min read
A Data‑Driven Guide to Finding a Partner: From Crawling Zhihu Answers to Ranking Candidates
MaGe Linux Operations
MaGe Linux Operations
Nov 14, 2017 · Backend Development

How to Use Scrapy to Crawl Zhihu Users and Analyze Their Data

This tutorial explains how a Python developer can set up a Scrapy project, write spiders to crawl Zhihu user profiles, store the results in a MySQL database, adjust settings for headers and delays, and finally perform simple gender and location analysis on the collected data.

Backend DevelopmentPythonScrapy
0 likes · 14 min read
How to Use Scrapy to Crawl Zhihu Users and Analyze Their Data
21CTO
21CTO
Nov 7, 2017 · Big Data

What 3.3 Million Zhihu Users Reveal About Gender, Location, and Careers

Analyzing over 3.2 million publicly available Zhihu profiles collected via a distributed Python crawler, this report uncovers gender balance near 1:1, top residential cities, dominant occupations, university participation, and the most followed and active contributors, while noting data limitations and temporal relevance.

User Demographicsdata analysiszhihu
0 likes · 11 min read
What 3.3 Million Zhihu Users Reveal About Gender, Location, and Careers
MaGe Linux Operations
MaGe Linux Operations
Jul 10, 2017 · Backend Development

How to Build a Zhihu Crawler with Python, ELK, and Visual Analytics

This article walks through creating a Python-based Zhihu web crawler, detailing the tech stack, data collection, visualization of user demographics and top contributors, the crawler architecture, authorization handling, and suggestions for performance and storage improvements.

ELKWeb Scrapingzhihu
0 likes · 6 min read
How to Build a Zhihu Crawler with Python, ELK, and Visual Analytics
MaGe Linux Operations
MaGe Linux Operations
May 23, 2017 · Backend Development

How to Build a Python Zhihu Web Scraper: Login, User Data, and More

This article walks through building a Python web scraper for Zhihu, covering login simulation, extracting user profiles, answer likers, followers, avatars, and all answers of a question, and storing the collected data in SQLite, while highlighting challenges like captcha and anti‑scraping limits.

SQLitebeautifulsoupdata-extraction
0 likes · 10 min read
How to Build a Python Zhihu Web Scraper: Login, User Data, and More
Huawei Cloud Developer Alliance
Huawei Cloud Developer Alliance
May 28, 2016 · Backend Development

Extract All @Mentions from a Zhihu Page with Simple Scripts

This guide shows how to collect every @mentioned user on a Zhihu question page by using a JavaScript bookmarklet or a Python script, explains the extraction process, provides the necessary code snippets, and discusses why following programmers on Zhihu may not be the most effective learning method.

JavaScriptPythonWeb Scraping
0 likes · 6 min read
Extract All @Mentions from a Zhihu Page with Simple Scripts