Tagged articles
521 articles
Page 5 of 6
58UXD
58UXD
Apr 15, 2020 · Product Management

How Designers Can Leverage Data Analysis to Solve Real‑World Problems

Designers often overlook data analysis, yet mastering it helps identify problems, set measurable goals, generate hypotheses, collect and interpret metrics, and draw actionable conclusions, ultimately guiding product decisions and improving conversion rates across various scenarios such as page performance, financial calculations, and insurance plan selection.

MetricsProduct DesignUX
0 likes · 12 min read
How Designers Can Leverage Data Analysis to Solve Real‑World Problems
Full-Stack Internet Architecture
Full-Stack Internet Architecture
Mar 17, 2020 · Fundamentals

A Real‑Life Example of User Profiling to Boost Sales

This article uses a vivid kite‑selling story to illustrate how user profiling, data tagging, and recommendation tactics can be combined to increase transaction volume, improve average order value, and avoid common pitfalls such as unclear goals, poor data quality, and unvalidated tags.

Data QualityMarketing Strategydata analysis
0 likes · 9 min read
A Real‑Life Example of User Profiling to Boost Sales
Xianyu Technology
Xianyu Technology
Mar 10, 2020 · Industry Insights

How Intelligent Slice Analysis Transforms Xianyu’s Operational Decision‑Making

This article explains how the Nanomirror data analysis platform introduces intelligent slice analysis—both activity‑metric and AB‑bucket slice analyses—to uncover the most impactful user segments, guide targeted interventions, and improve operational efficiency for Xianyu’s online campaigns.

AB testingNanomirrorOperational Insights
0 likes · 9 min read
How Intelligent Slice Analysis Transforms Xianyu’s Operational Decision‑Making
DevOps
DevOps
Mar 10, 2020 · Backend Development

Analyzing Git Repository Commit Statistics with libgit2sharp, .NET Core, and PowerBI

This article describes how to collect, process, and visualize Git commit data—including hashes, authors, dates, messages, and line changes—by using native git commands, the libgit2sharp .NET library, a custom cross‑platform CLI tool, and PowerBI dashboards for comprehensive reporting.

.NET CoreBackendCLI
0 likes · 8 min read
Analyzing Git Repository Commit Statistics with libgit2sharp, .NET Core, and PowerBI
FunTester
FunTester
Feb 19, 2020 · Operations

Turning Raw Performance Test Logs into Readable Text Charts with Groovy

This article explains how to replace cumbersome Python‑Plotly visualizations of performance test logs with a lightweight Groovy solution that generates plain‑text bar charts directly in the shell or email, using Unicode block characters and a bucket‑based median algorithm.

GroovyPerformance TestingShell
0 likes · 6 min read
Turning Raw Performance Test Logs into Readable Text Charts with Groovy
360 Tech Engineering
360 Tech Engineering
Feb 13, 2020 · Big Data

COVID-19 Daily Report and Data Analysis – February 12, 2020

The February 12, 2020 COVID‑19 daily report details a sharp rise in national confirmed cases to 59,804, explains the inclusion of clinically diagnosed cases in Hubei, presents provincial death and cure rates, and offers extensive data‑driven analyses of trends, infection coefficients, and regional transmission dynamics.

COVID-19ChinaInfection Rate
0 likes · 11 min read
COVID-19 Daily Report and Data Analysis – February 12, 2020
Python Programming Learning Circle
Python Programming Learning Circle
Feb 3, 2020 · Fundamentals

Master Pandas: Essential Data Manipulation Techniques for Python Beginners

This guide introduces pandas, the essential Python library for data science, covering installation, data import/export, basic DataFrame operations, logical filtering, visualization with matplotlib, performance tips using tqdm, and advanced techniques like merging, grouping, and iterating, helping beginners become efficient data analysts.

data analysisdata manipulationpandas
0 likes · 8 min read
Master Pandas: Essential Data Manipulation Techniques for Python Beginners
ITPUB
ITPUB
Jan 21, 2020 · Fundamentals

Mastering Data Queries: Pandas vs SQL – A Step‑by‑Step Comparison

This tutorial walks data analysts through a side‑by‑side comparison of common data‑manipulation tasks using pandas in Python and SQL, covering everything from basic selects and filters to joins, aggregations, unions, ordering, case expressions, and data updates with clear code examples.

ComparisonTutorialdata analysis
0 likes · 15 min read
Mastering Data Queries: Pandas vs SQL – A Step‑by‑Step Comparison
Programmer DD
Programmer DD
Jan 5, 2020 · Fundamentals

Why Every Engineer Must Master Business Knowledge to Stay Relevant

The article argues that programmers who only code are becoming replaceable, emphasizing that mastering real-world business processes, data-driven decision making, and continuous curiosity is essential for engineers to remain valuable and drive meaningful product outcomes.

Career DevelopmentSoftware Engineeringbusiness
0 likes · 11 min read
Why Every Engineer Must Master Business Knowledge to Stay Relevant
Youzan Coder
Youzan Coder
Dec 30, 2019 · Operations

How to Measure and Improve Project Efficiency: A Practical Guide

This article explains why measurement is essential for management, outlines a step‑by‑step process for collecting and analyzing efficiency metrics, and shows how to turn data‑driven insights into concrete conclusions and actionable improvement plans for software projects.

Continuous ImprovementOperationsProject Management
0 likes · 10 min read
How to Measure and Improve Project Efficiency: A Practical Guide
Meituan Technology Team
Meituan Technology Team
Nov 21, 2019 · Big Data

Designing a Platformized Jupyter Service Integrated with Spark for Meituan

Meituan Homestay created a platform‑wide Jupyter service built on JupyterHub and Kubernetes that integrates Spark, scheduling, documentation and storage, providing seamless, reproducible notebooks with custom extensions, magics and container isolation to unify data analysis, model training and production workflows.

Big DataJupyterKubernetes
0 likes · 19 min read
Designing a Platformized Jupyter Service Integrated with Spark for Meituan
FunTester
FunTester
Nov 14, 2019 · Backend Development

Web Scraping CBA Match Data with Java: Methodology and Full Code Example

This article explains how to scrape Chinese Basketball Association (CBA) match data from a portal website, analyzes the page structure, extracts table rows using regular expressions, converts them to CSV format, and provides a complete Java/Groovy code example for automated data collection.

CBACSVJava
0 likes · 8 min read
Web Scraping CBA Match Data with Java: Methodology and Full Code Example
MaGe Linux Operations
MaGe Linux Operations
Oct 5, 2019 · Fundamentals

How to Scrape and Analyze Holiday Tourist Spot Data with Python

This tutorial walks you through using Python to collect tourism data from Qunar, extract key fields such as name, price, and rating, store the results in Excel with pandas, and visualize sales and popularity trends using pyecharts, including a simple recommendation algorithm.

PyechartsPythonTourism
0 likes · 8 min read
How to Scrape and Analyze Holiday Tourist Spot Data with Python
Snowball Engineer Team
Snowball Engineer Team
Sep 24, 2019 · Big Data

Snowball Data Middle Platform (AIBO): Architecture, Capabilities, and Future Outlook

The article introduces Snowball's AIBO data middle platform, detailing its storage‑compute separation architecture, core capabilities such as data integration, catalog, tagging, analysis tools, micro‑service data APIs, and outlines future enhancements for security, lineage, and continuous business‑driven iteration.

Big DataData CatalogData Integration
0 likes · 12 min read
Snowball Data Middle Platform (AIBO): Architecture, Capabilities, and Future Outlook
DataFunTalk
DataFunTalk
Sep 3, 2019 · Big Data

The Value of Big Data in Machine Learning: Detailed Illustration and Insights

This article explains how big data enhances machine learning by enabling finer-grained data characterization, improving confidence in statistical conclusions, and supporting smarter learning through multiple stages of model development, illustrated with concrete examples and a discussion of sample size dilemmas.

Big Datadata analysismachine learning
0 likes · 10 min read
The Value of Big Data in Machine Learning: Detailed Illustration and Insights
Python Crawling & Data Mining
Python Crawling & Data Mining
Aug 20, 2019 · Fundamentals

Master Jupyter Notebook: A Step‑by‑Step Data Analysis Guide for Beginners

Learn how to install Jupyter via Anaconda or pip, create and manage notebooks, understand cells and kernels, write and run Python code, explore a Fortune 500 dataset with pandas, clean missing values, and visualize profit and revenue trends using matplotlib and seaborn—all illustrated with screenshots and code snippets.

Jupyter NotebookMatplotlibPython
0 likes · 15 min read
Master Jupyter Notebook: A Step‑by‑Step Data Analysis Guide for Beginners
Python Crawling & Data Mining
Python Crawling & Data Mining
Aug 13, 2019 · Fundamentals

Unlock Jupyter Notebook Power: Shell Commands, Magic, Logging & Seaborn Tricks

This guide explores advanced Jupyter Notebook techniques, including using shell commands, line and cell magic commands, autosave configuration, timing execution, logging customization, running external scripts, integrating Seaborn for enhanced visualizations, managing databases with ipython-sql, and extending functionality with plugins.

JupyterMagic CommandsPython
0 likes · 17 min read
Unlock Jupyter Notebook Power: Shell Commands, Magic, Logging & Seaborn Tricks
Python Crawling & Data Mining
Python Crawling & Data Mining
Aug 11, 2019 · Fundamentals

How to Scrape and Visualize a Year of Global Earthquake Data with Python

This tutorial walks you through discovering a real‑time earthquake data source, analyzing its pagination API, building a Python scraper with requests, parsing the JSON responses, storing the results in CSV, and performing visual analyses such as top‑magnitude locations, monthly frequency, repeat‑site counts, magnitude distribution, and word‑cloud generation.

CSVEarthquake DataPython
0 likes · 8 min read
How to Scrape and Visualize a Year of Global Earthquake Data with Python
Tencent Advertising Technology
Tencent Advertising Technology
Jun 13, 2019 · Artificial Intelligence

Competition Solution Overview: Data Analysis, Rule‑Based and Neural Network Models for Advertising Prediction

The article details a contestant's end‑to‑end approach for an advertising competition, covering data analysis, rule‑based preprocessing, a three‑layer neural network architecture, model‑rule ensemble weighting, self‑correction strategies for the B phase, and final model‑only solutions that achieved top scores.

AdvertisingNeural Networkcompetition
0 likes · 8 min read
Competition Solution Overview: Data Analysis, Rule‑Based and Neural Network Models for Advertising Prediction
NetEase Media Technology Team
NetEase Media Technology Team
Jun 5, 2019 · Product Management

Mastering AB Testing: From Basics to Scalable Multi‑Layer Architecture

This article explains the fundamentals of AB testing, outlines the iterative workflow, shares best‑practice guidelines, compares single‑layer and multi‑layer experiment frameworks, and details the technical implementation—including SDK design, hashing algorithms, data denoising, and statistical evaluation methods.

AB testingBackendHashing
0 likes · 15 min read
Mastering AB Testing: From Basics to Scalable Multi‑Layer Architecture
58UXD
58UXD
May 29, 2019 · Product Management

How Data Analysis Drives User Growth: From AARRR Funnel to Practical Tools

This article explains fundamental data‑analysis methods, introduces the AARRR user‑growth model with key metrics for each stage, and presents practical tools such as user path analysis, funnel conversion, heatmaps, and A/B testing to help product teams make data‑driven decisions and continuously improve user experience.

AARRRAB testingUser experience
0 likes · 9 min read
How Data Analysis Drives User Growth: From AARRR Funnel to Practical Tools
MaGe Linux Operations
MaGe Linux Operations
May 28, 2019 · Big Data

Recreating Google Ngram Trends with Python, PyTubes, and NumPy

This article demonstrates how to download the Google 1‑gram dataset, load and filter billions of rows with the PyTubes library, compute yearly word frequencies using NumPy, and reproduce the classic Python usage trend chart while discussing performance considerations and future improvements.

Big DataGoogle NgramNumPy
0 likes · 9 min read
Recreating Google Ngram Trends with Python, PyTubes, and NumPy
Youku Technology
Youku Technology
May 20, 2019 · Big Data

Data‑Driven Dating Guide: Analyzing Zhihu Answers to Identify Potential Partners

In a playful data‑driven experiment, the author scraped 27,664 Zhihu answers to “What are your dating criteria?”, filtered out short, outdated, high‑profile or already‑matched posts, applied follower‑and engagement‑thresholds to narrow the pool to 480 candidates, then ranked the top 30 by a like‑to‑comment ratio, sharing the code and dataset for reproducibility.

data analysisdatingfiltering
0 likes · 8 min read
Data‑Driven Dating Guide: Analyzing Zhihu Answers to Identify Potential Partners
360 Tech Engineering
360 Tech Engineering
May 20, 2019 · Fundamentals

A Data‑Driven Guide to Finding a Partner: From Crawling Zhihu Answers to Ranking Candidates

This article walks through a complete data‑analysis workflow—scraping Zhihu dating‑preference answers, cleaning and filtering the data, deriving gender and activity metrics, designing a four‑step screening process, and finally ranking candidates with a custom like‑to‑comment index—to help a single programmer create a concise, high‑quality list of potential partners.

MetricsWeb Scrapingdata analysis
0 likes · 9 min read
A Data‑Driven Guide to Finding a Partner: From Crawling Zhihu Answers to Ranking Candidates
58UXD
58UXD
Apr 18, 2019 · Operations

How Winning Design Strategies Boosted Spring Festival Campaign Traffic

This article dissects the 2019 Spring Festival (春运) campaign by 58.com, revealing how a win‑win design mindset, data‑driven insights, and integrated business collaboration transformed user experience, increased traffic, and delivered measurable results across multiple channels and game‑based interactions.

Design ThinkingOperationsUser experience
0 likes · 11 min read
How Winning Design Strategies Boosted Spring Festival Campaign Traffic
Python Crawling & Data Mining
Python Crawling & Data Mining
Apr 15, 2019 · Databases

Master SQL Basics: A Concise Review Guide for Data Analysts

This guide provides a concise review of fundamental database concepts—including definitions of databases, RDBMS, SQL, and tables—and walks through essential SQL query techniques, data manipulation commands, schema definition statements, and control language basics, offering practical examples for data analysts.

DDLDMLdata analysis
0 likes · 9 min read
Master SQL Basics: A Concise Review Guide for Data Analysts
Python Crawling & Data Mining
Python Crawling & Data Mining
Mar 24, 2019 · Backend Development

How to Scrape Douban Book Data and Analyze It with Python

This tutorial shows how to collect book metadata such as publisher, publication date, ISBN, price, rating and review count from Douban for a list of titles stored in Excel, using Python requests, lxml XPath parsing, pandas for merging and analysis, and visualizing the results with matplotlib.

PythonXPathdata analysis
0 likes · 14 min read
How to Scrape Douban Book Data and Analyze It with Python
Tencent Cloud Developer
Tencent Cloud Developer
Mar 13, 2019 · Fundamentals

Introduction to NumPy, pandas, and Matplotlib for Python Data Analysis

This article introduces Python’s core data‑analysis stack—NumPy for fast multidimensional arrays, pandas for labeled DataFrames, and Matplotlib for interactive plotting—while showing how to set up a Jupyter/VS Code environment, perform basic indexing, slicing, and visualisation, and clean log files with pandas.

JupyterMatplotlibNumPy
0 likes · 9 min read
Introduction to NumPy, pandas, and Matplotlib for Python Data Analysis
Python Crawling & Data Mining
Python Crawling & Data Mining
Feb 27, 2019 · Fundamentals

R vs Python for Data Analysis: Which Language Wins?

This article presents a detailed infographic comparison of R and Python from a data‑science perspective, outlining their histories, ecosystems, usability, community support, and advantages in data analysis to help readers decide which language better fits their projects.

ComparisonData SciencePython
0 likes · 5 min read
R vs Python for Data Analysis: Which Language Wins?
iQIYI Technical Product Team
iQIYI Technical Product Team
Dec 14, 2018 · Artificial Intelligence

AI Applications in Modern Technology and Society

The podcast examines AI’s rapid integration into entertainment, security and personalization, highlighting its use in automated video editing, facial-recognition tagging of celebrities and non-celebrities, while debating ethical concerns such as echo-chambers, emotional nuance, and the technology’s transformative yet limited role across industries.

AIEntertainmentSecurity
0 likes · 7 min read
AI Applications in Modern Technology and Society
MaGe Linux Operations
MaGe Linux Operations
Dec 6, 2018 · Fundamentals

Mastering AWK: Powerful Text Processing and Reporting Techniques

AWK is a versatile text-processing language that reads files line by line, splits fields by default spaces, and applies pattern-action rules to analyze data, generate reports, and perform complex tasks such as extracting columns, counting records, and scripting with built-in variables, loops, and conditional statements.

Shell scriptingawkcommand-line
0 likes · 14 min read
Mastering AWK: Powerful Text Processing and Reporting Techniques
MaGe Linux Operations
MaGe Linux Operations
Nov 30, 2018 · Artificial Intelligence

Avoid These Common NumPy Pitfalls When Doing Machine Learning

This article examines frequent traps when using NumPy for matrix operations in machine learning, comparing its quirks to MATLAB/Octave and offering practical insights to prevent shape errors, inefficient indexing, confusing syntax, and unintuitive code patterns.

NumPyPythondata analysis
0 likes · 7 min read
Avoid These Common NumPy Pitfalls When Doing Machine Learning
Python Crawling & Data Mining
Python Crawling & Data Mining
Nov 27, 2018 · Big Data

What Do Python Jobs Really Pay? Inside a Data‑Driven Salary & Skill Analysis

This article crawls Lagou.com to collect 4,500 Python‑related job postings across ten roles, extracts salary, education, experience and skill requirements, visualizes the data with treemaps, rose charts, bar charts and word clouds, and provides detailed insights into each position’s market demands and compensation trends.

PythonSkillsdata analysis
0 likes · 18 min read
What Do Python Jobs Really Pay? Inside a Data‑Driven Salary & Skill Analysis
Tencent Cloud Developer
Tencent Cloud Developer
Nov 23, 2018 · Big Data

20 Free and Open-Source Data Visualization Tools

These 20 free and open‑source data visualization tools—from JavaScript libraries like D3.js and Chartist.js to user‑friendly platforms such as Datawrapper, Google Data Studio, and Tableau Public—enable businesses and analysts to transform raw data into interactive charts, maps, timelines, and dashboards, improving insight, decision‑making, and profitability.

Big DataData visualizationJavaScript libraries
0 likes · 12 min read
20 Free and Open-Source Data Visualization Tools
Python Crawling & Data Mining
Python Crawling & Data Mining
Nov 22, 2018 · Big Data

How to Scrape and Analyze China’s Tourist Attractions Data with Python

This article demonstrates how to crawl Qunar’s nationwide tourism listings with Python, extract key fields such as name, level, location and price, handle anti‑scraping measures, and then perform comprehensive data analysis and visualisation—including sales rankings, popularity scores, geographic distribution and geocoding using the Amap API.

GeocodingPythonTourism Data
0 likes · 11 min read
How to Scrape and Analyze China’s Tourist Attractions Data with Python
Big Data and Microservices
Big Data and Microservices
Sep 3, 2018 · Big Data

From Raw Data to Business Impact: A Complete Data Analyst Skill Guide

The article outlines a comprehensive data‑analyst competency framework, covering data collection, storage, extraction, mining, analysis, visualization, and practical application, and provides concrete questions, techniques, and tool recommendations to help analysts turn raw data into actionable business insights.

Business IntelligenceData visualizationdata analysis
0 likes · 9 min read
From Raw Data to Business Impact: A Complete Data Analyst Skill Guide
21CTO
21CTO
Aug 21, 2018 · R&D Management

From Code to Leadership: How Mid‑Career Developers Can Thrive in Management

The article shares a veteran programmer’s journey from facing a mid‑career tech crisis to leveraging data analysis and soft‑skill expertise, illustrating how shifting focus from pure coding to holistic project and business insight can open new management opportunities and sustain relevance in the evolving tech industry.

ManagementR&D managementSoftware Engineering
0 likes · 16 min read
From Code to Leadership: How Mid‑Career Developers Can Thrive in Management
Big Data and Microservices
Big Data and Microservices
Aug 16, 2018 · Big Data

Mastering Big Data Analysis: 5 Core Aspects and 4 Key Methods

This article outlines the five fundamental aspects of big data analysis—visualization, data‑mining algorithms, predictive analytics, semantic engines, and data quality management—and explains four primary analytical approaches: descriptive, diagnostic, predictive, and prescriptive analysis.

Big Datadata analysisdata mining
0 likes · 6 min read
Mastering Big Data Analysis: 5 Core Aspects and 4 Key Methods
Ctrip Technology
Ctrip Technology
Aug 7, 2018 · Artificial Intelligence

Forecasting and Monitoring in Business Intelligence: Practical Data‑Analysis Methods and Model‑Building Tips

The article explains how a data analyst can use statistical and machine‑learning models such as linear regression, tree‑based boosting, STL decomposition, and Prophet for both non‑time‑series forecasting and time‑series monitoring, highlighting data‑quality concerns, feature‑engineering practices, and deployment considerations like PMML packaging.

BIProphetSTL
0 likes · 13 min read
Forecasting and Monitoring in Business Intelligence: Practical Data‑Analysis Methods and Model‑Building Tips
MaGe Linux Operations
MaGe Linux Operations
Aug 2, 2018 · Big Data

Unlocking PUBG Victory: Data‑Driven Insights on Drop Zones, Final Circles, Weapons, and Kill Strategies

This article analyzes 18 million PUBG match records using Python to reveal optimal drop locations, high‑probability final‑circle spots, preferred weapons, and the relationship between kill distance, kill count, and winning chances, providing data‑driven strategies for players seeking more chicken dinners.

Big DataGame AnalyticsPUBG
0 likes · 13 min read
Unlocking PUBG Victory: Data‑Driven Insights on Drop Zones, Final Circles, Weapons, and Kill Strategies
Python Crawling & Data Mining
Python Crawling & Data Mining
Jul 31, 2018 · Big Data

Can Web‑Scraped Movie Reviews Predict Box Office? A Python Data‑Mining Case Study

Using Python to scrape over ten thousand Maoyan comments for the comedy film “The Billionaire” (西虹市首富), this article demonstrates data cleaning, geographic heat‑maps, city‑wise rating analysis, word‑cloud generation, and a simple box‑office forecast based on a comparable movie, illustrating practical web‑scraping and data‑mining techniques.

Box Office PredictionMovie ReviewsPython
0 likes · 10 min read
Can Web‑Scraped Movie Reviews Predict Box Office? A Python Data‑Mining Case Study
MaGe Linux Operations
MaGe Linux Operations
Jul 27, 2018 · Fundamentals

Master Pandas: Essential Techniques for Data Exploration and Analysis

This tutorial introduces Pandas fundamentals, covering installation, data structures, importing CSV files, inspecting and reshaping data, filtering with boolean masks, indexing, applying functions, grouping, merging, quick plotting, and saving results, all illustrated with clear examples and images.

Pythondata analysisdataframe
0 likes · 14 min read
Master Pandas: Essential Techniques for Data Exploration and Analysis
ITPUB
ITPUB
Jul 23, 2018 · Big Data

What China's Vaccine Procurement Data Reveals: A Province‑Level Analysis

This article documents the collection, cleaning, and statistical analysis of publicly released second‑category vaccine procurement data from 28 Chinese provinces, highlighting data sources, processing steps with pandas, top manufacturers, regional market shares, and the challenges encountered during the effort.

Big DataChinadata analysis
0 likes · 9 min read
What China's Vaccine Procurement Data Reveals: A Province‑Level Analysis
ITPUB
ITPUB
Jul 23, 2018 · Big Data

Uncovering China’s Vaccine Procurement: A Province‑Level Data Crawl and Analysis

This article documents the collection of public second‑class vaccine procurement data from 28 Chinese provinces, describes the CSV schema, outlines the challenges faced during web scraping, and presents a pandas‑driven statistical analysis that highlights top manufacturers and their provincial market shares.

Chinadata analysispandas
0 likes · 10 min read
Uncovering China’s Vaccine Procurement: A Province‑Level Data Crawl and Analysis
Tencent Cloud Developer
Tencent Cloud Developer
Jul 23, 2018 · Big Data

Analysis of Chinese Second-Class Vaccine Procurement Data

The study aggregates and cleans 2017‑2020 Chinese second‑class vaccine procurement data from 28 provinces into a 1,529‑record CSV, revealing a right‑skewed distribution where a handful of manufacturers—led by Beijing Kexing and Changchun Changsheng—account for the majority of entries, while noting gaps in several regions and encouraging further collaborative refinement.

Big DataChinese healthcaredata analysis
0 likes · 10 min read
Analysis of Chinese Second-Class Vaccine Procurement Data
Programmer DD
Programmer DD
Jul 10, 2018 · Big Data

Which Car Wins on Didi? Data‑Driven Model Selection for Ride‑Hailing

Using real‑time order and fuel‑point data from the Didi driver app, the author demonstrates a systematic, data‑driven approach to identify the most cost‑effective car models for ride‑hailing across major Chinese cities, complete with methodology, analysis, and city‑specific rankings.

DidiRide Hailingcar selection
0 likes · 19 min read
Which Car Wins on Didi? Data‑Driven Model Selection for Ride‑Hailing
21CTO
21CTO
Jul 6, 2018 · Fundamentals

Why Every Engineer Must Master Business Insight (And How)

The article argues that programmers need to deeply understand business concepts and data-driven decision making, explaining what business entails, why it matters for engineers, and offering practical methods to acquire business knowledge for more impactful, sustainable tech solutions.

Product DevelopmentSoftware Engineeringbusiness
0 likes · 11 min read
Why Every Engineer Must Master Business Insight (And How)
Efficient Ops
Efficient Ops
Jun 21, 2018 · Fundamentals

Can Python Predict the 2018 World Cup Champion? A Data‑Driven Analysis

This article demonstrates how to use Python, pandas, and Jupyter Notebook to explore a comprehensive World Cup dataset, clean and enrich the data, visualize win and goal statistics for all teams, and finally predict the top three contenders for the 2018 tournament.

PythonSports AnalyticsWorld Cup
0 likes · 12 min read
Can Python Predict the 2018 World Cup Champion? A Data‑Driven Analysis
Qunar Tech Salon
Qunar Tech Salon
Jun 15, 2018 · Artificial Intelligence

Predicting the 2018 FIFA World Cup Winners Using Machine Learning

This article demonstrates how to collect historical football data, perform exploratory analysis and feature engineering, and apply a logistic‑regression model in Python to predict the 2018 FIFA World Cup champion, group‑stage results, and knockout‑stage outcomes.

FIFA World CupPythondata analysis
0 likes · 8 min read
Predicting the 2018 FIFA World Cup Winners Using Machine Learning
Meituan Technology Team
Meituan Technology Team
May 10, 2018 · Operations

Quality Operations for Intelligent Payment: Improving Test Phase Metrics

By applying a PDCA‑based quality‑operation framework that aligns QA KPIs, drills defect data across dimensions, automates test‑gate checks, and drives continuous improvement actions, Meituan Dianping’s Intelligent Payment team reduced severe defect ratios, met defined metric targets, and boosted iteration efficiency while supporting rapid business growth.

MetricsSoftware Testingdata analysis
0 likes · 12 min read
Quality Operations for Intelligent Payment: Improving Test Phase Metrics
Qunar Tech Salon
Qunar Tech Salon
May 4, 2018 · Fundamentals

Quantifying and Analyzing App Performance Slowness: A QA Perspective

The article explains how QA engineers can systematically measure, analyze, and resolve app slowness by quantifying response times, dissecting device fragmentation, network latency, and backend complexity, and establishing a data‑driven quality loop to improve user experience.

app performancedata analysismobile testing
0 likes · 7 min read
Quantifying and Analyzing App Performance Slowness: A QA Perspective
Beike Product & Technology
Beike Product & Technology
Apr 26, 2018 · Big Data

Chain Home's OLAP Platform and Kylin Usage

This article details Chain Home's OLAP platform architecture and Kylin usage, covering the evolution from early ROLAP to MOLAP multi-dimensional engine, Kylin's basic principles, platform structure, application scenarios, usage specifications, capability extensions, and middleware development.

Apache KylinBig DataChain Home
0 likes · 11 min read
Chain Home's OLAP Platform and Kylin Usage
MaGe Linux Operations
MaGe Linux Operations
Mar 29, 2018 · Artificial Intelligence

Master Python’s Top Data Analysis & AI Libraries with Hands‑On Code

This article introduces Python’s essential features for data analysis and mining, then reviews the most widely used libraries—NumPy, SciPy, Matplotlib, Pandas, Scikit‑Learn, Keras, and Gensim—each accompanied by concise code examples that demonstrate their core capabilities.

KerasPythondata analysis
0 likes · 14 min read
Master Python’s Top Data Analysis & AI Libraries with Hands‑On Code
dbaplus Community
dbaplus Community
Jan 29, 2018 · Operations

How Data‑Driven Monitoring Unlocks Real Value for Ops Teams

This article explains why quantifiable data is essential for evaluating the impact of operational changes, outlines common data‑collection stacks, defines core business and user‑centric metrics, and demonstrates practical monitoring techniques such as PCU analysis, simulated user flows, and intelligent scaling to turn ops work into measurable business value.

DevOpsOperationsbusiness metrics
0 likes · 15 min read
How Data‑Driven Monitoring Unlocks Real Value for Ops Teams
21CTO
21CTO
Jan 10, 2018 · Backend Development

Scrape and Analyze Your WeChat Friends with Python and R

This article demonstrates how to use the Python itchat library to extract personal WeChat friend data, then analyzes gender ratios, city distribution, and signature word clouds with R and Python visualisation tools, offering a practical guide for personal social network analytics.

RWeChatdata analysis
0 likes · 6 min read
Scrape and Analyze Your WeChat Friends with Python and R
MaGe Linux Operations
MaGe Linux Operations
Nov 15, 2017 · Fundamentals

Master Stock Market Data Analysis with Python: Moving Averages Explained

This tutorial walks through using Python and pandas to fetch Yahoo Finance data, visualize stock prices with line and candlestick charts, and apply moving‑average techniques—including 20‑day, 50‑day, and 200‑day averages—to identify trends and build simple trading signals, all while emphasizing that the content is for educational purposes only and not investment advice.

Pythondata analysisfinance
0 likes · 13 min read
Master Stock Market Data Analysis with Python: Moving Averages Explained
21CTO
21CTO
Nov 7, 2017 · Big Data

What 3.3 Million Zhihu Users Reveal About Gender, Location, and Careers

Analyzing over 3.2 million publicly available Zhihu profiles collected via a distributed Python crawler, this report uncovers gender balance near 1:1, top residential cities, dominant occupations, university participation, and the most followed and active contributors, while noting data limitations and temporal relevance.

User Demographicsdata analysiszhihu
0 likes · 11 min read
What 3.3 Million Zhihu Users Reveal About Gender, Location, and Careers
网易UEDC
网易UEDC
Aug 30, 2017 · Product Management

How Designers Can Master Product Thinking: 5 Essential Mindsets

This article explains why designers need product thinking and breaks down the five key mindsets—user‑centered, logical, data‑driven, marketing, and project thinking—providing practical examples and actionable steps to boost influence, career growth, and design impact.

DesignProject ManagementUser experience
0 likes · 14 min read
How Designers Can Master Product Thinking: 5 Essential Mindsets
21CTO
21CTO
Aug 12, 2017 · Fundamentals

Generate Automated Survey Reports with Python in Minutes

This guide introduces a Python tool that automates the creation of descriptive and cross‑analysis survey reports in PPT and Excel formats, covering installation, data preparation, quick‑start code snippets, and additional utility functions for comprehensive statistical reporting.

Cross AnalysisStatistical ReportingSurvey Automation
0 likes · 7 min read
Generate Automated Survey Reports with Python in Minutes
21CTO
21CTO
Jul 7, 2017 · Big Data

How to Kickstart Your Big Data Career: A Complete Learning Roadmap

This guide walks beginners through the vast big data landscape, helping them choose the right role, understand essential terminology, plan a learning path, and access curated resources for becoming a data engineer or analyst, all illustrated with clear diagrams.

Big DataLearning Pathbig data technologies
0 likes · 16 min read
How to Kickstart Your Big Data Career: A Complete Learning Roadmap
MaGe Linux Operations
MaGe Linux Operations
May 27, 2017 · Big Data

How Big Data Drives Vending Machine Placement in Beijing Subways

This article explores how big data analysis can determine the demand, optimal placement, and profitability of beverage vending machines in Beijing subway stations, detailing passenger flow statistics, gender ratios, consumption patterns, and revenue projections to illustrate data‑driven operational strategies.

Profitabilitydata analysissubway
0 likes · 5 min read
How Big Data Drives Vending Machine Placement in Beijing Subways
Ctrip Technology
Ctrip Technology
Mar 8, 2017 · Big Data

Essential Skills and Career Path for Data Professionals: From Big Data Platforms to AI Applications

This article outlines the key competencies and career roadmap for data professionals, covering big‑data infrastructure, data‑warehouse engineering, visualization, analysis, algorithmic mining, and deep‑learning, while emphasizing the importance of business sense, cloud adoption, and continuous learning.

Data visualizationcareer advicedata analysis
0 likes · 15 min read
Essential Skills and Career Path for Data Professionals: From Big Data Platforms to AI Applications
Nightwalker Tech
Nightwalker Tech
Feb 27, 2017 · Big Data

Community Discussion on Learning Paths, Tools, and Applications in Big Data

A diverse group of practitioners share recommendations for books, technologies, real‑world use cases, and practical challenges when learning and applying big‑data processing, covering Hadoop, Spark, data visualization, ETL, and the relationship between data, algorithms, and business value.

Big DataHadoopdata analysis
0 likes · 16 min read
Community Discussion on Learning Paths, Tools, and Applications in Big Data
Ctrip Technology
Ctrip Technology
Feb 23, 2017 · Product Management

Applying AB Testing in Ctrip Flight Booking: Process, Data Flow, and Analysis

The article explains how Ctrip’s flight‑booking team uses AB testing—from definition and experimental design to data collection, traffic allocation, orthogonal experiments, and result analysis—to drive conversion‑rate and revenue improvements across multiple platforms.

AB testingconversion ratedata analysis
0 likes · 10 min read
Applying AB Testing in Ctrip Flight Booking: Process, Data Flow, and Analysis
Architects' Tech Alliance
Architects' Tech Alliance
Nov 30, 2016 · Big Data

Core Technologies and Challenges of Big Data: ETL, Storage, Analysis, and Cloud Integration

This article examines the core technologies of big data—including data collection, storage, management, analysis, and mining—highlighting architectural challenges, analysis techniques, storage solutions, ETL processes, and the interplay between big data and cloud computing, while emphasizing practical implementation considerations.

ETLcloud computingdata analysis
0 likes · 11 min read
Core Technologies and Challenges of Big Data: ETL, Storage, Analysis, and Cloud Integration
MaGe Linux Operations
MaGe Linux Operations
Oct 15, 2016 · Operations

Master Linux Shell Commands for Fast Data Exploration

This guide walks through essential Linux shell commands—from basic file viewing and simple statistics to powerful exploratory analysis tools—showing how data engineers can efficiently inspect, process, and batch‑handle logs and other data files using the command line.

Shelldata analysisexploratory analysis
0 likes · 12 min read
Master Linux Shell Commands for Fast Data Exploration
Alibaba Cloud Developer
Alibaba Cloud Developer
Sep 26, 2016 · Artificial Intelligence

Can Machine Learning Predict China’s Car License Lottery? Secrets in 13‑Digit IDs

This article investigates whether the 13‑digit user IDs used in Chinese car‑license lotteries are truly random, revealing how the ID generation, seed‑based selection, and hidden patterns—especially the influential seventh digit—affect outcomes, and demonstrates that simple linear models can achieve an AUC of around 0.8 in predicting winners, while also discussing the system’s opacity across major cities.

ID generationcar license lotterydata analysis
0 likes · 17 min read
Can Machine Learning Predict China’s Car License Lottery? Secrets in 13‑Digit IDs
21CTO
21CTO
Aug 24, 2016 · Artificial Intelligence

How User Profiling Powers Modern Recommendation Systems

This article explains how comprehensive user profiling—combining static demographics and dynamic behavior logs—feeds recommendation engines, detailing data sources, feature extraction, ranking formulas, and the long‑term goals of delivering personalized, high‑quality content to users.

data analysispersonalizationrecommendation system
0 likes · 6 min read
How User Profiling Powers Modern Recommendation Systems
Baidu Intelligent Testing
Baidu Intelligent Testing
Aug 18, 2016 · Mobile Development

Establishing an Effective Mobile App Quality Monitoring System: Standards, Metrics, and Data Utilization

This article explains how to build a comprehensive mobile app quality monitoring framework by defining quality standards, setting capability indicators, and leveraging data acquisition, analysis, and visualization to continuously improve product reliability and user experience across different development stages.

MetricsOperationsQuality Monitoring
0 likes · 11 min read
Establishing an Effective Mobile App Quality Monitoring System: Standards, Metrics, and Data Utilization