Tagged articles
466 articles
Python Programming Learning Circle
Nov 25, 2021 · Big Data

Analyzing and Visualizing Maoyan Movie Reviews for “Chinese Doctors” Using Python

This tutorial demonstrates how to crawl approximately 40,000 Maoyan movie reviews for the film “Chinese Doctors,” preprocess the data, and create visualizations such as rating pie charts, city distribution maps, top‑viewer bar charts, and a word cloud using Python libraries like requests, pyecharts, and wordcloud.

Data visualization · Movie Reviews · Python
0 likes · 12 min read
Python Crawling & Data Mining
Nov 2, 2021 · Backend Development

How to Scrape Global Electricity Capacity Data with Python and Visualize It

Learn step-by-step how to collect worldwide electricity generation data from the IEA using Python's requests and pandas, extract country names, retrieve historical consumption figures, and turn the results into animated visualizations with online tools, while avoiding manual entry and ensuring reproducible analysis.

Data visualization · Electricity · IEA
0 likes · 5 min read
MaGe Linux Operations
Oct 23, 2021 · Backend Development

How to Scrape and Analyze Taobao Snack Sales Data with Python

This article walks through a real‑world Python project that uses Selenium to crawl the first ten pages of Taobao snack listings, extracts sales, price and location data, visualizes price distribution and geographic concentration, generates a word‑cloud of top user comments, and lists the top‑selling stores, providing full source code for replication.

Python · Selenium · Taobao
0 likes · 12 min read
Python Crawling & Data Mining
Oct 11, 2021 · Backend Development

How to Scrape and Analyze 46k Rental Listings with Python: From Crawling to Visual Insights

Learn step‑by‑step how to crawl 46,000+ rental listings from Ziroom using Python, extract house details with regex, clean and transform the data with pandas, and visualize distribution, pricing and location insights through pyecharts, matplotlib and seaborn, revealing rental market patterns in Beijing.

Data visualization · Pyecharts · Web Scraping
0 likes · 24 min read
Python Crawling & Data Mining
Oct 4, 2021 · Backend Development

Automate Shanghai Stock Exchange Report Downloads with a Python Web Scraper

This tutorial shows how to use Python's requests and JSON handling to crawl the Shanghai Stock Exchange website, extract periodic report metadata, construct PDF URLs, and automatically download the files, providing a practical example of backend web‑scraping automation while warning against excessive server load.

PDF download · SSE · Web Scraping
0 likes · 28 min read
MaGe Linux Operations
Sep 18, 2021 · Backend Development

How to Build a Python Crawler to Grab TV Drama Links Automatically

This article explains how to create a Python web crawler that automatically generates URLs for a drama‑download site, filters out invalid pages, extracts ed2k links using requests and regular expressions, saves them to text files, and employs multithreading to speed up processing, while discussing challenges such as duplicate URLs and filename sanitization.

Crawler · Web Scraping · multithreading
0 likes · 7 min read
Python Crawling & Data Mining
Sep 11, 2021 · Artificial Intelligence

How to Preprocess Captcha Images with OpenCV for Python Scraping

This tutorial explains how to collect captcha images, preprocess them with OpenCV—including grayscale conversion, median blur, binarization, contour detection, and character segmentation—and provides core Python code and visual results for building a reliable Python web‑scraping pipeline.

Captcha · OpenCV · Web Scraping
0 likes · 7 min read
Python Crawling & Data Mining
Aug 26, 2021 · Fundamentals

Transform Raw Taobao Data into Stunning Interactive Charts with Python

This article walks you through cleaning messy Taobao product data using pandas and jieba, then visualizing ingredient and shelf‑life information with interactive Pyecharts charts—including pie, bar, table, funnel, and polar graphs—while showing how to combine multiple charts into a single draggable HTML page.

Data visualization · Web Scraping
0 likes · 14 min read
Sohu Tech Products
Aug 25, 2021 · Backend Development

Scrapy Tutorial: Installation, Project Structure, Basic Usage, and Real‑World Example

This article provides a comprehensive, step‑by‑step guide to the Scrapy web‑crawling framework, covering its core components, installation methods, project layout, spider creation, data extraction techniques, pagination handling, pipeline configuration, and how to run the crawler to collect and store data.

Crawler · Data Extraction · Python
0 likes · 13 min read
MaGe Linux Operations
Aug 14, 2021 · Frontend Development

Build a Stunning Python Music Player with PyQt5 – Step‑by‑Step Guide

This tutorial walks you through creating a feature‑rich desktop music player in Python using PyQt5, covering UI layout design, keyword‑based music crawling, multithreaded downloading, playback controls, volume adjustment, and additional functions like random and repeat modes, all illustrated with complete code snippets and screenshots.

Audio Playback · GUI · Web Scraping
0 likes · 22 min read
MaGe Linux Operations
Aug 1, 2021 · Backend Development

How to Simulate a Login with Python: Scrape Protected Websites

This guide explains how to programmatically log into a website using Python 2's urllib2 and cookielib modules (urllib.request and http.cookiejar in Python 3), covering cookie handling, captcha retrieval, form data construction, request headers, and posting credentials to access protected pages.

Captcha · Login Automation · Web Scraping
0 likes · 5 min read
Python Programming Learning Circle
Jul 14, 2021 · Backend Development

Bypassing Anti‑Scraping Mechanisms: User‑Agent Spoofing and IP Rate Limiting with Python

This article explains how to overcome common anti‑scraping defenses such as identity verification and IP rate limiting by spoofing the User‑Agent header and adding request delays, providing complete Python code examples using requests and BeautifulSoup to scrape Douban's Top 250 movies.

IP throttling · User-Agent · Web Scraping
0 likes · 6 min read
21CTO
Jul 12, 2021 · Backend Development

Master Scrapy: From Basics to Advanced Spider Development

This comprehensive guide introduces Scrapy's architecture, explains its core components and data flow, teaches XPath fundamentals, walks through installation, project creation, spider coding, item and pipeline definitions, middleware customization, pagination handling, and essential settings for effective Python web crawling.

Crawler · Python · Scrapy
0 likes · 14 min read
360 Quality & Efficiency
Jul 2, 2021 · Backend Development

Integrating Scrapy with Selenium for Dynamic Web Page Crawling

This guide explains how to combine Scrapy and Selenium to scrape dynamically rendered web pages, covering installation, project setup, middleware configuration, Selenium driver handling, and code examples that demonstrate a complete end‑to‑end crawling workflow.

Dynamic Pages · Python · Scrapy
0 likes · 12 min read
Python Crawling & Data Mining
Jun 25, 2021 · Backend Development

How to Scrape NBA Player Stats from Hupu and Auto‑Generate Excel Charts with Python

This guide walks you through building a Python web‑scraper that extracts NBA player information from the Hupu website, cleans and visualizes the data, and automatically creates Excel files with embedded line charts, covering URL navigation, data parsing with requests and BeautifulSoup, and chart generation with xlsxwriter.

Data visualization · NBA · Python
0 likes · 10 min read
MaGe Linux Operations
Jun 13, 2021 · Fundamentals

7 Fun Python Projects: Web Scraping, Chatbots, Poetry Classification and More

This article presents seven practical Python scripts—from a concise web scraper for Zhihu images and a chatbot conversation loop to a Naive Bayes poem author classifier, a lottery number generator, an automated essay writer, a screen‑capture tool, and a GIF creator—demonstrating how to avoid reinventing the wheel while exploring diverse automation tasks.

Chatbot · Data Generation · NLP
0 likes · 8 min read
MaGe Linux Operations
May 7, 2021 · Fundamentals

7 Fun Python Projects to Automate Everyday Tasks (Full Code Included)

This article shares seven practical Python scripts, ranging from scraping Zhihu images and a chatbot to AI poem analysis, lottery number generation, automatic apology letters, screen recording, and GIF creation, demonstrating how to avoid reinventing the wheel while learning useful automation techniques.

AI · Code Examples · Python
0 likes · 9 min read
NiuNiu MaTe
May 2, 2021 · Fundamentals

How to Master Python Quickly: A Complete Learning Roadmap for 2024

This guide explains why Python is essential, presents a step‑by‑step learning roadmap covering beginner basics, backend web development, web crawling, data analysis, and machine learning, and provides curated resources and project links to help learners progress efficiently.

Backend Development · Web Scraping · data analysis
0 likes · 8 min read
Python Crawling & Data Mining
Apr 24, 2021 · Fundamentals

Discover 140+ Must‑Know Python Libraries for Data Science & AI

The article presents a comprehensive guide to Python's built‑in functions, standard libraries, and third‑party packages across file I/O, web scraping, databases, data cleaning, statistical analysis, machine learning, visualization, and more, rating each with stars and offering a free e‑book collection for readers.

Python · Web Scraping · data analysis
0 likes · 32 min read
Python Programming Learning Circle
Apr 13, 2021 · Backend Development

Python Web Scraping Techniques: GET/POST Requests, Proxy IP, Cookies, Header Spoofing, Gzip Compression, and Multithreading

This article provides a comprehensive Python web‑scraping guide covering basic GET/POST requests with Python 2's urllib2, proxy handling, cookie management, header manipulation to mimic browsers, gzip compression handling, regular‑expression and library parsing, simple captcha strategies, and a multithreaded thread‑pool example.

Gzip · Header Spoofing · Proxy
0 likes · 8 min read
Python Crawling & Data Mining
Mar 11, 2021 · Backend Development

How to Build a Robust Python Web Crawler for Forum Comments with Scrapy & Selenium

This article walks through building a Python web crawler that extracts forum post comments into MongoDB, covering project goals, environment setup, site structure analysis, Scrapy and Selenium integration, data storage design, handling anti‑scraping measures, and performance optimization with multithreading.

Crawler · Data Extraction · MongoDB
0 likes · 13 min read
FunTester
Mar 9, 2021 · Backend Development

Groovy Script for Crawling and Downloading QR Code Images Using HTTP and Regex

This article demonstrates a Groovy script that extracts QR‑code image URLs from a web page using regular expressions, then downloads each image to a local directory, illustrating practical web‑scraping techniques and reusable utility methods for HTTP requests and file handling.

File Download · Groovy · Web Scraping
0 likes · 6 min read
Programmer DD
Feb 18, 2021 · Backend Development

InfoSpider: Open‑Source Python Toolbox for Secure Personal Data Scraping and Analysis

InfoSpider is an open‑source Python‑based web‑scraping toolbox that securely aggregates personal data from over 24 sources—including email, e‑commerce, and social platforms—provides a GUI for easy operation, stores results in JSON, and offers basic visual analysis, making personal data integration and insight generation straightforward.

Open source · Python · Web Scraping
0 likes · 12 min read
Java Captain
Feb 10, 2021 · Backend Development

Java Web Crawler for Downloading Sogou Image Search Results

This article demonstrates how to build a Java-based web crawler that fetches image URLs from Sogou Image Search, parses the JSON response, and downloads thousands of pictures to a local directory using multithreaded processing and custom HTTP utilities.

HTTP · Image Download · Java
0 likes · 19 min read
MaGe Linux Operations
Jan 19, 2021 · Backend Development

Master Playwright-Python: Quick Guide to Browser Automation for Beginners

This article introduces Microsoft’s open‑source Playwright‑Python library, explains its advantages over Selenium, walks through installation, code generation, synchronous and asynchronous usage, mobile emulation, and highlights its strengths and current limitations for Python developers.

Browser Automation · Mobile Emulation · Playwright
0 likes · 10 min read
Python Crawling & Data Mining
Jan 15, 2021 · Backend Development

How We Scraped and Analyzed the Global Top 100 Most Beautiful Women with Python

In this article we demonstrate how to use Python’s requests and BeautifulSoup to scrape a static webpage listing the world’s top 100 most beautiful women, process the data with pandas and pyecharts for continent, nationality and profession statistics, and apply Baidu’s face‑recognition API to score each celebrity’s attractiveness.

Data visualization · Pyecharts · Python
0 likes · 7 min read
DevOps Cloud Academy
Jan 1, 2021 · Backend Development

Parsing WeChat Public Account Articles via API and Python Script

This guide demonstrates how to retrieve a list of WeChat public account articles by constructing the appropriate API URL, examining its query parameters, viewing the JSON response, and using a Python script to parse the data and generate a markdown file of titles and links.

API · JSON · Python
0 likes · 3 min read
Python Crawling & Data Mining
Dec 31, 2020 · Backend Development

How to Scrape Thousands of New‑House Listings in Python: A Step‑by‑Step Guide

This article demonstrates how to use Python's requests, fake_useragent, and lxml libraries to batch‑scrape nearly a thousand new‑house listings from the 惠民之家 website, extracting 41 fields such as name, price, layout, opening date, plot ratio and green ratio, while handling pagination and anti‑scraping measures.

CSV · Python · Real Estate Data
0 likes · 9 min read
Python Programming Learning Circle
Dec 19, 2020 · Fundamentals

XPath Basics and lxml Usage in Python

This article introduces the fundamentals of XPath syntax, common rules, and example expressions, then explains how to use the lxml library in Python for HTML/XML parsing, including practical tips and a complete code example for extracting links and text from a sample document.

Python · Web Scraping · XML
0 likes · 6 min read
Python Crawling & Data Mining
Dec 2, 2020 · Artificial Intelligence

Scrape, Clean, and Visualize Tencent Video Comments with Python – A Full Guide

This article walks through using Python to crawl Tencent Video's "Offer" season 2 comments, merge and clean the CSV data, perform exploratory analysis, generate visualizations and word clouds, and apply Baidu's open‑source NLP model for sentiment scoring, providing complete code snippets for each step.

Python · Sentiment Analysis · Web Scraping
0 likes · 16 min read