Tagged articles
466 articles
Page 2 of 5
Model Perspective
Model Perspective
Feb 13, 2024 · Big Data

Mastering Noisy Data: From Cleaning to Visualization and NLP with Python

This article reviews the key concepts from the Bad Data Handbook, covering noise identification, data validation, human readability, web data restructuring, special domain challenges, and data quality analysis, while also presenting practical data visualization techniques, popular analysis tools, Python web‑scraping libraries, and a basic NLP workflow with code examples.

Data visualizationNLPPython
0 likes · 20 min read
Mastering Noisy Data: From Cleaning to Visualization and NLP with Python
Top Architecture Tech Stack
Top Architecture Tech Stack
Feb 8, 2024 · Backend Development

Guide to Using the 12306 Ticket‑Grabbing Python Project

This article introduces the popular 12306 ticket‑grabbing assistant, explains its history and features, provides step‑by‑step installation and configuration instructions—including required dependencies, proxy settings, and email notifications—and shows how to run the script to automatically secure train tickets during the Spring Festival travel rush.

AutomationBackendConfiguration
0 likes · 10 min read
Guide to Using the 12306 Ticket‑Grabbing Python Project
Python Programming Learning Circle
Python Programming Learning Circle
Feb 2, 2024 · Operations

17 Essential Python Scripts for Automating Everyday Tasks

This article presents 17 practical Python scripts covering file management, web scraping, email handling, Excel processing, database interaction, system tasks, image editing, and more, each with code examples and explanations, enabling developers and analysts to automate repetitive workflows and boost productivity across diverse domains.

AutomationEmailPython
0 likes · 26 min read
17 Essential Python Scripts for Automating Everyday Tasks
php Courses
php Courses
Jan 18, 2024 · Backend Development

Building an Efficient Web Crawler with PHP and Selenium

This article explains how to set up a web crawler using PHP and Selenium, covering installation of Selenium and its PHP bindings via Composer, configuring a Chrome WebDriver, simulating user actions to fetch news links, extracting titles and content, and storing results, with tips for further optimization.

AutomationPHPSelenium
0 likes · 4 min read
Building an Efficient Web Crawler with PHP and Selenium
Test Development Learning Exchange
Test Development Learning Exchange
Jan 16, 2024 · Artificial Intelligence

Python Code Samples for Data Scraping and Analysis Across Various Business Scenarios

This article presents a collection of Python code examples demonstrating how to scrape, process, visualize, and analyze data from news sites, social media, stock markets, e‑commerce, web traffic, text, images, and more, covering tasks such as clustering, time‑series forecasting, and sentiment analysis.

PythonWeb Scrapingdata analysis
0 likes · 12 min read
Python Code Samples for Data Scraping and Analysis Across Various Business Scenarios
Python Programming Learning Circle
Python Programming Learning Circle
Jan 13, 2024 · Operations

17 Essential Python Scripts for Automating Everyday Tasks

Explore 17 practical Python scripts that automate tasks ranging from file management and web scraping to email handling, database interaction, system monitoring, and cloud services, enabling developers and analysts to boost productivity, reduce errors, and streamline workflows across diverse domains.

AutomationEmailPython
0 likes · 28 min read
17 Essential Python Scripts for Automating Everyday Tasks
Python Crawling & Data Mining
Python Crawling & Data Mining
Dec 27, 2023 · Backend Development

How to Scrape ETF Data with Python: Step-by-Step Code and Tips

This article walks through retrieving ETF fund codes and names from Eastmoney using Python's requests and pandas, explains constructing the correct URLs, handling pagination, cleaning the JSON response, and provides complete sample scripts, while also highlighting a simpler solution and recommending a data‑collection platform.

Data ExtractionETFWeb Scraping
0 likes · 8 min read
How to Scrape ETF Data with Python: Step-by-Step Code and Tips
Test Development Learning Exchange
Test Development Learning Exchange
Dec 21, 2023 · Backend Development

Advanced Requestium Techniques for Python Web Automation and Scraping

This article introduces Requestium, a Python library that merges Selenium and Requests, and provides step‑by‑step examples covering basic session setup, installation, dynamic content handling, user interaction simulation, cookie and header management, asynchronous fetching, iframe switching, alert handling, and screenshot or video recording.

PythonRequestiumSelenium
0 likes · 8 min read
Advanced Requestium Techniques for Python Web Automation and Scraping
37 Interactive Technology Team
37 Interactive Technology Team
Dec 18, 2023 · Frontend Development

Using LangChain to Automatically Generate Front‑End Code from Documentation

This guide shows how to use LangChain with OpenAI’s API, Puppeteer, and vector stores to automatically read local or web‑based API documentation, split and retrieve relevant text, and prompt an LLM to generate ready‑to‑use TypeScript front‑end code, highlighting setup, prompt design, and example outputs.

Front-end Code GenerationLangChainNode.js
0 likes · 15 min read
Using LangChain to Automatically Generate Front‑End Code from Documentation
Tencent Cloud Developer
Tencent Cloud Developer
Dec 7, 2023 · Artificial Intelligence

Student Score Ranking and Distribution Analysis Using Python and Tencent Hunyuan Model

Using Tencent's Hunyuan model, the tutorial walks through a Python workflow that scrapes a student‑score table from a web page, saves it as CSV and Excel, cleans missing values, computes total and average scores, and visualizes their distributions with matplotlib, illustrating how LLMs can accelerate data‑analysis coding while still needing human verification.

Data visualizationMatplotlibPython
0 likes · 8 min read
Student Score Ranking and Distribution Analysis Using Python and Tencent Hunyuan Model
Tencent Cloud Developer
Tencent Cloud Developer
Nov 23, 2023 · Artificial Intelligence

Calling Your Private GPTs from ChatGPT with 50 Lines of Python Code

This tutorial shows how, with about 50 lines of Python using Playwright and pyperclip, you can automate a persistent Firefox session to log into ChatGPT Plus, navigate to your private GPT URL, send prompts and retrieve responses via the clipboard, all without incurring extra API fees.

AutomationBrowser AutomationChatGPT
0 likes · 14 min read
Calling Your Private GPTs from ChatGPT with 50 Lines of Python Code
Python Programming Learning Circle
Python Programming Learning Circle
Nov 11, 2023 · Fundamentals

Python Weather Data Scraping, CSV Storage, and Visualization with Matplotlib

This article demonstrates how to use Python's requests and BeautifulSoup libraries to scrape current and 14‑day weather data from China Weather, store the results in CSV files, and perform comprehensive visual analysis with pandas, numpy, and matplotlib, including temperature, humidity, AQI, wind radar, and correlation plots.

CSVData visualizationMatplotlib
0 likes · 24 min read
Python Weather Data Scraping, CSV Storage, and Visualization with Matplotlib
Python Programming Learning Circle
Python Programming Learning Circle
Nov 3, 2023 · Backend Development

Curated Collection of Python Web Scraping Tools and Tutorials

This article compiles a variety of Python web‑scraping utilities—including file download assistants, video downloaders, proxy IP pool builders, captcha bypass scripts, and game data extractors—providing descriptions, installation commands, usage examples, download links, and reference links for each tool.

BackendData ExtractionWeb Scraping
0 likes · 6 min read
Curated Collection of Python Web Scraping Tools and Tutorials
php Courses
php Courses
Sep 6, 2023 · Backend Development

Implementing a Web Crawler with PHP and Goutte

This tutorial explains how to set up the PHP environment, install the Goutte library, and use it to fetch page content, extract hyperlinks, and submit forms, providing complete code examples for building a functional web crawler.

AutomationBackendCrawler
0 likes · 5 min read
Implementing a Web Crawler with PHP and Goutte
Sohu Tech Products
Sohu Tech Products
Aug 23, 2023 · Backend Development

How to Scrape Bilibili Comments and Analyze Them with ChatGPT

This article walks through discovering Bilibili's comment API, programmatically fetching paginated JSON data, converting it into Java POJOs, storing and sorting the comments, and finally feeding the top entries to ChatGPT for automated sentiment and content analysis.

APIBilibiliChatGPT
0 likes · 14 min read
How to Scrape Bilibili Comments and Analyze Them with ChatGPT
Python Programming Learning Circle
Python Programming Learning Circle
Aug 21, 2023 · Backend Development

Python Multithreading for Web Scraping: Concepts, Code Samples, and Performance Comparison

This tutorial explains process and thread fundamentals, compares single‑threaded and multithreaded Python crawlers, provides complete code examples for both approaches, and demonstrates how converting a single‑threaded scraper to multithreading can significantly reduce execution time when handling large data volumes.

Web Scrapingconcurrencythreading
0 likes · 9 min read
Python Multithreading for Web Scraping: Concepts, Code Samples, and Performance Comparison
Python Programming Learning Circle
Python Programming Learning Circle
Aug 4, 2023 · Backend Development

Scraping Maoyan Real-Time Box Office Data with Selenium and Visualizing the Results

Using Python's Selenium library, this tutorial demonstrates how to scrape real-time box office data from Maoyan's regular page, extract movie names, total and incremental earnings, process the data with pandas, export to Excel, and create visual analyses of top‑10 films' revenues and market shares.

Data visualizationSeleniumWeb Scraping
0 likes · 16 min read
Scraping Maoyan Real-Time Box Office Data with Selenium and Visualizing the Results
php Courses
php Courses
Jul 25, 2023 · Backend Development

Using PHP cURL to Send Concurrent Requests to Multiple URLs

This article explains how to enable the PHP cURL extension, create an array of target URLs, and use curl_multi functions with example code to perform concurrent HTTP requests and collect their responses efficiently.

Concurrent RequestsPHPWeb Scraping
0 likes · 6 min read
Using PHP cURL to Send Concurrent Requests to Multiple URLs
Test Development Learning Exchange
Test Development Learning Exchange
Jul 12, 2023 · Fundamentals

Common Python Libraries and Practical Projects: NumPy, Pandas, Matplotlib, Scikit‑learn, Requests, Beautiful Soup, Selenium, Pygame, Flask, PyTorch

This article introduces ten widely used Python libraries—NumPy, Pandas, Matplotlib, Scikit‑learn, Requests, Beautiful Soup, Selenium, Pygame, Flask, and PyTorch—each accompanied by a concise real‑world project and complete code examples to help readers understand and apply them effectively.

Data ScienceDeep LearningGame Development
0 likes · 18 min read
Common Python Libraries and Practical Projects: NumPy, Pandas, Matplotlib, Scikit‑learn, Requests, Beautiful Soup, Selenium, Pygame, Flask, PyTorch
Python Crawling & Data Mining
Python Crawling & Data Mining
Jul 11, 2023 · Backend Development

Keep Selenium Browser Open for Multiple Debug Sessions

This article explains how to use Selenium's window handles and switch_to.window method to perform repeated debugging within the same browser instance, providing a full Python example that logs in, switches windows, and returns to the original session without closing the browser.

Browser debuggingSeleniumWeb Automation
0 likes · 5 min read
Keep Selenium Browser Open for Multiple Debug Sessions
Test Development Learning Exchange
Test Development Learning Exchange
Jul 6, 2023 · Backend Development

Scrapy Framework Overview and Usage Guide

Scrapy is a powerful Python-based web scraping framework designed for large-scale and complex website data extraction. It offers high-level abstractions, built-in data extraction tools using XPath and CSS selectors, asynchronous processing for parallel requests, and flexible pipelines for data storage, making it ideal for efficient and scalable web scraping projects.

Backend DevelopmentData ExtractionPython
0 likes · 5 min read
Scrapy Framework Overview and Usage Guide
Python Crawling & Data Mining
Python Crawling & Data Mining
Jul 3, 2023 · Backend Development

How to Fix Common Python Web‑Scraping Issues with Ready‑to‑Use Code

This article walks through a real Python web‑scraping problem, explains why missing request headers cause failures, and provides complete, runnable code snippets—including header setup, cookie handling, and request loops—to retrieve stock data from Xueqiu, followed by a concise summary of the solution.

Data ExtractionWeb Scrapingrequests
0 likes · 5 min read
How to Fix Common Python Web‑Scraping Issues with Ready‑to‑Use Code
php Courses
php Courses
Jun 28, 2023 · Backend Development

Simulating Website Login with PHP cURL

This guide demonstrates how to use PHP's cURL library to programmatically log into a website by configuring the login URL, form fields, request options, handling errors, and processing the response for tasks such as data scraping or automated testing.

Login AutomationPHPWeb Scraping
0 likes · 4 min read
Simulating Website Login with PHP cURL
php Courses
php Courses
Jun 14, 2023 · Backend Development

Using PHP and Selenium WebDriver for Browser-Based Web Scraping

This article explains how to install php-webdriver via Composer, set up a Selenium WebDriver instance in PHP, and write a script that automates a Chrome browser to scrape search results from Baidu, demonstrating key WebDriver APIs for element interaction and data extraction.

AutomationPHPSelenium
0 likes · 5 min read
Using PHP and Selenium WebDriver for Browser-Based Web Scraping
Python Crawling & Data Mining
Python Crawling & Data Mining
Jun 13, 2023 · Backend Development

How to Scrape 51Job Listings with Python: A Complete Guide

This article walks through a Python-based web scraper that extracts job listings from 51job.com, detailing required header and cookie settings, pagination logic, data parsing, and CSV output, and includes full code snippets and tips for handling site changes and further data analysis.

Data ExtractionJob DataWeb Scraping
0 likes · 9 min read
How to Scrape 51Job Listings with Python: A Complete Guide
Test Development Learning Exchange
Test Development Learning Exchange
May 30, 2023 · Backend Development

Building a Simple Python Tool for Detecting Event Tracking (埋点) in Web Applications

This article demonstrates how to create a Python-based tool that crawls click‑event data from an e‑commerce site, processes and visualizes the information to verify event‑tracking code, covering preparation, data collection, analysis, and visualization steps with complete source code.

Backend DevelopmentData visualizationPython
0 likes · 7 min read
Building a Simple Python Tool for Detecting Event Tracking (埋点) in Web Applications
Python Crawling & Data Mining
Python Crawling & Data Mining
May 1, 2023 · Backend Development

Scrape Shandong Government Data with Python: Full Code Walkthrough

This article walks through a real‑world Python web‑scraping case, showing how to retrieve disease‑pest data from the Shandong government site using custom headers, cookies, and request parameters, and provides two complete code examples that successfully return the desired JSON results.

Backend DevelopmentData ExtractionWeb Scraping
0 likes · 6 min read
Scrape Shandong Government Data with Python: Full Code Walkthrough
Python Programming Learning Circle
Python Programming Learning Circle
Apr 28, 2023 · Backend Development

10 Python Automation Scripts to Simplify Repetitive Tasks

This article presents ten practical Python automation scripts—including HTML parsing, QR code scanning, screenshot capture, audiobook creation, PDF editing, StackOverflow querying, mobile device control, CPU/GPU temperature monitoring, Instagram uploading, and video watermarking—to help readers eliminate repetitive tasks and streamline their workflows.

MobileScriptingWeb Scraping
0 likes · 13 min read
10 Python Automation Scripts to Simplify Repetitive Tasks
FunTester
FunTester
Apr 10, 2023 · Industry Insights

How to Map China’s Qingming Rainfall Using Free Weather APIs and Python

This guide shows how to collect nationwide weather data for the Qingming Festival via the free QWeather API, process it with Python to generate a CSV file, and visualize the rainfall distribution across China using a desktop data‑visualization tool.

CSVData visualizationPython
0 likes · 6 min read
How to Map China’s Qingming Rainfall Using Free Weather APIs and Python
Laravel Tech Community
Laravel Tech Community
Apr 2, 2023 · Backend Development

QueryList: A Modern PHP Content Scraping Library – Features, Installation, and Usage Guide

This article introduces QueryList, a modern PHP content‑scraping tool that uses CSS selectors instead of regex, explains its two versions (V3 and V4), shows how to install it via Composer, demonstrates basic crawling code and various collection methods such as flatten, take, reverse, filter, map, and multi‑request concurrency.

Content ExtractionWeb Scrapingdata-processing
0 likes · 7 min read
QueryList: A Modern PHP Content Scraping Library – Features, Installation, and Usage Guide
Python Crawling & Data Mining
Python Crawling & Data Mining
Mar 17, 2023 · Backend Development

Build a Python Stock Data Scraper with Requests and Pandas

This article walks through building a Python web scraper that fetches stock trading data using the requests library and pandas, showing the complete code, how to set headers and cookies, and the resulting DataFrame, while highlighting the limits of relying solely on ChatGPT‑generated snippets.

APIData ExtractionWeb Scraping
0 likes · 4 min read
Build a Python Stock Data Scraper with Requests and Pandas
Python Crawling & Data Mining
Python Crawling & Data Mining
Feb 13, 2023 · Backend Development

Master Python Web Scraping & Data Extraction with Requests, lxml, pandas

This article walks through a Python web‑scraping solution that fetches GDP data from a website using the requests library, parses HTML with lxml, and demonstrates two approaches—manual XPath extraction and a streamlined pandas.read_html method—while providing complete code snippets and tips for handling pagination and data storage.

Data ExtractionWeb Scrapinglxml
0 likes · 6 min read
Master Python Web Scraping & Data Extraction with Requests, lxml, pandas
MaGe Linux Operations
MaGe Linux Operations
Oct 16, 2022 · Operations

10 Powerful Python Automation Scripts to Eliminate Repetitive Tasks

Discover ten practical Python automation scripts—from HTML parsing and QR code scanning to screenshot capture, PDF editing, StackOverflow querying, mobile device control, hardware monitoring, Instagram uploading, and video watermarking—that streamline repetitive tasks and boost productivity across diverse workflows.

AutomationScriptingWeb Scraping
0 likes · 13 min read
10 Powerful Python Automation Scripts to Eliminate Repetitive Tasks
Python Crawling & Data Mining
Python Crawling & Data Mining
Oct 5, 2022 · Backend Development

How to Decode URL‑Encoded Strings in Python Web Scraping

An in‑depth guide shows how to decode URL‑encoded strings encountered during Python web scraping, explains the difference between two encoding formats, and provides ready‑to‑run urllib code that prints the original Chinese characters, helping developers troubleshoot similar crawling issues.

DecodeURL encodingWeb Scraping
0 likes · 4 min read
How to Decode URL‑Encoded Strings in Python Web Scraping
Python Programming Learning Circle
Python Programming Learning Circle
Jul 30, 2022 · Backend Development

Python Web Scraping Tutorial: Crawling QDaily, Storing in SQLite, Analyzing Data and Generating a Word Cloud

This tutorial walks through building a simple Python web crawler for the QDaily website, covering target analysis, environment setup, SQLite database creation, data extraction with requests and BeautifulSoup, storing articles and comments, performing basic analysis, and visualizing results with a word cloud.

PythonSQLiteWeb Scraping
0 likes · 6 min read
Python Web Scraping Tutorial: Crawling QDaily, Storing in SQLite, Analyzing Data and Generating a Word Cloud
MaGe Linux Operations
MaGe Linux Operations
Jul 3, 2022 · Backend Development

How to Automate 10,000 Video‑Channel Posts with Python and OCR for Massive Traffic

This guide shows how to use Python to scrape high‑quality chat screenshots, apply OCR, generate silent chat videos, batch‑download matching audio from short‑video platforms, and combine them into thousands of unique WeChat Video Channel clips, leveraging volume to outsmart recommendation algorithms and boost traffic.

AutomationOCRPython
0 likes · 11 min read
How to Automate 10,000 Video‑Channel Posts with Python and OCR for Massive Traffic
Python Programming Learning Circle
Python Programming Learning Circle
May 23, 2022 · Backend Development

Simulating Zhihu Login with Python Using urllib and Fiddler

This article demonstrates how to automate Zhihu login on Windows by analyzing network traffic with Fiddler, extracting required parameters, and implementing a Python script that builds HTTP requests using urllib2, handles cookies, captcha retrieval, and logs the results, complete with sample code and execution screenshots.

FiddlerHTTPLogin Automation
0 likes · 8 min read
Simulating Zhihu Login with Python Using urllib and Fiddler
Sohu Tech Products
Sohu Tech Products
May 18, 2022 · Fundamentals

Overview of a Web Page Content Extraction Algorithm and Its Practical Demo

This article introduces a web page content extraction algorithm that automatically structures titles, timestamps, body text, authors, and sources from arbitrary news pages, explains how to use an online demo, compares it with existing solutions, and discusses its broader applications and limitations.

Content ExtractionGNEWeb Scraping
0 likes · 8 min read
Overview of a Web Page Content Extraction Algorithm and Its Practical Demo
Programmer DD
Programmer DD
May 15, 2022 · Backend Development

How to Automate Gym Reservations with Selenium: A Google Engineer’s Design Doc

This article explains how a Google engineer designed and implemented an automated system using Python and Selenium to book gym slots two days in advance, detailing problem definition, requirements, architecture, code snippets, and operational workflow for reliable, headless execution on macOS.

AutomationPythonScheduling
0 likes · 11 min read
How to Automate Gym Reservations with Selenium: A Google Engineer’s Design Doc
MaGe Linux Operations
MaGe Linux Operations
May 12, 2022 · Backend Development

7 Fun Python Projects You Can Build in Minutes

This article shares seven practical Python scripts—ranging from a 30‑line Zhihu image scraper and chatbot conversation loop to poetry author classification, lottery number generation, automatic apology letter creation, screen‑capture automation, and GIF assembly—demonstrating how to avoid reinventing the wheel while learning useful automation techniques.

AutomationChatbotLottery Generator
0 likes · 8 min read
7 Fun Python Projects You Can Build in Minutes
Python Programming Learning Circle
Python Programming Learning Circle
May 11, 2022 · Frontend Development

Comprehensive Guide to Configuring Chrome DevTools for Python Web Scraping

This article provides a detailed walkthrough of Chrome DevTools configuration and usage—including global settings, shortcuts, element inspection, network throttling, and code extraction—to help Python developers efficiently collect web data, with step‑by‑step instructions, screenshots, and code snippets.

Chrome DevToolsDebuggingPython
0 likes · 9 min read
Comprehensive Guide to Configuring Chrome DevTools for Python Web Scraping
21CTO
21CTO
Apr 19, 2022 · Information Security

Web Scraping Legalized and Chrome Zero‑Day Patched: Key Tech Updates

Recent developments include a US appellate court affirming that publicly accessible web data can be scraped legally, Google releasing an emergency Chrome 100.0.4896.127 patch for the critical CVE‑2022‑1364 V8 type‑confusion flaw, DB‑Engines’ latest database popularity rankings highlighting Redis’s rise, and Mullvad’s Firefox‑only privacy extension becoming open‑source.

Browser SecurityWeb Scrapingdatabases
0 likes · 6 min read
Web Scraping Legalized and Chrome Zero‑Day Patched: Key Tech Updates
Open Source Linux
Open Source Linux
Apr 11, 2022 · Mobile Development

Automate Douyin Video Scraping with Python, mitmproxy, and Appium

This tutorial shows how to combine mitmproxy packet capture and Appium mobile automation in Python to automatically collect and download Douyin video URLs, covering environment setup, code snippets, and practical steps for a fully automated scraper.

AppiumDouyinMobile Automation
0 likes · 8 min read
Automate Douyin Video Scraping with Python, mitmproxy, and Appium
IT Services Circle
IT Services Circle
Mar 26, 2022 · Information Security

Common Reasons Why Your Proxy Fails to Hide Your Web Scraper

The article explains several typical situations—such as not configuring HTTPS proxies, using server IPs, non‑anonymous proxies, polluted IP pools, and lack of HTTP/2 support—that cause websites to easily detect that a request is made through a proxy, even for beginner Python scrapers.

HTTPProxyPython
0 likes · 7 min read
Common Reasons Why Your Proxy Fails to Hide Your Web Scraper
IT Architects Alliance
IT Architects Alliance
Mar 14, 2022 · Backend Development

Design Document: Automated Gym Reservation Bot Using Selenium and Python

This article presents a detailed engineering design document for a Python‑based automation tool that uses Selenium to reserve gym slots during the COVID‑19 pandemic, covering problem description, requirements, overview and detailed design, implementation specifics, and operational workflow.

Gym BookingPythonSelenium
0 likes · 10 min read
Design Document: Automated Gym Reservation Bot Using Selenium and Python
IT Services Circle
IT Services Circle
Mar 7, 2022 · Big Data

Awesome Web Scraping – A Comprehensive Chinese Collection of Web Scraping Resources

This article introduces the renowned "awesome" GitHub repository, highlights its extensive sub‑lists for various domains, focuses on the awesome‑web‑scraping collection, and presents a newly created Chinese version that aggregates Python, JavaScript, Go, and other language‑specific web‑scraping tools and libraries.

Awesome ListData ExtractionGitHub
0 likes · 4 min read
Awesome Web Scraping – A Comprehensive Chinese Collection of Web Scraping Resources
MaGe Linux Operations
MaGe Linux Operations
Mar 4, 2022 · Backend Development

How to Build a Local QR‑Code Login Scraper for QQ Music with Python

This tutorial walks through creating a Python‑based local QR‑code login scraper for QQ Music, covering the extraction of dynamic parameters, handling of encrypted cookies, displaying and removing QR images, and ultimately obtaining a usable session for further automation.

Login AutomationPythonQR code
0 likes · 15 min read
How to Build a Local QR‑Code Login Scraper for QQ Music with Python
Sohu Tech Products
Sohu Tech Products
Mar 2, 2022 · Backend Development

Using requests‑cache to Cache HTTP Requests in Python Web Scraping

This article introduces the requests‑cache library, explains how to install it, demonstrates basic and advanced usage—including session patching, backend selection, expiration policies, request/response filtering, and cache‑control header handling—to efficiently avoid duplicate HTTP requests during Python web scraping.

HTTPPythonWeb Scraping
0 likes · 11 min read
Using requests‑cache to Cache HTTP Requests in Python Web Scraping
Python Crawling & Data Mining
Python Crawling & Data Mining
Feb 9, 2022 · Artificial Intelligence

How to Turn Crawled CSV Data into Word Clouds and Sentiment Scores with Python

This guide walks you through extracting text from a CSV obtained via Python web scraping, cleaning it with stop‑words, generating a word‑cloud, performing jieba tokenization and frequency analysis, and finally applying SnowNLP for sentiment scoring, with all code snippets and data links provided.

Sentiment AnalysisSnowNLPWeb Scraping
0 likes · 12 min read
How to Turn Crawled CSV Data into Word Clouds and Sentiment Scores with Python
MaGe Linux Operations
MaGe Linux Operations
Feb 8, 2022 · Backend Development

Automate Qutoutiao Short Video Uploads with Python & Selenium

This tutorial demonstrates how to use Python and Selenium to automatically log in, upload videos and cover images, set titles, descriptions, tags, and publish short videos on the Qutoutiao platform, providing complete source code and step‑by‑step instructions.

PythonQutoutiaoSelenium
0 likes · 7 min read
Automate Qutoutiao Short Video Uploads with Python & Selenium
FunTester
FunTester
Feb 8, 2022 · Backend Development

How to Automatically Extract Publication Dates from WeChat Articles with Groovy

The article explains how the author built a Groovy‑based scraper that reads a Markdown list of WeChat links, fetches each article’s HTML, extracts the hidden publication timestamp with a regex, and rewrites the Markdown file to include the dates, using simple HTTP calls and a brief pause to avoid anti‑scraping measures.

AutomationGroovyWeChat
0 likes · 6 min read
How to Automatically Extract Publication Dates from WeChat Articles with Groovy
Python Crawling & Data Mining
Python Crawling & Data Mining
Feb 2, 2022 · Backend Development

Bypass SVG Anti‑Scraping and Extract Data with Selenium and requests‑html

This article explains how to scrape data protected by SVG background‑image anti‑scraping by using Selenium to retrieve the SVG URL, parsing the SVG with requests‑html to map background offsets to characters, replacing SVG nodes with text, and finally extracting structured information such as phone numbers and reviews.

Data ExtractionSVGSelenium
0 likes · 11 min read
Bypass SVG Anti‑Scraping and Extract Data with Selenium and requests‑html
MaGe Linux Operations
MaGe Linux Operations
Jan 9, 2022 · Big Data

How to Scrape Maoyan Movie Data and Visualize Trends with Python

This tutorial walks you through collecting movie information from Maoyan using Python web‑scraping, storing the results in CSV, and then applying pandas, matplotlib, and WordCloud to analyze and visualize trends such as release years, genres, regions, durations, and ratings across China and the world.

Movie DataPythonWeb Scraping
0 likes · 13 min read
How to Scrape Maoyan Movie Data and Visualize Trends with Python
MaGe Linux Operations
MaGe Linux Operations
Dec 30, 2021 · Backend Development

Download Watermark‑Free Douyin Videos with a Simple Python Script

This article explains a streamlined method to extract and download watermark‑free Douyin short videos by inspecting network requests, locating hidden video URLs, and using a concise Python script with the jsonpath library, highlighting its advantages and remaining limitations.

DouyinJsonPathPython
0 likes · 5 min read
Download Watermark‑Free Douyin Videos with a Simple Python Script
Programmer DD
Programmer DD
Dec 28, 2021 · Backend Development

Master Web Scraping with Java: Getting Started with Jsoup

This article introduces Jsoup, an open‑source Java library for extracting and manipulating HTML, explains its key features such as DOM traversal and CSS selectors, and provides a concise code example that fetches Wikipedia headlines, helping developers automate web data collection.

BackendData ExtractionJava
0 likes · 3 min read
Master Web Scraping with Java: Getting Started with Jsoup
Python Crawling & Data Mining
Python Crawling & Data Mining
Dec 20, 2021 · Fundamentals

What Weibo Comments Reveal About Wang Leehom’s Divorce: A Python Data Dive

This article walks through using Python to scrape Wang Leehom’s divorce‑related Weibo comments, clean the noisy dataset, visualize hourly comment trends, compare with his ex‑wife’s posts, generate word‑clouds and emoji frequency charts, and provides full code and data for reproducible analysis.

Data visualizationEmoji AnalysisWeb Scraping
0 likes · 10 min read
What Weibo Comments Reveal About Wang Leehom’s Divorce: A Python Data Dive