Tagged articles
193 articles
Page 2 of 2
MaGe Linux Operations
MaGe Linux Operations
Mar 4, 2022 · Backend Development

How to Build a Local QR‑Code Login Scraper for QQ Music with Python

This tutorial walks through creating a Python‑based local QR‑code login scraper for QQ Music, covering the extraction of dynamic parameters, handling of encrypted cookies, displaying and removing QR images, and ultimately obtaining a usable session for further automation.

Login AutomationPythonQR code
0 likes · 15 min read
How to Build a Local QR‑Code Login Scraper for QQ Music with Python
IT Services Circle
IT Services Circle
Feb 25, 2022 · Backend Development

Detecting and Handling Gzip Bombs in Web Crawling with Python Requests

This article explains how to identify gzip‑compressed responses that may be gzip bombs, how to inspect HTTP headers and raw response data using Python's requests library, and provides command‑line and code examples for measuring compressed and uncompressed sizes without triggering decompression.

GzipWeb Crawlingcompression
0 likes · 5 min read
Detecting and Handling Gzip Bombs in Web Crawling with Python Requests
MaGe Linux Operations
MaGe Linux Operations
Sep 18, 2021 · Backend Development

How to Build a Python Crawler to Grab TV Drama Links Automatically

This article explains how to create a Python web crawler that automatically generates URLs for a drama‑download site, filters out invalid pages, extracts ed2k links using requests and regular expressions, saves them to text files, and employs multithreading to speed up processing, while discussing challenges such as duplicate URLs and filename sanitization.

CrawlerWeb Scrapingmultithreading
0 likes · 7 min read
How to Build a Python Crawler to Grab TV Drama Links Automatically
Python Programming Learning Circle
Python Programming Learning Circle
Aug 31, 2021 · Backend Development

Python Web Crawler for Downloading Drama Links from cn163.net

This article describes how to build a Python web crawler that automatically generates numeric URLs, checks their validity, extracts download links for TV dramas from cn163.net, saves them to text files, and discusses practical challenges such as regex parsing, filename restrictions, and multithreading performance.

Pythoncrawlingfile-handling
0 likes · 7 min read
Python Web Crawler for Downloading Drama Links from cn163.net
Python Programming Learning Circle
Python Programming Learning Circle
Jul 28, 2021 · Backend Development

Python Requests Tutorial: GET, POST, HTTPS, File Upload/Download, Timeout, Authentication, Session, Token, and Test Automation Framework

This article provides a comprehensive Python tutorial covering how to send GET and POST requests, handle HTTPS verification, perform file uploads and downloads, set request timeouts, use basic and token authentication, manage sessions, generate signatures, and organize a test automation framework with configuration, common utilities, test data, test cases, reporting, and execution scripts.

API testingAutomationBackend
0 likes · 12 min read
Python Requests Tutorial: GET, POST, HTTPS, File Upload/Download, Timeout, Authentication, Session, Token, and Test Automation Framework
Python Programming Learning Circle
Python Programming Learning Circle
Jul 14, 2021 · Backend Development

Bypassing Anti‑Scraping Mechanisms: User‑Agent Spoofing and IP Rate Limiting with Python

This article explains how to overcome common anti‑scraping defenses such as identity verification and IP rate limiting by spoofing the User‑Agent header and adding request delays, providing complete Python code examples using requests and BeautifulSoup to scrape Douban's Top 250 movies.

IP throttlingUser-AgentWeb Scraping
0 likes · 6 min read
Bypassing Anti‑Scraping Mechanisms: User‑Agent Spoofing and IP Rate Limiting with Python
MaGe Linux Operations
MaGe Linux Operations
May 14, 2021 · Fundamentals

Boost Your Python Productivity with 7 Essential Efficiency Tools

This article introduces seven powerful Python tools—including Pandas, Selenium, Flask, Scrapy, Requests, Faker, and Pillow—explaining their core features and providing ready-to-use code snippets to help developers automate data analysis, testing, web development, crawling, API calls, fake data generation, and image processing.

FakerFlaskScrapy
0 likes · 6 min read
Boost Your Python Productivity with 7 Essential Efficiency Tools
Python Programming Learning Circle
Python Programming Learning Circle
Mar 29, 2021 · Fundamentals

Understanding Python Context Managers: Basics, Custom Implementations, and Advanced Applications

This article explains Python's context manager mechanism, covering the basic with statement, custom __enter__/__exit__ classes, the contextlib.contextmanager decorator, nesting, combining multiple managers with ExitStack, and practical applications such as SQLAlchemy session handling, exception management, and persistent HTTP requests.

DecoratorPythonResource Management
0 likes · 6 min read
Understanding Python Context Managers: Basics, Custom Implementations, and Advanced Applications
Python Programming Learning Circle
Python Programming Learning Circle
Mar 26, 2021 · Backend Development

Building a Python API Automation Testing Framework: Design, Configuration, Logging, Database, and Email Integration

This article details the step‑by‑step creation of a Python‑based API automation testing framework, covering project structure, configuration files, a reusable logging module, HTTP request handling with requests, common utilities for Excel and XML, MySQL database access, email reporting, and the test runner setup.

API testingAutomationEmail
0 likes · 20 min read
Building a Python API Automation Testing Framework: Design, Configuration, Logging, Database, and Email Integration
Python Crawling & Data Mining
Python Crawling & Data Mining
Dec 31, 2020 · Backend Development

How to Scrape Thousands of New‑House Listings in Python: A Step‑by‑Step Guide

This article demonstrates how to use Python's requests, fake_useragent, and lxml libraries to batch‑scrape nearly a thousand new‑house listings from the 惠民之家 website, extracting 41 fields such as name, price, layout, opening date, plot ratio and green ratio, while handling pagination and anti‑scraping measures.

CSVPythonReal Estate Data
0 likes · 9 min read
How to Scrape Thousands of New‑House Listings in Python: A Step‑by‑Step Guide
21CTO
21CTO
Dec 29, 2020 · Backend Development

How to Fetch ISS Data with Python: A Beginner’s Guide to APIs

This tutorial explains what an API is, shows how to install and use the Python requests library to make GET requests, interprets common HTTP status codes, and demonstrates parsing JSON data from the Open Notify ISS API using the built‑in json module.

JSONTutorialrequests
0 likes · 7 min read
How to Fetch ISS Data with Python: A Beginner’s Guide to APIs
FunTester
FunTester
Dec 8, 2020 · Backend Development

Automating API Tests for Special Character Handling with Python Requests

This article explains how to use Python's requests library and CSV handling to automate API testing of payment interfaces, focusing on correctly processing special characters in usernames and account names to ensure secure and fast transactions.

API testingAutomationCSV
0 likes · 6 min read
Automating API Tests for Special Character Handling with Python Requests
Python Crawling & Data Mining
Python Crawling & Data Mining
Nov 30, 2020 · Backend Development

Build a Python Movie Scraper: Download Films from FilmSky with Ease

This guide walks you through setting up a Python environment, installing required libraries, constructing a FilmSky scraper class, handling pagination, parsing HTML with regex, and saving movie titles and download links, enabling you to browse and download movies from the FilmSky website efficiently.

PythonWeb Scrapingmovie downloader
0 likes · 6 min read
Build a Python Movie Scraper: Download Films from FilmSky with Ease
MaGe Linux Operations
MaGe Linux Operations
Oct 29, 2020 · Backend Development

Master HTTP Requests with Python httpx: GET, POST, PUT, Streaming & More

This guide walks you through using the Python httpx library to perform various HTTP methods—including GET, POST, PUT, DELETE, HEAD, and OPTIONS—handle query parameters, decode responses, work with JSON, custom headers, form data, file uploads, streaming, cookies, redirects, and authentication, all with clear code examples.

APIAuthenticationHTTP
0 likes · 10 min read
Master HTTP Requests with Python httpx: GET, POST, PUT, Streaming & More
MaGe Linux Operations
MaGe Linux Operations
Oct 21, 2020 · Backend Development

Scrape Tenda Router Data with Python Requests and Serve It via a Flask API

This guide walks through analyzing a Tenda router's web interface to discover its data‑fetching URLs, extracting all available modules, and then using Python's requests library to retrieve the information and expose it through a simple Flask API, with both object‑oriented and functional code examples.

BackendFlaskRouter
0 likes · 7 min read
Scrape Tenda Router Data with Python Requests and Serve It via a Flask API
MaGe Linux Operations
MaGe Linux Operations
Oct 11, 2020 · Backend Development

Master Python File Downloads: Requests, Wget, urllib, Boto3 & Asyncio

This tutorial walks you through using various Python modules—including requests, wget, urllib, urllib3, boto3, and asyncio—to download regular files, web pages, Amazon S3 objects, handle redirects, large files, parallel downloads, proxies, and progress bars, all with clear code examples.

Boto3File Downloadasyncio
0 likes · 8 min read
Master Python File Downloads: Requests, Wget, urllib, Boto3 & Asyncio
Python Crawling & Data Mining
Python Crawling & Data Mining
Jun 5, 2020 · Backend Development

Build a Python Image Scraper for 51miz.com in Minutes

This tutorial walks you through creating a Python web scraper that fetches image URLs from 51miz.com using requests and lxml, filters them with regular expressions, downloads the images, and demonstrates the complete workflow with code snippets and screenshots.

PythonWeb ScrapingXPath
0 likes · 5 min read
Build a Python Image Scraper for 51miz.com in Minutes
Python Crawling & Data Mining
Python Crawling & Data Mining
May 28, 2020 · Backend Development

Multithreaded Python Crawl of Xiaomi App Store Games

This tutorial demonstrates how to use Python's requests, threading, and queue modules to build a multithreaded crawler that extracts game names, download links, and execution time from the Xiaomi App Store, complete with code examples and performance tips.

PythonWeb ScrapingXiaomi App Store
0 likes · 7 min read
Multithreaded Python Crawl of Xiaomi App Store Games
21CTO
21CTO
Apr 21, 2020 · Fundamentals

Master Python File Downloads: Requests, Wget, urllib, Async & More

This tutorial walks through multiple Python approaches for downloading files—including simple requests and wget calls, handling redirects, large and multi‑file downloads, proxy usage, urllib/urllib3 methods, and asynchronous techniques—providing complete code snippets and practical tips for each scenario.

Pythonasynciofile-download
0 likes · 11 min read
Master Python File Downloads: Requests, Wget, urllib, Async & More
MaGe Linux Operations
MaGe Linux Operations
Dec 25, 2019 · Backend Development

Master Web Crawling in Python: From urllib to requests and Robots.txt

This guide explains the fundamentals of web crawling, covering crawler types, the Robots.txt protocol, Python's urllib and urllib3 modules, the requests library, handling HTTP methods, user‑agents, HTTPS certificates, and practical code examples for extracting data from websites.

Pythonrequestsrobots.txt
0 likes · 18 min read
Master Web Crawling in Python: From urllib to requests and Robots.txt
HomeTech
HomeTech
Dec 4, 2019 · Databases

Automating Test Data Generation with SQLAlchemy ORM, Faker, and Requests in Python

This article demonstrates how to automate e‑commerce test data creation by retrieving schema information with SQLAlchemy ORM, defining mapping classes, generating realistic data using Faker, and submitting it via HTTP requests with the Requests library, covering database interactions, data assembly, and API communication.

FakerHTTPORM
0 likes · 13 min read
Automating Test Data Generation with SQLAlchemy ORM, Faker, and Requests in Python
Python Crawling & Data Mining
Python Crawling & Data Mining
Nov 21, 2019 · Backend Development

Essential Python Web Scraping Libraries Every Developer Should Know

This guide introduces the most important Python libraries for web scraping—including requests, urllib3, Selenium, aiohttp, BeautifulSoup, lxml, pyquery, PyMySQL, PyMongo, and redisdump—explaining their core features, typical use cases, and providing concise code examples to help beginners get started quickly.

aiohttpbeautifulsouplxml
0 likes · 7 min read
Essential Python Web Scraping Libraries Every Developer Should Know
MaGe Linux Operations
MaGe Linux Operations
Jul 11, 2019 · Backend Development

How to Scrape JD Product Reviews and Create Word Clouds with Python

This tutorial walks you through analyzing JD product pages, extracting comment data via requests with proper headers, handling pagination, saving results, cleaning text using jieba, and visualizing frequent terms as a word cloud, all illustrated with step‑by‑step screenshots and code snippets.

Pythonjiebarequests
0 likes · 10 min read
How to Scrape JD Product Reviews and Create Word Clouds with Python
MaGe Linux Operations
MaGe Linux Operations
Apr 4, 2019 · Backend Development

Build a Python Crawler to Auto‑Collect TV Drama Download Links

This article describes how the author built a Python web crawler to automatically generate numeric URLs, fetch TV drama pages from the 天天美剧 site, extract ed2k download links using regular expressions, and save them into organized text files, streamlining the download process with Thunder.

Crawlerdata collectionmultithreading
0 likes · 6 min read
Build a Python Crawler to Auto‑Collect TV Drama Download Links
转转QA
转转QA
Mar 20, 2019 · Operations

Real-time Monitoring of H5 Pages Using Headless Browser and Puppeteer

This article describes a real‑time monitoring solution for large numbers of H5 pages that combines Python's Requests library for data crawling with a headless Chrome browser driven by Puppeteer to detect resource errors, API failures, and DOM anomalies, automatically alerting stakeholders.

AutomationHeadless BrowserNode.js
0 likes · 8 min read
Real-time Monitoring of H5 Pages Using Headless Browser and Puppeteer
MaGe Linux Operations
MaGe Linux Operations
Jul 30, 2018 · Backend Development

Build a Python Crawler to Automatically Grab Drama Download Links

This article explains how to create a Python web‑scraper that automatically generates URLs, fetches drama pages from a download site, extracts ed2k links with regular expressions, saves them to text files, and handles missing pages and filename restrictions efficiently.

CrawlerPythondrama-download
0 likes · 7 min read
Build a Python Crawler to Automatically Grab Drama Download Links
MaGe Linux Operations
MaGe Linux Operations
Jun 12, 2018 · Backend Development

How to Scrape Douban Movie Reviews in 12 Lines of Python

Learn to quickly build a Python web scraper using requests and Xpath to extract Douban movie 'Black Panther' short reviews, covering setup, HTTP request analysis, data parsing, storage with pandas, and best practices like polite crawling intervals, all demonstrated with concise 12-line code.

XPathpandasrequests
0 likes · 10 min read
How to Scrape Douban Movie Reviews in 12 Lines of Python
MaGe Linux Operations
MaGe Linux Operations
Nov 21, 2017 · Backend Development

How to Scrape All Chinese Stock Data with Python: A Step‑by‑Step Guide

This tutorial explains how to collect the names and trading information of every stock listed on the Shanghai and Shenzhen exchanges using Python 3.5, requests, BeautifulSoup and regular expressions, then store the results in a file through a clear three‑step process of fetching the stock list, retrieving each stock's page, parsing the HTML and writing the data.

Stock Databeautifulsoupfile-output
0 likes · 11 min read
How to Scrape All Chinese Stock Data with Python: A Step‑by‑Step Guide
MaGe Linux Operations
MaGe Linux Operations
Sep 9, 2017 · Backend Development

Build a Python Image Downloader: Step‑by‑Step Web Scraping Tutorial

This tutorial walks through building a Python web scraper that automatically downloads images from Baidu by analyzing requirements, inspecting page source, crafting regex patterns, and implementing the crawler with requests, offering step‑by‑step guidance, code snippets, and troubleshooting tips.

PythonWeb Scrapingimage-downloader
0 likes · 7 min read
Build a Python Image Downloader: Step‑by‑Step Web Scraping Tutorial
MaGe Linux Operations
MaGe Linux Operations
May 23, 2017 · Backend Development

How to Build a Python Zhihu Web Scraper: Login, User Data, and More

This article walks through building a Python web scraper for Zhihu, covering login simulation, extracting user profiles, answer likers, followers, avatars, and all answers of a question, and storing the collected data in SQLite, while highlighting challenges like captcha and anti‑scraping limits.

SQLitebeautifulsoupdata-extraction
0 likes · 10 min read
How to Build a Python Zhihu Web Scraper: Login, User Data, and More
MaGe Linux Operations
MaGe Linux Operations
May 20, 2017 · Backend Development

5 Must‑Use Python Libraries to Supercharge Your Projects

This article introduces five highly practical Python packages—yagmail, requests, psutil, BeautifulSoup, and a collection of utility scripts—explaining how each simplifies common tasks such as sending emails, making HTTP calls, system monitoring, web scraping, and code reuse, complete with concise code examples.

PythonWeb Scrapinglibraries
0 likes · 14 min read
5 Must‑Use Python Libraries to Supercharge Your Projects
MaGe Linux Operations
MaGe Linux Operations
May 9, 2017 · Backend Development

11 Essential Python requests Tricks to Supercharge Your HTTP Calls

This guide compiles eleven practical Python requests tips—including preserving cookies across calls, simplifying headers, adding retry logic, disabling redirects, sending JSON payloads, extracting debug information, performing asynchronous requests with grequests, customizing cookies, mocking APIs, measuring request time, and setting timeouts—to help developers write cleaner, more reliable HTTP code.

APIHTTPPython
0 likes · 4 min read
11 Essential Python requests Tricks to Supercharge Your HTTP Calls
ITPUB
ITPUB
Dec 17, 2015 · Backend Development

Build a Simple Python Image Scraper on macOS – Step‑by‑Step Guide

This tutorial walks you through setting up a macOS environment, inspecting a web page, and writing a Python script with the requests library to locate and download all images from a target site, complete with code explanations and execution tips.

PythonTutorialWeb Scraping
0 likes · 7 min read
Build a Simple Python Image Scraper on macOS – Step‑by‑Step Guide