Tagged articles
37 articles
Page 1 of 1
Python Programming Learning Circle
Python Programming Learning Circle
May 12, 2025 · Fundamentals

Basic Python Web Scraping Techniques and Tips

This article introduces beginner-friendly Python web‑scraping methods, covering the simplest urllib/requests approach, adding request headers, inspecting network traffic for hidden data, handling dynamically loaded content with Selenium, and provides links to deeper tutorials for each technique.

Network InspectionPythonSelenium
0 likes · 4 min read
Basic Python Web Scraping Techniques and Tips
Python Programming Learning Circle
Python Programming Learning Circle
Mar 25, 2025 · Backend Development

Comprehensive Python Guide to Download Files from the Web, S3, and Other Sources

This tutorial walks through multiple Python techniques for downloading regular files, web pages, Amazon S3 objects, and other resources, covering basic requests, wget, handling redirects, chunked large‑file downloads, parallel downloads, progress bars, urllib, urllib3, proxy usage, boto3 for S3, and asynchronous downloads with asyncio.

Boto3File DownloadPython
0 likes · 8 min read
Comprehensive Python Guide to Download Files from the Web, S3, and Other Sources
Python Programming Learning Circle
Python Programming Learning Circle
Oct 10, 2024 · Backend Development

Python Web Scraping Techniques: Requests, Proxies, Cookies, Headers, Captcha, Gzip, and Multithreading

This article outlines essential Python web‑scraping techniques, covering basic GET/POST requests, proxy usage, cookie handling, header manipulation to mimic browsers, simple captcha solutions, gzip compression handling, and multithreaded crawling with a thread‑pool template, providing practical code examples for each step.

GzipProxyPython
0 likes · 5 min read
Python Web Scraping Techniques: Requests, Proxies, Cookies, Headers, Captcha, Gzip, and Multithreading
Python Programming Learning Circle
Python Programming Learning Circle
Apr 11, 2024 · Backend Development

Comprehensive Guide to Downloading Files with Python Using requests, wget, urllib, urllib3, Boto3, asyncio and More

This tutorial explains how to download regular files, web pages, Amazon S3 objects, and other resources in Python by covering multiple modules such as requests, wget, urllib, urllib3, Boto3, and asyncio, and demonstrates handling redirects, large files, parallel downloads, progress bars, proxies, and asynchronous execution.

Boto3File Downloadasyncio
0 likes · 8 min read
Comprehensive Guide to Downloading Files with Python Using requests, wget, urllib, urllib3, Boto3, asyncio and More
Python Programming Learning Circle
Python Programming Learning Circle
Mar 13, 2023 · Fundamentals

Downloading Files in Python Using requests, wget, urllib, boto3, and asyncio

This tutorial demonstrates multiple Python techniques for downloading files—including simple requests.get calls, wget and urllib modules, handling redirects and large files with chunked streaming, parallel batch downloads, progress bars, proxy support, S3 retrieval via boto3, and asynchronous downloads with asyncio—providing a comprehensive guide for developers.

Boto3File Downloadasyncio
0 likes · 8 min read
Downloading Files in Python Using requests, wget, urllib, boto3, and asyncio
Python Programming Learning Circle
Python Programming Learning Circle
Nov 12, 2022 · Backend Development

Comprehensive Guide to Python urllib Library: Modules, Functions, and Usage Examples

This article provides a detailed tutorial on Python's urllib library, covering its main modules (request, error, parse, robotparser), key functions and classes, code examples for URL fetching, parsing, encoding, and handling robots.txt, making it a practical resource for backend developers and web scrapers.

Networkingurlliburlparse
0 likes · 13 min read
Comprehensive Guide to Python urllib Library: Modules, Functions, and Usage Examples
Python Programming Learning Circle
Python Programming Learning Circle
Oct 12, 2022 · Backend Development

Comprehensive Guide to Downloading Files in Python Using Requests, wget, urllib, urllib3, Boto3, and asyncio

This tutorial walks through multiple Python approaches for downloading files—including simple requests.get calls, the wget module, handling redirects, chunked large‑file downloads, parallel batch downloads, proxy usage with urllib, S3 retrieval via Boto3, and asynchronous fetching with asyncio—providing code examples and best‑practice tips.

BackendBoto3File Download
0 likes · 8 min read
Comprehensive Guide to Downloading Files in Python Using Requests, wget, urllib, urllib3, Boto3, and asyncio
Python Crawling & Data Mining
Python Crawling & Data Mining
Oct 5, 2022 · Backend Development

How to Decode URL‑Encoded Strings in Python Web Scraping

An in‑depth guide shows how to decode URL‑encoded strings encountered during Python web scraping, explains the difference between two encoding formats, and provides ready‑to‑run urllib code that prints the original Chinese characters, helping developers troubleshoot similar crawling issues.

DecodeURL encodingWeb Scraping
0 likes · 4 min read
How to Decode URL‑Encoded Strings in Python Web Scraping
Python Programming Learning Circle
Python Programming Learning Circle
May 23, 2022 · Backend Development

Simulating Zhihu Login with Python Using urllib and Fiddler

This article demonstrates how to automate Zhihu login on Windows by analyzing network traffic with Fiddler, extracting required parameters, and implementing a Python script that builds HTTP requests using urllib2, handles cookies, captcha retrieval, and logs the results, complete with sample code and execution screenshots.

FiddlerHTTPLogin Automation
0 likes · 8 min read
Simulating Zhihu Login with Python Using urllib and Fiddler
Python Crawling & Data Mining
Python Crawling & Data Mining
May 19, 2021 · Backend Development

urllib vs requests: Which Python Library Wins for Web Scraping?

This article compares Python's built‑in urllib library with the third‑party requests library, demonstrating their usage through code examples, highlighting differences in request construction, response handling, and practical considerations for web scraping, and concludes with recommendations for choosing the more convenient tool.

HTTPPythonTutorial
0 likes · 6 min read
urllib vs requests: Which Python Library Wins for Web Scraping?
MaGe Linux Operations
MaGe Linux Operations
Oct 11, 2020 · Backend Development

Master Python File Downloads: Requests, Wget, urllib, Boto3 & Asyncio

This tutorial walks you through using various Python modules—including requests, wget, urllib, urllib3, boto3, and asyncio—to download regular files, web pages, Amazon S3 objects, handle redirects, large files, parallel downloads, proxies, and progress bars, all with clear code examples.

Boto3File Downloadasyncio
0 likes · 8 min read
Master Python File Downloads: Requests, Wget, urllib, Boto3 & Asyncio
21CTO
21CTO
Apr 21, 2020 · Fundamentals

Master Python File Downloads: Requests, Wget, urllib, Async & More

This tutorial walks through multiple Python approaches for downloading files—including simple requests and wget calls, handling redirects, large and multi‑file downloads, proxy usage, urllib/urllib3 methods, and asynchronous techniques—providing complete code snippets and practical tips for each scenario.

Pythonasynciofile-download
0 likes · 11 min read
Master Python File Downloads: Requests, Wget, urllib, Async & More
MaGe Linux Operations
MaGe Linux Operations
Dec 25, 2019 · Backend Development

Master Web Crawling in Python: From urllib to requests and Robots.txt

This guide explains the fundamentals of web crawling, covering crawler types, the Robots.txt protocol, Python's urllib and urllib3 modules, the requests library, handling HTTP methods, user‑agents, HTTPS certificates, and practical code examples for extracting data from websites.

Pythonrequestsrobots.txt
0 likes · 18 min read
Master Web Crawling in Python: From urllib to requests and Robots.txt
MaGe Linux Operations
MaGe Linux Operations
Dec 1, 2018 · Backend Development

How to Scrape Baidu Tieba Images with Python and Save Locally

Learn step-by-step how to use Python's urllib and regular expressions to fetch Baidu Tieba web pages, extract image URLs, and download the images to your local machine, including code examples for getHtml and getImg functions and file naming conventions.

regular expressionsurllib
0 likes · 3 min read
How to Scrape Baidu Tieba Images with Python and Save Locally