Tagged articles

urllib

37 articles · Page 1 of 1

Feb 1, 2026 · Backend Development

Extract JD.com Product Data with Python BeautifulSoup: A Step‑by‑Step Guide

This tutorial shows how to build a JD.com search URL, fetch the page with urllib, parse the HTML using BeautifulSoup, and reliably extract each product's name, link, image and price while handling missing image URLs and avoiding regex complexity.

JD.combeautifulsoupdata extraction

0 likes · 5 min read

Extract JD.com Product Data with Python BeautifulSoup: A Step‑by‑Step Guide

Python Crawling & Data Mining

Jan 29, 2026 · Backend Development

How to Scrape JD.com Product Data with Python Regex: A Step‑by‑Step Guide

This tutorial shows how to build a JD.com search URL, encode keywords, fetch the page with Python's urllib, and extract product details using regular expressions, providing code snippets, regex explanations, and sample output for beginners.

JD.comPythonWeb Scraping

0 likes · 5 min read

How to Scrape JD.com Product Data with Python Regex: A Step‑by‑Step Guide

Python Programming Learning Circle

May 12, 2025 · Fundamentals

Basic Python Web Scraping Techniques and Tips

This article introduces beginner-friendly Python web‑scraping methods, covering the simplest urllib/requests approach, adding request headers, inspecting network traffic for hidden data, handling dynamically loaded content with Selenium, and provides links to deeper tutorials for each technique.

Network InspectionPythonSelenium

0 likes · 4 min read

Basic Python Web Scraping Techniques and Tips

Python Programming Learning Circle

Mar 25, 2025 · Backend Development

Comprehensive Python Guide to Download Files from the Web, S3, and Other Sources

This tutorial walks through multiple Python techniques for downloading regular files, web pages, Amazon S3 objects, and other resources, covering basic requests, wget, handling redirects, chunked large‑file downloads, parallel downloads, progress bars, urllib, urllib3, proxy usage, boto3 for S3, and asynchronous downloads with asyncio.

Boto3File DownloadPython

0 likes · 8 min read

Comprehensive Python Guide to Download Files from the Web, S3, and Other Sources

Python Programming Learning Circle

Jan 2, 2025 · Backend Development

Python File Download Techniques: Requests, wget, urllib, urllib3, Boto3, asyncio, and More

This tutorial teaches how to download files in Python using various modules such as requests, wget, urllib, urllib3, boto3, and asyncio, covering basic downloads, handling redirects, chunked large-file downloads, parallel batch downloads, progress bars, proxy usage, and asynchronous techniques.

Boto3File DownloadPython

0 likes · 8 min read

Python File Download Techniques: Requests, wget, urllib, urllib3, Boto3, asyncio, and More

Python Programming Learning Circle

Oct 21, 2024 · Backend Development

Python File Download Tutorial: Using requests, wget, urllib, boto3, and asyncio

This tutorial teaches how to download files in Python using various modules such as requests, wget, urllib, urllib3, boto3 for S3, and asyncio, covering basic downloads, redirects, chunked large‑file handling, parallel downloads, progress bars, proxy usage, and asynchronous techniques.

Boto3File DownloadPython

0 likes · 8 min read

Python File Download Tutorial: Using requests, wget, urllib, boto3, and asyncio

Python Programming Learning Circle

Oct 10, 2024 · Backend Development

Python Web Scraping Techniques: Requests, Proxies, Cookies, Headers, Captcha, Gzip, and Multithreading

This article outlines essential Python web‑scraping techniques, covering basic GET/POST requests, proxy usage, cookie handling, header manipulation to mimic browsers, simple captcha solutions, gzip compression handling, and multithreaded crawling with a thread‑pool template, providing practical code examples for each step.

Pythoncookiesgzip

0 likes · 5 min read

Python Web Scraping Techniques: Requests, Proxies, Cookies, Headers, Captcha, Gzip, and Multithreading

Python Programming Learning Circle

Apr 11, 2024 · Backend Development

Comprehensive Guide to Downloading Files with Python Using requests, wget, urllib, urllib3, Boto3, asyncio and More

This tutorial explains how to download regular files, web pages, Amazon S3 objects, and other resources in Python by covering multiple modules such as requests, wget, urllib, urllib3, Boto3, and asyncio, and demonstrates handling redirects, large files, parallel downloads, progress bars, proxies, and asynchronous execution.

Boto3File Downloadasyncio

0 likes · 8 min read

Comprehensive Guide to Downloading Files with Python Using requests, wget, urllib, urllib3, Boto3, asyncio and More

Test Development Learning Exchange

Jul 13, 2023 · Fundamentals

An Introduction to Python's urllib Module and Its Submodules with Example Code

This article introduces Python's urllib module, explains its main submodules—urllib.request, urllib.parse, urllib.error, and urllib.robotparser—and provides practical code examples demonstrating URL opening, parsing, error handling, and robots.txt processing for interface automation tasks in Python.

Error handlingNetworkPython

0 likes · 4 min read

An Introduction to Python's urllib Module and Its Submodules with Example Code

Python Programming Learning Circle

Mar 13, 2023 · Fundamentals

Downloading Files in Python Using requests, wget, urllib, boto3, and asyncio

This tutorial demonstrates multiple Python techniques for downloading files—including simple requests.get calls, wget and urllib modules, handling redirects and large files with chunked streaming, parallel batch downloads, progress bars, proxy support, S3 retrieval via boto3, and asynchronous downloads with asyncio—providing a comprehensive guide for developers.

Boto3File Downloadasyncio

0 likes · 8 min read

Downloading Files in Python Using requests, wget, urllib, boto3, and asyncio

Python Programming Learning Circle

Nov 12, 2022 · Backend Development

Comprehensive Guide to Python urllib Library: Modules, Functions, and Usage Examples

This article provides a detailed tutorial on Python's urllib library, covering its main modules (request, error, parse, robotparser), key functions and classes, code examples for URL fetching, parsing, encoding, and handling robots.txt, making it a practical resource for backend developers and web scrapers.

networkingurlliburlparse

0 likes · 13 min read

Comprehensive Guide to Python urllib Library: Modules, Functions, and Usage Examples

Python Programming Learning Circle

Oct 12, 2022 · Backend Development

Comprehensive Guide to Downloading Files in Python Using Requests, wget, urllib, urllib3, Boto3, and asyncio

This tutorial walks through multiple Python approaches for downloading files—including simple requests.get calls, the wget module, handling redirects, chunked large‑file downloads, parallel batch downloads, proxy usage with urllib, S3 retrieval via Boto3, and asynchronous fetching with asyncio—providing code examples and best‑practice tips.

Boto3File Downloadasyncio

0 likes · 8 min read

Comprehensive Guide to Downloading Files in Python Using Requests, wget, urllib, urllib3, Boto3, and asyncio

Python Crawling & Data Mining

Oct 5, 2022 · Backend Development

How to Decode URL‑Encoded Strings in Python Web Scraping

An in‑depth guide shows how to decode URL‑encoded strings encountered during Python web scraping, explains the difference between two encoding formats, and provides ready‑to‑run urllib code that prints the original Chinese characters, helping developers troubleshoot similar crawling issues.

DecodeURL encodingWeb Scraping

0 likes · 4 min read

How to Decode URL‑Encoded Strings in Python Web Scraping

Python Crawling & Data Mining

Sep 15, 2022 · Fundamentals

How to Decode URL Parameters in Python Web Crawlers: A Step‑by‑Step Guide

This article explains how to use Python's urllib library to decode URL‑encoded strings encountered in web crawling, walks through a real example with code, and shows the resulting decoded URL, helping developers troubleshoot common encoding issues.

URL encodingstring decodingurllib

0 likes · 3 min read

How to Decode URL Parameters in Python Web Crawlers: A Step‑by‑Step Guide

Python Crawling & Data Mining

Aug 31, 2022 · Backend Development

Master Web Crawling in Python: From Data Fetching to Image Download

This article explains how to build a Python web crawler that fetches HTML pages with urllib, parses them using BeautifulSoup to extract image URLs, and downloads the images, covering all three stages with complete code examples and parser options.

Image DownloadWeb Scrapingbeautifulsoup

0 likes · 7 min read

Master Web Crawling in Python: From Data Fetching to Image Download

Python Crawling & Data Mining

Jun 3, 2022 · Backend Development

How to Scrape Real-Time Stock Data from Eastmoney with Python

This article demonstrates two Python approaches—using high-level requests and low-level HTTPConnection—to fetch intraday stock data from Eastmoney, providing complete code examples, URL discovery steps, and practical tips for data‑mining enthusiasts.

EastmoneyHttpClientPython

0 likes · 7 min read

How to Scrape Real-Time Stock Data from Eastmoney with Python

Python Programming Learning Circle

May 23, 2022 · Backend Development

Simulating Zhihu Login with Python Using urllib and Fiddler

This article demonstrates how to automate Zhihu login on Windows by analyzing network traffic with Fiddler, extracting required parameters, and implementing a Python script that builds HTTP requests using urllib2, handles cookies, captcha retrieval, and logs the results, complete with sample code and execution screenshots.

FiddlerHTTPLogin Automation

0 likes · 8 min read

Simulating Zhihu Login with Python Using urllib and Fiddler

Python Programming Learning Circle

Nov 11, 2021 · Fundamentals

Python Techniques for Crawling TXT, CSV, PDF, and Word Documents

This article introduces Python 3 methods for retrieving various document types—including TXT, CSV, PDF, and Word files—using urllib, regular expressions, and file‑specific processing steps, providing practical code examples and workflow guidance for building effective web crawlers.

File ParsingPythonWeb Scraping

0 likes · 3 min read

Python Techniques for Crawling TXT, CSV, PDF, and Word Documents

Python Crawling & Data Mining

May 19, 2021 · Backend Development

urllib vs requests: Which Python Library Wins for Web Scraping?

This article compares Python's built‑in urllib library with the third‑party requests library, demonstrating their usage through code examples, highlighting differences in request construction, response handling, and practical considerations for web scraping, and concludes with recommendations for choosing the more convenient tool.

HTTPPythontutorial

0 likes · 6 min read

urllib vs requests: Which Python Library Wins for Web Scraping?

Python Programming Learning Circle

Apr 30, 2021 · Fundamentals

Downloading Files with Python: requests, wget, urllib, urllib3, boto3, and asyncio

This tutorial explains how to download files, webpages, and Amazon S3 objects using Python, covering the requests and wget modules, handling redirects, chunked and parallel downloads, progress bars, proxy support, urllib/urllib3 usage, and asynchronous downloading with asyncio.

Boto3File DownloadPython

0 likes · 8 min read

Downloading Files with Python: requests, wget, urllib, urllib3, boto3, and asyncio

Python Programming Learning Circle

Apr 15, 2021 · Backend Development

Scraping Stock Data with Python: From Extracting Stock Codes to Saving Results in Excel

This tutorial guides readers through analyzing web pages, using Python to write network programs, applying regular expressions, and operating Excel to scrape all listed company stock data for a specified date range, save it by stock code, and optionally store results in databases.

PythonStock DataWeb Scraping

0 likes · 6 min read

Scraping Stock Data with Python: From Extracting Stock Codes to Saving Results in Excel

Python Programming Learning Circle

Apr 12, 2021 · Fundamentals

Python Web Scraping Tutorial: Extracting Fast Track 100 Companies with BeautifulSoup

This tutorial walks through using Python's urllib and BeautifulSoup libraries to fetch, parse, clean, and export the Fast Track 100 company table into a CSV file, covering installation, page inspection, element extraction, data cleaning, link handling, and file writing.

Pythonbeautifulsoupurllib

0 likes · 10 min read

Python Web Scraping Tutorial: Extracting Fast Track 100 Companies with BeautifulSoup

Python Programming Learning Circle

Apr 10, 2021 · Backend Development

Simple Python Web Scraping with urllib and Beautiful Soup

This tutorial demonstrates how to use Python's urllib module to simulate browser requests, parse HTML with Beautiful Soup, extract text and image URLs, and store the scraped data locally using file I/O and the with statement, providing complete code examples.

file-ioimage-downloadingurllib

0 likes · 10 min read

Simple Python Web Scraping with urllib and Beautiful Soup

Python Crawling & Data Mining

Dec 12, 2020 · Fundamentals

Master Python’s urllib: From Basics to Advanced Web Scraping

Learn how to use Python’s built-in urllib library for web requests, handling GET/POST, adding headers, managing proxies, processing cookies, handling errors, parsing URLs, and respecting robots.txt, with clear code examples and a practical case of scraping a novel site.

HTTP requestsPythoncookies

0 likes · 12 min read

Master Python’s urllib: From Basics to Advanced Web Scraping

MaGe Linux Operations

Oct 11, 2020 · Backend Development

Master Python File Downloads: Requests, Wget, urllib, Boto3 & Asyncio

This tutorial walks you through using various Python modules—including requests, wget, urllib, urllib3, boto3, and asyncio—to download regular files, web pages, Amazon S3 objects, handle redirects, large files, parallel downloads, proxies, and progress bars, all with clear code examples.

Boto3File Downloadasyncio

0 likes · 8 min read

Master Python File Downloads: Requests, Wget, urllib, Boto3 & Asyncio

Python Crawling & Data Mining

Sep 10, 2020 · Backend Development

How to Query Chinese Courier Tracking Info with Python and Kuaidi100 API

This tutorial shows how to use Python's urllib and json libraries to call the Kuaidi100 API, retrieve real‑time logistics data for various Chinese courier companies, and display the tracking timeline, while explaining how to discover the correct request URL via browser dev tools.

APIPythonWeb Scraping

0 likes · 6 min read

How to Query Chinese Courier Tracking Info with Python and Kuaidi100 API

21CTO

Apr 21, 2020 · Fundamentals

Master Python File Downloads: Requests, Wget, urllib, Async & More

This tutorial walks through multiple Python approaches for downloading files—including simple requests and wget calls, handling redirects, large and multi‑file downloads, proxy usage, urllib/urllib3 methods, and asynchronous techniques—providing complete code snippets and practical tips for each scenario.

Pythonasynciofile-download

0 likes · 11 min read

Master Python File Downloads: Requests, Wget, urllib, Async & More

Python Programming Learning Circle

Jan 7, 2020 · Backend Development

Master Python Web Scraping: From urllib to Scrapy with Real-World Examples

This comprehensive guide walks you through Python web crawling fundamentals, covering request handling, URL encoding, regular expressions, the requests library, XPath parsing, and lxml, complete with code snippets and practical examples to help you build effective scrapers.

XPathlxmlregex

0 likes · 13 min read

Master Python Web Scraping: From urllib to Scrapy with Real-World Examples

MaGe Linux Operations

Dec 25, 2019 · Backend Development

Master Web Crawling in Python: From urllib to requests and Robots.txt

This guide explains the fundamentals of web crawling, covering crawler types, the Robots.txt protocol, Python's urllib and urllib3 modules, the requests library, handling HTTP methods, user‑agents, HTTPS certificates, and practical code examples for extracting data from websites.

Pythonrequestsrobots.txt

0 likes · 18 min read

Master Web Crawling in Python: From urllib to requests and Robots.txt

MaGe Linux Operations

Aug 10, 2019 · Backend Development

Master Web Scraping in Python: From Basics to Bypassing Anti‑Scraping

Learn how to start web scraping with Python by mastering the three core steps—fetching, analyzing, and storing data—using urllib and requests, handling login, evading anti‑scraping measures like user‑agents and IP proxies, and saving results to JSON, CSV, or MongoDB.

PythonSeleniumanti-scraping

0 likes · 9 min read

Master Web Scraping in Python: From Basics to Bypassing Anti‑Scraping

MaGe Linux Operations

Dec 1, 2018 · Backend Development

How to Scrape Baidu Tieba Images with Python and Save Locally

Learn step-by-step how to use Python's urllib and regular expressions to fetch Baidu Tieba web pages, extract image URLs, and download the images to your local machine, including code examples for getHtml and getImg functions and file naming conventions.

regular expressionsurllib

0 likes · 3 min read

How to Scrape Baidu Tieba Images with Python and Save Locally

MaGe Linux Operations

Sep 19, 2018 · Backend Development

How to Crawl Baidu Tieba Images with Python and Save Locally

This tutorial explains how to use Python's urllib module to fetch a Baidu Tieba page, apply regular expressions to extract image URLs, and then download and rename those images to your local directory.

Pythonimage-downloadregex

0 likes · 3 min read

How to Crawl Baidu Tieba Images with Python and Save Locally

Python Crawling & Data Mining

Jan 17, 2018 · Backend Development

How to Scrape JD.com Product Data with Python Regex: A Step‑by‑Step Guide

This tutorial shows how to build a keyword‑driven web crawler for JD.com using Python's urllib for URL encoding and opening, combined with powerful regular expressions to accurately extract product information such as dog food listings, and explains how to extend the scraper for multi‑page data collection.

JD.comPythonregex

0 likes · 5 min read

AI Large-Model Wave and Transformation Guide

Nov 23, 2017 · Backend Development

How to Build a Simple Python Spider to Download Images from Baidu Tieba

This tutorial walks through using Python's urllib and regular expressions to crawl a Baidu Tieba page, extract all .jpg image URLs, and download each image locally with a sequential naming scheme.

PythonWeb Scrapingimage-downloader

0 likes · 6 min read

How to Build a Simple Python Spider to Download Images from Baidu Tieba

MaGe Linux Operations

Apr 26, 2017 · Backend Development

How to Crawl Baidu Tieba Images with Python and Save Them Locally

Learn to use Python's urllib and regular expressions to fetch Baidu Tieba web pages, extract image URLs, and download the images to your local machine, with step-by-step code examples and explanations for beginners tackling web scraping challenges.

Image DownloadPythonregex

0 likes · 4 min read

How to Crawl Baidu Tieba Images with Python and Save Them Locally

MaGe Linux Operations

Mar 22, 2017 · Backend Development

Mastering Python HTTP Error Handling: URLError and HTTPError Explained

The article explains how Python's urllib2 raises URLError and HTTPError, details their causes and attributes, and presents two practical approaches for catching and processing these HTTP exceptions to build more robust networked applications.

exception-handlinghttp-errorurllib

0 likes · 6 min read

Mastering Python HTTP Error Handling: URLError and HTTPError Explained

360 Quality & Efficiency

Feb 22, 2017 · Backend Development

Using Python urllib and urllib2 for Data Transfer, Headers, Cookies, and Proxy Configuration

This article explains how Python's urllib and urllib2 modules differ and demonstrates practical techniques for sending HTTP requests, encoding query strings, setting custom headers, handling JSON payloads, managing cookies, and configuring proxies using Request objects and build_opener handlers.

HTTPHeadersPython

0 likes · 4 min read

Using Python urllib and urllib2 for Data Transfer, Headers, Cookies, and Proxy Configuration