Tag

lxml

0 views collected around this technical thread.

Code Mala Tang
Code Mala Tang
Apr 19, 2025 · Fundamentals

Master HTML Parsing in Python: BeautifulSoup, lxml, and html.parser Compared

Learn why HTML parsing is essential for web scraping, explore three popular Python libraries—BeautifulSoup, lxml, and the built‑in html.parser—covering installation, core usage, advanced techniques, and a comparative analysis to help you choose the right tool for your project.

BeautifulSoupHTML parsingPython
0 likes · 11 min read
Master HTML Parsing in Python: BeautifulSoup, lxml, and html.parser Compared
Python Programming Learning Circle
Python Programming Learning Circle
Jun 25, 2024 · Backend Development

Web Scraping Anjuke Real Estate Data with Python: A Step‑by‑Step Guide

This article provides a comprehensive Python tutorial for scraping second‑hand housing community data from Anjuke, covering city selection, URL collection, HTML parsing with lxml, data cleaning, CSV export, and full‑city crawling strategies, complete with runnable code examples.

CSVData ExtractionReal Estate
0 likes · 20 min read
Web Scraping Anjuke Real Estate Data with Python: A Step‑by‑Step Guide
Python Programming Learning Circle
Python Programming Learning Circle
Mar 8, 2022 · Backend Development

XPath Basics and Web Scraping with Python lxml: Concepts, Syntax, and Practical Examples

This tutorial explains the fundamental concepts and parsing principles of XPath, shows how to set up the Python lxml environment, demonstrates instantiating etree objects, details XPath expression syntax, and provides multiple real‑world web‑scraping examples with complete code snippets.

HTML parsingWeb Scrapinglxml
0 likes · 9 min read
XPath Basics and Web Scraping with Python lxml: Concepts, Syntax, and Practical Examples
Python Programming Learning Circle
Python Programming Learning Circle
Dec 14, 2021 · Backend Development

Python Web Scraping of China Weather Forecast (7‑Day) Using Requests, lxml, and CSV Export

This tutorial shows how to crawl the China Meteorological Administration website to obtain today's weather and a six‑day forecast, handle Chinese encoding, extract data with XPath, and save the results into a CSV file using Python's requests and lxml libraries.

CSVPythonRequests
0 likes · 5 min read
Python Web Scraping of China Weather Forecast (7‑Day) Using Requests, lxml, and CSV Export
Python Programming Learning Circle
Python Programming Learning Circle
Oct 7, 2021 · Backend Development

Python Web Scraping: Intelligent Pagination for Batch File Download

The guide explains how to use Python libraries such as requests, pandas, lxml, and regex to automatically paginate through a website, extract PDF file names and URLs, create organized folders, and download all files in bulk with minimal manual effort.

AutomationPythonRequests
0 likes · 5 min read
Python Web Scraping: Intelligent Pagination for Batch File Download
Python Programming Learning Circle
Python Programming Learning Circle
Sep 30, 2021 · Backend Development

Python Web Scraper for VIP Anime Collection

This article demonstrates how to build a Python web scraper using requests, lxml, regular expressions, and tqdm to locate, extract, and download video files from a VIP anime website, covering header configuration, XPath parsing, URL reconstruction, and file saving.

PythonRequestsWeb Scraping
0 likes · 6 min read
Python Web Scraper for VIP Anime Collection
Python Programming Learning Circle
Python Programming Learning Circle
Sep 27, 2021 · Backend Development

Python Web Scraping Tutorial: Downloading Images from 3gbizhi.com

This tutorial demonstrates how to use Python's requests and lxml libraries to scrape static web pages, extract image URLs via XPath, convert thumbnail links to full‑size URLs, and download the images to a local folder, providing a complete end‑to‑end example.

Image DownloadRequestsWeb Scraping
0 likes · 5 min read
Python Web Scraping Tutorial: Downloading Images from 3gbizhi.com
Python Programming Learning Circle
Python Programming Learning Circle
Mar 25, 2021 · Backend Development

Downloading Music from NetEase Cloud Music Using Python Web Scraping

This article explains how to use Python's requests and lxml libraries to scrape NetEase Cloud Music's song list, extract song IDs, construct download URLs, and save the audio files locally, while also discussing common obstacles such as anti‑scraping measures and network errors.

NetEasePythonRequests
0 likes · 6 min read
Downloading Music from NetEase Cloud Music Using Python Web Scraping
Python Programming Learning Circle
Python Programming Learning Circle
Dec 19, 2020 · Fundamentals

XPath Basics and lxml Usage in Python

This article introduces the fundamentals of XPath syntax, common rules, and example expressions, then explains how to use the lxml library in Python for HTML/XML parsing, including practical tips and a complete code example for extracting links and text from a sample document.

PythonWeb ScrapingXML
0 likes · 6 min read
XPath Basics and lxml Usage in Python
Python Programming Learning Circle
Python Programming Learning Circle
Jan 7, 2020 · Backend Development

Master Python Web Scraping: From urllib to Scrapy with Real-World Examples

This comprehensive guide walks you through Python web crawling fundamentals, covering request handling, URL encoding, regular expressions, the requests library, XPath parsing, and lxml, complete with code snippets and practical examples to help you build effective scrapers.

RequestsWeb Scrapinglxml
0 likes · 13 min read
Master Python Web Scraping: From urllib to Scrapy with Real-World Examples