Python Crawling & Data Mining
Author

Life's short, I code in Python. This channel shares Python web crawling, data mining, analysis, processing, visualization, automated testing, DevOps, big data, AI, cloud computing, machine learning tools, resources, news, technical articles, tutorial videos and learning materials. Join us!

1.5k articles · 0 likes · 2.1k views · 0 comments
Recent Articles

Latest from Python Crawling & Data Mining

Jun 11, 2026 · Operations

Real Fan Request: Python Automation for Bulk Bill-of-Lading Watermark Replacement (Part 1)

A fan asked how to automate, with Python, the replacement of header watermarks in dozens of foreign-trade bill-of-lading .doc files. The article explains why manual editing is inefficient, describes converting the files from .doc to .docx and applying python-docx, and previews a detailed multi-part walkthrough of the implementation challenges.

DOCX · Python · automation
0 likes · 4 min read
Jun 10, 2026 · Artificial Intelligence

Automating Validation of 300,000 Records with Python + AI to Detect Errors and Dirty Data

Even at 99% accuracy, a 300k-row dataset still contains thousands of errors (1% of 300,000 rows is 3,000). The author therefore builds a Python and AI pipeline that preprocesses images, performs high-precision OCR, merges the data, applies custom validation rules, and automatically generates an error report, greatly reducing manual effort.

AI · Data Validation · OCR
0 likes · 6 min read