Tagged articles
16 articles
Geek Labs
Apr 2, 2026 · Industry Insights

5 Must‑Try Open‑Source Tools: AI Writing, CLI Scraper, Visual OpenClaw, and More

The article showcases five practical open‑source projects—including a flip‑board TV simulator, a high‑speed Rust CLI scraper, a Brazilian public‑data MCP server, a visual OpenClaw client, and an AI‑powered WeChat writing workflow—detailing their features, benchmarks, installation steps, and ideal use cases.

AI · CLI · data-scraping
0 likes · 12 min read
Old Meng AI Explorer
Jan 27, 2026 · Artificial Intelligence

Three Must‑Try Open‑Source AI Tools for Data Mining, PPT Creation, and Video Generation

In the era of abundant AI utilities, this article highlights three recently popular open‑source projects—Spider_XHS for comprehensive Xiaohongshu data collection and automated posting, PPTAgent for one‑click, multi‑scene PowerPoint generation, and Code2Video for code‑driven, high‑quality video creation—detailing their core features, deployment steps, and GitHub links.

AI tools · PPT automation · Video Generation
0 likes · 7 min read
Java Architect Essentials
Dec 24, 2024 · Information Security

Beijing Chaoyang Court Rules Unfair Competition in Navigation Data Scraping, Awards 12.5 Million Yuan Compensation

The Chaoyang District People's Court in Beijing found that a technology company had illegally scraped the "congestion delay index" from a navigation map and used it for commercial purposes. The court ordered the company to stop the infringement and pay a total of 12.5 million yuan in damages, highlighting the legal protection of competitive data rights.

China · data-scraping · information security
0 likes · 5 min read
Test Development Learning Exchange
Jan 16, 2024 · Artificial Intelligence

Python Code Samples for Data Scraping and Analysis Across Various Business Scenarios

This article presents a collection of Python code examples demonstrating how to scrape, process, visualize, and analyze data from news sites, social media, stock markets, e‑commerce, web traffic, text, images, and more, covering tasks such as clustering, time‑series forecasting, and sentiment analysis.

Python · Web Scraping · data analysis
0 likes · 12 min read
Python Crawling & Data Mining
Aug 24, 2022 · Backend Development

Why Your Python Web Crawler Returns Wrong Data and How to Fix It

This article examines a Python web‑crawler issue where the original script returns incorrect results, explains the underlying cause related to mutable data handling, and provides a corrected version of the code that successfully retrieves the desired price data from the Xinfadi website.

data-scraping · mutable objects · requests
0 likes · 6 min read
Architecture Digest
Feb 19, 2022 · Information Security

Case Study: Illegal Web Crawling and Criminal Conviction in China

This article recounts how a corporate web‑crawling tool designed to automate housing‑loan data collection overloaded a municipal residence‑permit system, triggered a large‑scale denial‑of‑service attack, and led to the CTO and programmer being prosecuted for damaging a computer information system.

Web Crawling · computer crime · cyberlaw
0 likes · 8 min read
Java High-Performance Architecture
Feb 18, 2022 · Information Security

When Web Crawlers Cross the Line: A Legal Case Study on Unauthorized Data Scraping

This article recounts how a Chinese fintech company's automated web‑crawler, built to query a municipal residence‑permit system, overloaded the server, triggered police action, and led to criminal charges for the CTO and programmer. It also offers lessons on the legal risks of large‑scale data scraping.

Web Crawling · cloud computing · computer crime
0 likes · 9 min read
21CTO
Feb 5, 2022 · Information Security

When Web Crawlers Turn Criminal: A Real‑World Data Scraping Case Study

This article recounts how a fintech company's automated web‑scraping tool overloaded a municipal residence‑permit system, leading to massive data leakage and the prosecution of its CTO and programmer. It also highlights the severe legal risks of unchecked crawling practices.

Web Crawling · computer crime · data-scraping
0 likes · 9 min read
ITPUB
Jun 17, 2021 · Information Security

How Illegal Web Crawlers Stole Over 1 Billion Chinese Users’ Data and Got Sent to Prison

A recent Chinese court case reveals that a university graduate used a custom web‑crawler to harvest more than 1.18 billion Taobao user records, which were then sold to a partner who ran fraudulent WeChat groups, leading to both perpetrators’ conviction for violating personal information protection laws.

China · Web Crawler · data-scraping
0 likes · 10 min read
21CTO
Nov 16, 2019 · Fundamentals

From Early Crawlers to ByteDance: A History of Web Scraping

This article traces the evolution of web crawlers—from early Perl scripts to modern ByteDance agents—explaining their role in search engines, business models, anti‑crawling measures, and the impact on content creation and competition.

Web Crawling · content aggregation · data-scraping
0 likes · 6 min read
Alibaba Cloud Developer
Jan 3, 2018 · Backend Development

How to Build a DingTalk Bot for Meican Meal Ordering and Data Analysis

This article walks through the process of scraping Meican's ordering data via its APIs, analyzing restaurant popularity, and creating a Node.js‑based DingTalk robot that automatically notifies users when it’s time to place their lunch orders, complete with code snippets and visual insights.

DingTalk bot · Meican API · Node.js
0 likes · 10 min read
Baidu Intelligent Testing
Jul 25, 2017 · Mobile Development

Using UIAutomator for Mobile App Data Scraping and Quality Evaluation in K12 Education Apps

This article describes how to employ UIAutomator to automate data extraction from K12 education mobile apps, handling device identity spoofing, image input normalization, and UI control reverse‑engineering to overcome encryption, token checks, and non‑standard input challenges.

Image Normalization · K12 Education · UI Reverse Engineering
0 likes · 5 min read
Ctrip Technology
May 22, 2017 · Information Security

The Dark Side of Web Crawling and Anti‑Crawling: Industry Realities and Technical Strategies

This article examines the hidden, often unglamorous world of web crawling and anti‑crawling, revealing why companies deploy aggressive scraping and defensive measures, the technical arms race between crawlers and defenders, the impact on engineers' careers, and future trends in this contested space.

Web Crawling · anti‑crawling · data-scraping
0 likes · 21 min read