Tagged articles
9 articles
Page 1 of 1
FunTester
FunTester
Nov 18, 2022 · Backend Development

Master Java Web Crawling: From Data Scraping to Image Storage

This guide walks beginners through building a Java web crawler that fetches bestseller book cover images, covering data scraping, HTML parsing with jsoup or regex, and saving images locally, illustrated step‑by‑step with code examples and a tiered learning roadmap.

Image DownloadWeb Crawlingbackend-development
0 likes · 5 min read
Master Java Web Crawling: From Data Scraping to Image Storage
Programmer DD
Programmer DD
Dec 28, 2021 · Backend Development

Master Web Scraping with Java: Getting Started with Jsoup

This article introduces Jsoup, an open‑source Java library for extracting and manipulating HTML, explains its key features such as DOM traversal and CSS selectors, and provides a concise code example that fetches Wikipedia headlines, helping developers automate web data collection.

BackendData ExtractionWeb Scraping
0 likes · 3 min read
Master Web Scraping with Java: Getting Started with Jsoup
Java Architecture Diary
Java Architecture Diary
Dec 2, 2020 · Information Security

How to Prevent XSS Attacks with mica-xss: A Step-by-Step Guide

This article explains what XSS attacks are, demonstrates simple exploitation scenarios, and provides a comprehensive solution using the mica-xss library with Spring MVC, including dependency setup, request filtering, testing methods, and the underlying Jsoup‑based implementation.

Web SecurityXSSinput validation
0 likes · 7 min read
How to Prevent XSS Attacks with mica-xss: A Step-by-Step Guide
Programmer DD
Programmer DD
Dec 7, 2019 · Backend Development

Why Choose Java Over Python for Web Crawling? A Practical Guide

The article shares the author's journey from manual data collection to mastering Java web crawlers, explains why Java is preferred over Python, outlines the five-step crawling workflow, covers essential Java basics, HTTP fundamentals, and provides code examples for URL queuing, time parsing, and timestamp conversion.

Data ExtractionHTTPWeb Crawling
0 likes · 12 min read
Why Choose Java Over Python for Web Crawling? A Practical Guide
Java Captain
Java Captain
Aug 4, 2018 · Backend Development

Java Web Crawler Framework with JD Book Data Extraction Example

This article introduces a modular Java web crawler framework built with Maven, explains its package structure (db, model, util, parse, main), details the data flow from URL fetching to HTML parsing using HttpClient and Jsoup, and provides a complete example that extracts book information from JD.com and stores it in MySQL.

BackendHttpClientWebCrawler
0 likes · 12 min read
Java Web Crawler Framework with JD Book Data Extraction Example