Tag

page classification

1 view collected around this technical thread.

UC Tech Team
Nov 5, 2018 · Artificial Intelligence

News Page Identification Using Machine Learning: Feature Engineering, Model Selection, and Evaluation

To distinguish news pages from other web page types, this study frames the task as a binary classification problem, extracts 19 engineered features from each page's HTML, and compares logistic regression (LR) and SVM models under cross-validation. LR trained with Newton's method achieves over 90% precision, recall, and F1-score.
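As a rough illustration of the pipeline summarized above, the following minimal sketch counts a few page features from raw HTML and computes precision, recall, and F1 for binary predictions. The three features shown (link count, paragraph count, visible text length) and the names `FeatureCounter`, `extract_features`, and `precision_recall_f1` are assumptions made for illustration; the summary does not list the study's actual 19 features or its code.

```python
from html.parser import HTMLParser


class FeatureCounter(HTMLParser):
    """Counts a few illustrative page features (hypothetical, not the study's 19)."""

    def __init__(self):
        super().__init__()
        self.links = 0
        self.paragraphs = 0
        self.text_chars = 0

    def handle_starttag(self, tag, attrs):
        # Tag names arrive lowercased from HTMLParser.
        if tag == "a":
            self.links += 1
        elif tag == "p":
            self.paragraphs += 1

    def handle_data(self, data):
        # Count characters of text content, ignoring surrounding whitespace.
        self.text_chars += len(data.strip())


def extract_features(html):
    """Return [link_count, paragraph_count, text_length] for one page."""
    parser = FeatureCounter()
    parser.feed(html)
    parser.close()
    return [parser.links, parser.paragraphs, parser.text_chars]


def precision_recall_f1(y_true, y_pred):
    """Binary metrics, with label 1 meaning 'news page' (an assumed convention)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1
```

In a setup like this, each page's feature vector would be passed to a classifier such as LR or an SVM, and the metrics would be averaged over the cross-validation folds.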

Feature Engineering · SVM · binary classification
13 min read