UC Tech Team
Nov 5, 2018 · Artificial Intelligence
News Page Identification Using Machine Learning: Feature Engineering, Model Selection, and Evaluation
To accurately distinguish news pages from other web page types, this study formulates the task as a binary classification problem, extracts 19 engineered features from HTML, evaluates logistic regression and SVM models with cross‑validation, and achieves over 90% precision, recall, and F1‑score using LR with Newton method.
Feature EngineeringSVMbinary classification
0 likes · 13 min read