Baobao Algorithm Notes
Jan 10, 2022 · Fundamentals
Why a Simple Frequency Count Can Outperform Complex Models for Categorical Features
This article explains how a five‑line pandas script can generate frequency‑based encodings for categorical columns, why such encodings help low‑frequency categories, and how they integrate with feature crossing and industry use cases like ad exposure, PV/UV counting, and fraud detection.
GBDTcategorical encodingdata preprocessing
0 likes · 6 min read
