Fundamentals 12 min read

User Profile Tagging: Construction, Feature Processing, and Evaluation

This article systematically explains the fundamentals of user profile tags, covering basic attribute tags, business and strategy-oriented tags, detailed feature processing methods such as anomaly handling, time decay, and smoothing, and provides evaluation metrics for cohesion and stability, supplemented by a practical Q&A session.

DataFunSummit
DataFunSummit
DataFunSummit
User Profile Tagging: Construction, Feature Processing, and Evaluation

The presentation begins with an overview of user profile tags, emphasizing the growing need for deep user understanding in digital transformation. It introduces three main sections: (1) basic attribute tags—attributes like gender, age, OS, and city—constructed via user input, event tracking, model prediction, or third‑party data, and applied in daily analysis and modeling.

(2) Business‑oriented tags are closely linked to KPI goals, categorized into strong and weak KPI associations. Construction methods include KPI‑based distance definitions and composite behavior calculations, while usage focuses on monitoring KPI progress, drilling down user groups, and implementing differentiated operational strategies.

(3) Strategy‑oriented tags target specific interventions (e.g., coupons) and are evaluated using uplift models, repurchase cycle predictions, or binary classifiers to maximize ROI.

The article then details feature processing techniques essential for reliable tags: anomaly detection and filling (using box plots, AVF, cap/floor percentiles), handling missing values based on metric nature, time‑decay weighting aligned with RFM dimensions (Recency, Frequency, Monetary), and logarithmic smoothing to mitigate long‑tail effects.

Evaluation of tag quality is discussed through two primary criteria: cohesion (measured by Silhouette Coefficient) and stability (measured by Coefficient of Variation). High cohesion indicates similar users within a segment, while low variation across time reflects stable segment behavior.

A Q&A segment addresses practical concerns such as calculating cohesion for different activity levels, defining activity thresholds, computational complexity of time decay, perspectives for basic business strategies, and scenarios where tag evaluation is applied.

The session concludes with acknowledgments to the speaker, Wu Zihua from ByteDance Data Science, and the organizing team.

Feature Engineeringdata analysisuser profilingRFM modelKPI alignmenttag evaluation
DataFunSummit
Written by

DataFunSummit

Official account of the DataFun community, dedicated to sharing big data and AI industry summit news and speaker talks, with regular downloadable resource packs.

0 followers
Reader feedback

How this landed with the community

login Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.