Machine Learning-Based Anomaly Detection for Core Business Metrics
The paper proposes a containerized, machine‑learning framework that fuses rule‑based and XGBoost‑driven anomaly detection to monitor daily active users on a cloud music platform, achieving 89 % recall, 81 % precision and up to 74 % recall improvement over traditional threshold methods, while outlining future model refinement and broader metric applicability.
This paper presents a machine learning-based solution for detecting anomalies in core business metrics, specifically focusing on daily active users (DAU) for a cloud music platform. The project addresses limitations in traditional threshold-based monitoring systems that struggle with dynamic business changes and varying seasonal patterns.
The research evaluates multiple anomaly detection algorithms including 3-sigma, Holt-Winters, and XGBoost, comparing their performance using metrics like recall, precision, accuracy, and F1-score. Results show XGBoost with multiple features achieves superior performance with 89% recall and 81% precision, significantly outperforming traditional methods.
The solution integrates machine learning models with existing data quality monitoring systems through a containerized, configurable architecture. The approach combines rule-based detection with model predictions using a fusion strategy to improve robustness. The system demonstrates 74% improvement in recall, 40% in precision, and 20% in accuracy compared to baseline methods.
Future work includes model iteration for reducing false positives, expanding applicability to more business scenarios, and developing a staged approach for handling new metrics with limited historical data. The project emphasizes the evolution from digital to intelligent data systems and the importance of open-source contributions to data intelligence.
Signed-in readers can open the original source through BestHub's protected redirect.
This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactand we will review it promptly.
How this landed with the community
Was this worth your time?
0 Comments
Thoughtful readers leave field notes, pushback, and hard-won operational detail here.
