Tagged articles
47 articles
Page 1 of 1
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Apr 24, 2026 · Artificial Intelligence

AI‑Powered Smart Shrimp Farming: 30‑Day Conversational Practice

This article details a 30‑day AI‑driven shrimp‑farming project built on Alibaba Cloud's Bailei platform, describing data sources, system architecture, model development, daily performance metrics, cost savings, and validation results that demonstrate how AI can replace expert judgment in aquaculture.

AIDockerOpenClaw
0 likes · 16 min read
AI‑Powered Smart Shrimp Farming: 30‑Day Conversational Practice
Woodpecker Software Testing
Woodpecker Software Testing
Mar 18, 2026 · Operations

How Test Experts Can Turn Prediction Analytics into Real‑World Impact

The article explains how test prediction analytics can replace intuition with data‑driven risk signals, detailing high‑ROI use cases, data governance practices, model selection (favoring XGBoost), and a three‑layer deployment architecture that integrates predictions into CI/CD workflows, backed by concrete results from finance and e‑commerce projects.

Data‑Driven TestingKubernetesXGBoost
0 likes · 8 min read
How Test Experts Can Turn Prediction Analytics into Real‑World Impact
Old Zhang's AI Learning
Old Zhang's AI Learning
Mar 5, 2026 · Artificial Intelligence

Timber: The “Ollama” for Traditional Machine Learning Models

Timber is a multi‑pass compiler that transforms classic ML models such as XGBoost and LightGBM into zero‑dependency C99 binaries, offering microsecond‑level inference latency, HTTP‑compatible serving, and substantial performance gains over Python runtimes, making it ideal for high‑throughput, low‑latency production scenarios.

LightGBMML compilerModel Deployment
0 likes · 8 min read
Timber: The “Ollama” for Traditional Machine Learning Models
Bighead's Algorithm Notes
Bighead's Algorithm Notes
Nov 25, 2025 · Artificial Intelligence

FinSentLLM: A Multi‑LLM Framework for Financial Sentiment Prediction

FinSentLLM integrates multiple LLM experts with structured financial semantic signals, achieving 3‑6% higher accuracy and F1 on the Financial PhraseBank compared to baselines, while DCC‑GARCH and Johansen cointegration analyses confirm a statistically significant long‑term co‑movement between the predicted sentiment signals and stock market dynamics.

DCC-GARCHFinSentLLMFinancial Sentiment Analysis
0 likes · 12 min read
FinSentLLM: A Multi‑LLM Framework for Financial Sentiment Prediction
Data Party THU
Data Party THU
Aug 18, 2025 · Artificial Intelligence

Unlock XGBoost Performance: Master the Core Parameters

This article provides a detailed, visual guide to XGBoost's most important hyper‑parameters—such as max_depth, min_child_weight, learning_rate, gamma, subsample, colsample_bytree, scale_pos_weight, alpha, and lambda—explaining how each influences tree complexity, regularization, and model generalization, and offering practical examples for effective tuning.

Model OptimizationRegularizationXGBoost
0 likes · 12 min read
Unlock XGBoost Performance: Master the Core Parameters
Alibaba Cloud Developer
Alibaba Cloud Developer
Jul 7, 2025 · Artificial Intelligence

How AI and Machine Learning Transform Investment Budget Forecasting

Based on a public‑cloud client’s real‑world project, this article details how combining AI large‑model prompting with machine‑learning techniques—first pure large‑model forecasts, then local weighted linear regression, and finally XGBoost—enables automated, accurate investment budget prediction and allocation, reducing analyst workload and scaling to millions of daily calls.

AIInvestment ForecastingLocally Weighted Regression
0 likes · 14 min read
How AI and Machine Learning Transform Investment Budget Forecasting
Bilibili Tech
Bilibili Tech
Feb 18, 2025 · Artificial Intelligence

Algorithmic Empowerment of Bilibili Streaming: VOD Transcoding Decision, Resource Estimation, and Live Comment Semantic Analysis

The article details how Bilibili leverages AI algorithms—including XGBoost, statistical rules, XDeepFM, and fine‑tuned SBERT—to optimize VOD transcoding decisions, estimate compute resources and processing time, and analyze live comments, thereby boosting streaming efficiency, utilization, and user experience.

AITranscoding OptimizationXGBoost
0 likes · 19 min read
Algorithmic Empowerment of Bilibili Streaming: VOD Transcoding Decision, Resource Estimation, and Live Comment Semantic Analysis
Sohu Tech Products
Sohu Tech Products
Jun 5, 2024 · Artificial Intelligence

How Treelite Supercharges Tree Model Inference by Up to 6×

This article introduces Treelite, an open‑source library that compiles XGBoost, LightGBM, and scikit‑learn tree models into optimized shared libraries, explains its branch‑prediction and comparison‑simplification techniques, and provides step‑by‑step Python examples showing significant inference speed gains across different batch sizes.

LightGBMModel DeploymentPython
0 likes · 6 min read
How Treelite Supercharges Tree Model Inference by Up to 6×
Python Programming Learning Circle
Python Programming Learning Circle
Apr 26, 2024 · Artificial Intelligence

Five Essential Python Libraries for Machine Learning Engineers

This article introduces five essential Python libraries—MLflow, Streamlit, FastAPI, XGBoost, and ELI5—that every junior or intermediate machine‑learning engineer and data scientist should master to streamline experiment tracking, build interactive web apps, deploy models efficiently, achieve fast accurate predictions, and improve model interpretability.

ELI5FastAPIPython
0 likes · 8 min read
Five Essential Python Libraries for Machine Learning Engineers
Didi Tech
Didi Tech
Jan 25, 2024 · Artificial Intelligence

Ray-native XGBoost Training Platform: Architecture, Performance, and Technical Challenges

Didi’s new Ray‑native XGBoost training platform replaces the fault‑prone Spark solution with a fully Pythonic, fault‑tolerant architecture that leverages Ray’s autoscaling and gang‑scheduling, delivering 2–6× speedups, reduced failure rates, efficient sparse‑vector handling, scalable hyper‑parameter search, and improved resource utilization for large‑scale machine‑learning workloads.

MLOpsRayXGBoost
0 likes · 20 min read
Ray-native XGBoost Training Platform: Architecture, Performance, and Technical Challenges
360 Quality & Efficiency
360 Quality & Efficiency
Aug 4, 2023 · Artificial Intelligence

Machine Learning Model Testing Workflow and Best Practices

This article outlines the essential concepts, data preparation, model creation, training, deployment, and verification steps for testing machine‑learning models, highlighting dataset requirements, algorithm categories, framework choices, resource considerations, and provides a sample inference request.

AIModel DeploymentXGBoost
0 likes · 7 min read
Machine Learning Model Testing Workflow and Best Practices
Meituan Technology Team
Meituan Technology Team
Apr 6, 2023 · Databases

AI-Driven Index Recommendation for Slow Queries at Meituan

This article details a joint research effort between Meituan and East China Normal University that combines cost‑based methods with AI‑driven, data‑centric models to automatically generate and evaluate missing indexes for billions of daily slow queries, improving recommendation accuracy and query performance.

AICost ModelIndex Recommendation
0 likes · 16 min read
AI-Driven Index Recommendation for Slow Queries at Meituan
NetEase Cloud Music Tech Team
NetEase Cloud Music Tech Team
Nov 18, 2022 · Artificial Intelligence

Machine Learning-Based Anomaly Detection for Core Business Metrics

The paper proposes a containerized, machine‑learning framework that fuses rule‑based and XGBoost‑driven anomaly detection to monitor daily active users on a cloud music platform, achieving 89 % recall, 81 % precision and up to 74 % recall improvement over traditional threshold methods, while outlining future model refinement and broader metric applicability.

3-sigmaData IntelligenceHolt-Winters
0 likes · 11 min read
Machine Learning-Based Anomaly Detection for Core Business Metrics
Model Perspective
Model Perspective
Sep 27, 2022 · Artificial Intelligence

Master XGBoost: Boosting Trees Explained with Python Code

This article explains the core concepts of XGBoost as a boosting tree algorithm, describes how it builds ensembles of decision trees to predict outcomes, and provides complete Python implementations for classification and regression using the Scikit-learn interface, along with visualizations of trees and feature importance.

PythonXGBoostboosting
0 likes · 4 min read
Master XGBoost: Boosting Trees Explained with Python Code
DataFunSummit
DataFunSummit
Sep 14, 2022 · Artificial Intelligence

Vertical Federated XGBoost (XGB) Algorithm: Problem Definition, Secure Training, Optimization, and Prediction

This article presents a comprehensive overview of the vertical federated XGB algorithm, covering its problem definition, secure multi‑party training techniques, performance‑optimizing oblivious permutation methods, and prediction workflow, while evaluating its scalability and efficiency under various network conditions.

XGBoostprivacy-preserving
0 likes · 12 min read
Vertical Federated XGBoost (XGB) Algorithm: Problem Definition, Secure Training, Optimization, and Prediction
NetEase Yanxuan Technology Product Team
NetEase Yanxuan Technology Product Team
May 5, 2022 · Artificial Intelligence

Time Series Forecasting Algorithm System in E-commerce: Practice and Applications at NetEase Yanxuan

NetEase Yanxuan built an end‑to‑end time‑series forecasting system for e‑commerce that integrates rich user, product, business and external features with a suite of statistical, machine‑learning and deep‑learning models, delivers predictions via a Tornado‑based service for thousands of SKUs, warehouses, advertising and app traffic, and shows that simpler models like XGBoost often outperform complex deep nets while interpretability and external shocks remain key challenges.

Data ScienceSales PredictionXGBoost
0 likes · 10 min read
Time Series Forecasting Algorithm System in E-commerce: Practice and Applications at NetEase Yanxuan
Code DAO
Code DAO
Dec 17, 2021 · Artificial Intelligence

How to Accelerate XGBoost Training with Tree Methods, Cloud Computing, and Ray

The article explains why XGBoost training can be slow despite its speed focus and presents three acceleration techniques—choosing an optimal tree_method, leveraging cloud resources for larger memory, and using Ray for distributed training—complete with code examples and benchmark results.

Distributed TrainingRayXGBoost
0 likes · 5 min read
How to Accelerate XGBoost Training with Tree Methods, Cloud Computing, and Ray
G7 EasyFlow Tech Circle
G7 EasyFlow Tech Circle
Sep 15, 2021 · Artificial Intelligence

Can Predictive Models Uncover Causal Effects? A Truck Risk Case Study

Using a road freight accident prediction example, the article warns that interpreting predictive model explanations as causal effects can be misleading, explains when such models may answer causal questions, demonstrates SHAP analysis on an XGBoost model, and recommends causal inference tools like ecoml for reliable effect estimation.

Risk PredictionSHAPXGBoost
0 likes · 10 min read
Can Predictive Models Uncover Causal Effects? A Truck Risk Case Study
Sohu Tech Products
Sohu Tech Products
Jul 21, 2021 · Artificial Intelligence

Kaggle Jane Street Market Prediction Competition Summary and Model Insights

This article summarizes the author's participation in the Kaggle Jane Street Market Prediction competition, detailing the anonymous feature dataset, utility‑score metric, data preprocessing, the combined AE‑MLP and XGBoost modeling approach, threshold tuning, experimental findings, and references for further study.

AutoencoderKaggleMLP
0 likes · 8 min read
Kaggle Jane Street Market Prediction Competition Summary and Model Insights
58 Tech
58 Tech
Dec 25, 2020 · Artificial Intelligence

User Identity Recognition on Internet Platforms: Solving Cold‑Start with Keyword Matching, XGBoost, TextCNN, and an Improved Wide & Deep Model

This article presents a comprehensive study on C‑side user identity recognition for internet platforms, addressing cold‑start and sample‑scarcity challenges by comparing keyword matching, XGBoost, TextCNN, a fusion model, and an improved Wide & Deep architecture, showing that the latter achieves the highest F1 score of 80.67%.

Model EvaluationTextCNNWide&Deep
0 likes · 13 min read
User Identity Recognition on Internet Platforms: Solving Cold‑Start with Keyword Matching, XGBoost, TextCNN, and an Improved Wide & Deep Model
360 Quality & Efficiency
360 Quality & Efficiency
Dec 20, 2019 · Artificial Intelligence

Automated APK Test Script Recommendation: Data Processing and Model Training Pipeline

This article describes a complete pipeline for recommending automated test scripts for APK releases, covering CSV data preprocessing, feature encoding, tokenization with pkuseg and jieba, and training various machine‑learning models such as LDA, word2vec, XGBoost, deep neural networks, and multi‑label classifiers to predict script execution order.

APK testingDeep LearningModel Training
0 likes · 14 min read
Automated APK Test Script Recommendation: Data Processing and Model Training Pipeline
58 Tech
58 Tech
Nov 11, 2019 · Artificial Intelligence

Design and Implementation of the 58 Car Price Estimation System Using Machine Learning

The article describes the end‑to‑end architecture, data collection, preprocessing, feature engineering, model selection, training, and hyper‑parameter tuning of 58’s car price estimation platform, which leverages Spark, XGBoost, LightGBM and custom business rules to predict vehicle resale values.

LightGBMXGBoostcar price estimation
0 likes · 11 min read
Design and Implementation of the 58 Car Price Estimation System Using Machine Learning
Mafengwo Technology
Mafengwo Technology
Nov 7, 2019 · Artificial Intelligence

Inside MaFengWo’s Scalable Ranking Platform: Architecture, Verification & Explainability

This article explains how MaFengWo’s recommendation system combines recall, ranking, and rerank stages, details the evolution of its sorting algorithm platform, and shows how data verification and model‑explainability techniques like SHAP and LIME improve online performance and accelerate model iteration.

Data verificationModel ExplainabilityXGBoost
0 likes · 13 min read
Inside MaFengWo’s Scalable Ranking Platform: Architecture, Verification & Explainability
Didi Tech
Didi Tech
Oct 8, 2019 · Artificial Intelligence

Didi and Ant Financial Co‑Develop SQLFlow to Bring AI Capabilities to Data Analysts

Partnering with Ant Financial, Didi enhanced the open-source SQLFlow platform—translating SQL into end-to-end AI workflows with added deep-learning, XGBoost, clustering and SHAP explanation capabilities and Hive support—to create a “SQL garden” marketplace where analysts can deploy ready-made AI models via simple SQL, speeding enterprise AI adoption.

AIData ScienceSHAP
0 likes · 9 min read
Didi and Ant Financial Co‑Develop SQLFlow to Bring AI Capabilities to Data Analysts
AntTech
AntTech
Sep 27, 2019 · Artificial Intelligence

Didi and Ant Financial Co‑Develop SQLFlow to Bring AI Capabilities to Data Analysts

The article describes how Didi's data science team partnered with Ant Financial to co‑build the open‑source SQLFlow platform, enabling analysts to launch AI models via simple SQL, detailing the models contributed, technical extensions, and the broader vision for a universal AI ecosystem.

AIData ScienceSHAP
0 likes · 8 min read
Didi and Ant Financial Co‑Develop SQLFlow to Bring AI Capabilities to Data Analysts
Meituan Technology Team
Meituan Technology Team
Aug 15, 2019 · Big Data

Inconsistent Predictions in XGBoost on Spark Due to Different Missing Value Handling

The discrepancy between XGBoost’s Java engine and Spark arose because XGBoost4j treats zero as the default missing value while Spark’s sparse vectors use NaN, causing inconsistent predictions, and was resolved by explicitly setting Float.NaN as the missing value or converting sparse vectors to dense so both engines handle zeros uniformly.

SparkSparseVectorXGBoost
0 likes · 13 min read
Inconsistent Predictions in XGBoost on Spark Due to Different Missing Value Handling
DataFunTalk
DataFunTalk
Jun 6, 2019 · Artificial Intelligence

Design and Machine Learning Practices for Automotive Finance Risk Control

This article outlines the end‑to‑end design of automotive finance risk‑control processes, discusses key data integrity and customer segmentation considerations, and details machine‑learning modeling practices—including logistic regression, decision trees, GBDT, XGBoost, LightGBM and CatBoost—along with an automated platform to streamline model development and deployment.

Credit ScoringGBDTXGBoost
0 likes · 17 min read
Design and Machine Learning Practices for Automotive Finance Risk Control
37 Interactive Technology Team
37 Interactive Technology Team
Apr 28, 2019 · Artificial Intelligence

Application of Machine Learning Algorithms in Mobile Game Recharge Monitoring

By applying XGBoost‑based regression models that are retrained daily on two‑week order data and tuned per sub‑package, the mobile‑game recharge monitoring system predicts 10‑minute order volumes, sharply cuts false alarms from hundreds to dozens, and delivers precise, scalable alerts for game operations.

Mobile GamingModel EvaluationXGBoost
0 likes · 8 min read
Application of Machine Learning Algorithms in Mobile Game Recharge Monitoring
Tencent Advertising Technology
Tencent Advertising Technology
Apr 23, 2019 · Artificial Intelligence

Tencent Advertising Algorithm Competition: Experience and Tips from the Runner‑Up

This article shares the experience of Xu An, runner‑up in the 2019 Tencent Advertising Algorithm Competition, detailing practical advice on feature engineering, model selection, efficiency tricks, personal habits, contest rhythm, and learning resources for aspiring participants.

Algorithm ContestLightGBMTencent Advertising Competition
0 likes · 6 min read
Tencent Advertising Algorithm Competition: Experience and Tips from the Runner‑Up
58 Tech
58 Tech
Sep 7, 2018 · Artificial Intelligence

Cupid Push Control System: Machine‑Learning‑Driven Notification Optimization at 58.com

The article details how 58.com’s Cupid push control system leverages machine‑learning models, especially XGBoost‑based CTR prediction, to prioritize and filter billions of daily push notifications, improving click‑through rates, reducing user annoyance, and providing a scalable, data‑driven architecture for diverse business services.

AB testingCTR predictionSystem Architecture
0 likes · 13 min read
Cupid Push Control System: Machine‑Learning‑Driven Notification Optimization at 58.com
Ctrip Technology
Ctrip Technology
Sep 4, 2018 · Artificial Intelligence

Call Center Volume Forecasting and Staffing Optimization at Ctrip: From Data Cleaning to V2.0 Predictive System

This article describes Ctrip's call‑center staffing challenge, detailing data cleaning, trend analysis, feature engineering, the initial ARIMAX‑Fourier model (V1.0), its limitations, and the improved V2.0 solution that combines TBATS, ARIMA residuals and XGBoost, achieving up to 89.5% prediction accuracy.

Time SeriesXGBoostcall center
0 likes · 9 min read
Call Center Volume Forecasting and Staffing Optimization at Ctrip: From Data Cleaning to V2.0 Predictive System
Ctrip Technology
Ctrip Technology
Aug 7, 2018 · Artificial Intelligence

Forecasting and Monitoring in Business Intelligence: Practical Data‑Analysis Methods and Model‑Building Tips

The article explains how a data analyst can use statistical and machine‑learning models such as linear regression, tree‑based boosting, STL decomposition, and Prophet for both non‑time‑series forecasting and time‑series monitoring, highlighting data‑quality concerns, feature‑engineering practices, and deployment considerations like PMML packaging.

BIProphetSTL
0 likes · 13 min read
Forecasting and Monitoring in Business Intelligence: Practical Data‑Analysis Methods and Model‑Building Tips
21CTO
21CTO
Sep 20, 2017 · Big Data

Winning O2O Coupon Redemption with XGBoost, GBDT, and Feature Engineering

This article details a data-driven solution for the 2016 O2O coupon redemption competition, describing dataset partitioning, extensive feature engineering across user, merchant, and coupon dimensions, handling leakage, and model fusion using XGBoost, GBDT, and RandomForest, achieving top AUC scores through weighted ensemble.

GBDTXGBoostcoupon redemption
0 likes · 12 min read
Winning O2O Coupon Redemption with XGBoost, GBDT, and Feature Engineering
Tencent Advertising Technology
Tencent Advertising Technology
Jun 25, 2017 · Artificial Intelligence

Interview with ‘拔萝卜’: Lessons Learned from the Tencent Social Ads Algorithm Competition

In this interview, a solo female participant from Shanghai Jiao Tong University shares her experience, challenges, and technical insights—including feature engineering, memory management, and model tuning with XGBoost and LightGBM—gained while competing in the Tencent Social Ads algorithm contest.

Model tuningTencentXGBoost
0 likes · 5 min read
Interview with ‘拔萝卜’: Lessons Learned from the Tencent Social Ads Algorithm Competition
Tencent Advertising Technology
Tencent Advertising Technology
Jun 17, 2017 · Artificial Intelligence

SkullGreymon Team’s Progress and Technical Insights in the Tencent Social Ads Algorithm Competition

The SkullGreymon team, winners of the biggest improvement award in the Tencent Social Ads university algorithm competition, share their journey from a late start in the preliminaries to significant performance gains in the finals, detailing memory‑saving feature extraction techniques, pandas and numpy usage, and their XGBoost modeling approach.

Memory OptimizationXGBoostalgorithm competition
0 likes · 6 min read
SkullGreymon Team’s Progress and Technical Insights in the Tencent Social Ads Algorithm Competition
Tencent Advertising Technology
Tencent Advertising Technology
May 27, 2017 · Artificial Intelligence

Weekly Champion Interview: Groot Team Shares Competition Strategies

The article introduces the Tencent Social Ads university algorithm contest weekly champion team Groot, details their background, and outlines their practical machine‑learning approach—including training set construction, XGBoost model selection, and feature engineering—while encouraging broader participation in such competitions.

AIAdvertisingXGBoost
0 likes · 4 min read
Weekly Champion Interview: Groot Team Shares Competition Strategies