Tagged articles
283 articles
Page 1 of 3
Data Party THU
Data Party THU
May 19, 2026 · Artificial Intelligence

Model Performance Lagging? Master Feature Engineering with a Complete Step‑by‑Step Guide

This article walks through the entire feature‑engineering pipeline—data cleaning, missing‑value imputation, encoding, outlier handling, scaling, feature construction, and selection—using Pandas and Scikit‑learn, and shows how to wrap the steps into a reproducible Scikit‑learn Pipeline.

Pipelinedata preprocessingfeature engineering
0 likes · 9 min read
Model Performance Lagging? Master Feature Engineering with a Complete Step‑by‑Step Guide
Data Party THU
Data Party THU
May 15, 2026 · Artificial Intelligence

2026 Big Data Challenge Announces Monthly Star Winners and Shares Winning Teams’ Insights

The 2026 China University Computer Competition – Big Data Challenge reveals the Monthly Star award winners, each receiving 800 RMB, and presents detailed experience reports from the top teams covering feature engineering, model selection, training validation, and ensemble strategies for stock prediction.

Big DataModel FusionStock Prediction
0 likes · 7 min read
2026 Big Data Challenge Announces Monthly Star Winners and Shares Winning Teams’ Insights
DeepHub IMBA
DeepHub IMBA
May 12, 2026 · Artificial Intelligence

Hands‑On Feature Engineering with Pandas and Scikit‑Learn: Complete Code Walkthrough

This article walks through a full feature‑engineering pipeline using Pandas and Scikit‑Learn, covering data inspection, missing‑value imputation, categorical encoding, outlier handling, scaling, feature construction, selection, and a final Pipeline that prepares clean, predictive features for a logistic‑regression model.

Pipelinedata preprocessingfeature engineering
0 likes · 9 min read
Hands‑On Feature Engineering with Pandas and Scikit‑Learn: Complete Code Walkthrough
PMTalk Product Manager Community
PMTalk Product Manager Community
May 5, 2026 · Product Management

What Kind of Product Manager Drives Algorithm Engineers Crazy?

The article explains why algorithm engineers resent product managers who treat models as black‑boxes, make vague data‑blind demands, and ignore experimental cycles, and it offers three concrete practices—feature‑focused communication, metric quantification, and respecting experiment timelines—to become a trusted teammate.

algorithm collaborationcross‑functional communicationdata‑driven requirements
0 likes · 8 min read
What Kind of Product Manager Drives Algorithm Engineers Crazy?
Data Party THU
Data Party THU
Apr 21, 2026 · Artificial Intelligence

Can LLM Attack Detection Work Without Storing Any Conversation Text?

This article experimentally evaluates a privacy‑preserving LLM security pipeline that discards raw dialogue after extracting 28 telemetry features, showing that using only 11 text‑independent signals retains about 98.5% of detection performance while reducing false‑positive rates.

LLM Securityfeature engineeringjailbreak detection
0 likes · 10 min read
Can LLM Attack Detection Work Without Storing Any Conversation Text?
Huolala Safety Emergency Response Center
Huolala Safety Emergency Response Center
Apr 15, 2026 · Information Security

How to Auto‑Label 10K APIs with 95% Confidence Using Self‑Learning Feature Engineering

This article presents a detailed case study of how a large‑scale API security team built an automated, self‑learning classification system that tags tens of thousands of APIs with business labels, improves model accuracy by five points, and maintains high precision through a confidence‑driven feedback loop.

API SecurityCatBoostSHAP
0 likes · 13 min read
How to Auto‑Label 10K APIs with 95% Confidence Using Self‑Learning Feature Engineering
Huolala Tech
Huolala Tech
Apr 15, 2026 · Information Security

How We Built a Self‑Learning API Classification System for Security

This article details a real‑world case study of how a large logistics platform created an automated, self‑evolving API asset‑classification pipeline—covering data collection, feature engineering, model training with CatBoost, confidence‑based label feedback, and lessons learned—to improve API security monitoring and reduce manual labeling effort.

API SecurityCatBoostSHAP
0 likes · 13 min read
How We Built a Self‑Learning API Classification System for Security
DeepHub IMBA
DeepHub IMBA
Apr 6, 2026 · Artificial Intelligence

Mastering Machine Learning Feature Engineering: Scaling, Encoding, Aggregation, Embedding, and Automation

The article explains why good features matter more than fancy algorithms and walks through practical techniques—scaling, log transforms, binning, interaction, various encoding schemes, datetime extraction, text statistics, geospatial distances, aggregation, feature selection, and automated feature generation—illustrated with concrete pandas and scikit‑learn code examples.

automationencodingfeature engineering
0 likes · 16 min read
Mastering Machine Learning Feature Engineering: Scaling, Encoding, Aggregation, Embedding, and Automation
PMTalk Product Manager Community
PMTalk Product Manager Community
Mar 2, 2026 · Product Management

The Product Manager Traits That Drive Algorithm Engineers Crazy

Algorithm engineers often resent product managers who treat models as black‑box wish‑lists, ignore data feasibility, and issue vague demands, leading to broken trust, wasted effort, and stalled projects, but adopting feature‑focused communication, concrete metrics, and realistic experiment timelines can turn a problematic partnership into a productive collaboration.

algorithm collaborationexperiment planningfeature engineering
0 likes · 8 min read
The Product Manager Traits That Drive Algorithm Engineers Crazy
Data Party THU
Data Party THU
Jan 25, 2026 · Big Data

How Tsinghua’s Big Data Initiative Boosted Refinery Energy Forecasts with GRU

The Tsinghua University Big Data Capability Project applied GRU‑based deep learning, pulse‑event encoding, and advanced feature engineering to transform discrete refinery energy data into continuous sequences, achieving prediction accuracies of 84.2%, 82.7% and 81.6% for fuel gas, medium‑pressure and low‑pressure steam respectively.

GRUenergy predictionfeature engineering
0 likes · 9 min read
How Tsinghua’s Big Data Initiative Boosted Refinery Energy Forecasts with GRU
Kuaishou Tech
Kuaishou Tech
Nov 12, 2025 · Artificial Intelligence

How KaiFG Lets Python Feature Engineering Run at C++ Speed

KaiFG, Kuaishou's self‑built AI Feature Generator, unifies fragmented feature extraction frameworks, replaces slow C++ compilation cycles with Python‑level development, and achieves near‑C++ performance through Codon‑based compilation, reference‑counted memory management, and aggressive LLVM optimizations, dramatically shortening iteration time.

AI InfrastructureHigh‑performance computingfeature engineering
0 likes · 14 min read
How KaiFG Lets Python Feature Engineering Run at C++ Speed
Instant Consumer Technology Team
Instant Consumer Technology Team
Oct 29, 2025 · Big Data

Revolutionizing Feature Engineering with Distributed Tech & Configurable Services

Facing PB‑scale user behavior data and millions of feature dimensions, the platform transformed its search, advertising, and recommendation pipelines by adopting a distributed, configurable‑service architecture that delivers high‑throughput streaming, elastic storage, rapid feature iteration, and robust fault‑tolerance for AI‑driven personalization.

Big DataData ArchitectureDistributed Systems
0 likes · 17 min read
Revolutionizing Feature Engineering with Distributed Tech & Configurable Services
Instant Consumer Technology Team
Instant Consumer Technology Team
Oct 28, 2025 · Artificial Intelligence

Can Data Virtualization Deliver Millisecond Real‑Time Features Across Stores?

This article shares a three‑year journey of building a data‑virtualization‑based, multi‑environment feature management framework for real‑time risk decision platforms, detailing challenges like heterogeneous storage, cold‑start, and operational stability, and presenting a unified architecture that decouples physical storage from business logic.

Big DataReal-time analyticsdata virtualization
0 likes · 16 min read
Can Data Virtualization Deliver Millisecond Real‑Time Features Across Stores?
Data STUDIO
Data STUDIO
Oct 28, 2025 · Artificial Intelligence

8 Proven Ways to Boost Machine Learning Model Accuracy

This article outlines eight practical techniques—including data augmentation, handling missing values, feature engineering, algorithm selection, hyperparameter tuning, ensemble methods, and cross‑validation—to systematically improve the accuracy of Python machine‑learning models, supported by explanations, examples, and code snippets.

cross-validationdata preprocessingensemble methods
0 likes · 16 min read
8 Proven Ways to Boost Machine Learning Model Accuracy
IT Services Circle
IT Services Circle
Sep 28, 2025 · Artificial Intelligence

How to Build a Python AI Model for Predicting User Behavior

This article walks through the complete machine‑learning workflow for predicting user actions—covering core concepts, data collection, preprocessing, feature engineering, model training, evaluation, hyper‑parameter tuning, deployment, and future directions—using Python and popular AI libraries.

Model EvaluationPythonfeature engineering
0 likes · 11 min read
How to Build a Python AI Model for Predicting User Behavior
Data Party THU
Data Party THU
Sep 12, 2025 · Big Data

Key Lessons from Winning the 2025 China University Big Data Competition

The author shares a detailed account of their experience in the 2025 China University Big Data Competition, describing the team’s top national ranking, the shift from absolute stock price prediction to robust ranking learning, extensive feature engineering, and reflections on balancing technical ambition with real‑world constraints.

Big DataStock Predictiondata competition
0 likes · 5 min read
Key Lessons from Winning the 2025 China University Big Data Competition
Data Party THU
Data Party THU
Sep 11, 2025 · Big Data

How We Conquered the 2025 Chinese University Big Data Challenge: Financial Time‑Series Lessons

Our team "Stay Overnight" from Chongqing University of Posts and Telecommunications placed second nationally in the 2025 China University Computer Competition Big Data Challenge, navigating volatile financial data, shifting from time‑series to supervised learning, and emphasizing feature engineering to boost model performance.

Big DataModel Selectioncompetition report
0 likes · 4 min read
How We Conquered the 2025 Chinese University Big Data Challenge: Financial Time‑Series Lessons
Data Party THU
Data Party THU
Sep 8, 2025 · Big Data

What We Learned from the 2025 China University Big Data Competition

The article shares a top‑5 team's experience in the 2025 China University Big Data Challenge, detailing their roster, competition rules, four key technical insights on data pitfalls, model alignment, generalization, and leveraging SOTA models, plus reflections on the event's excellent support and collaborative atmosphere.

Big Datafeature engineeringmodel generalization
0 likes · 6 min read
What We Learned from the 2025 China University Big Data Competition
Tencent Advertising Technology
Tencent Advertising Technology
Aug 27, 2025 · Artificial Intelligence

How Three Undergrads Turned a Last‑Minute Team into a Top‑4 Finish in Tencent’s AI Competition

A trio of freshly graduated undergraduates formed a team just two days before the deadline, overcame early setbacks by reordering feature engineering, leveraged score‑driven decisions, and ultimately surged to fourth place in Tencent's highly competitive algorithm contest, illustrating resilience and practical AI teamwork.

AI competitionTencentfeature engineering
0 likes · 7 min read
How Three Undergrads Turned a Last‑Minute Team into a Top‑4 Finish in Tencent’s AI Competition
Alibaba Cloud Developer
Alibaba Cloud Developer
Jul 17, 2025 · Artificial Intelligence

How to Build a House Price Prediction Model with Python: A Step‑by‑Step Guide

This tutorial walks developers through the complete workflow of building a house‑price regression model—from problem definition, data collection and preprocessing, feature engineering, and model selection, to training, hyper‑parameter tuning, evaluation, optimization, deployment as a Flask service, and ongoing monitoring—using Python, pandas, scikit‑learn, and visualisation libraries.

Model DeploymentPythonfeature engineering
0 likes · 29 min read
How to Build a House Price Prediction Model with Python: A Step‑by‑Step Guide
JD Retail Technology
JD Retail Technology
Jun 10, 2025 · Artificial Intelligence

How JD Builds a Scalable AI‑Powered Recommendation Data System with Flink

This article explains JD's complex recommendation system data pipeline—from indexing, sampling, and feature engineering to explainability and real‑time metrics—highlighting challenges such as data consistency, latency, and the use of Flink for massive, low‑latency processing.

Flinkexplainabilityfeature engineering
0 likes · 23 min read
How JD Builds a Scalable AI‑Powered Recommendation Data System with Flink
Architect
Architect
May 31, 2025 · Artificial Intelligence

Edge Intelligence Implementation in the Vivo Official App: Architecture, Feature Engineering, and Model Deployment

The article details how edge intelligence is applied to the Vivo official app to improve product recommendation on the smart‑hardware floor by abstracting the problem, designing feature engineering pipelines, training TensorFlow models, converting them to TFLite, and deploying inference on mobile devices, while also covering monitoring and performance considerations.

Model DeploymentTensorFlow Liteedge AI
0 likes · 19 min read
Edge Intelligence Implementation in the Vivo Official App: Architecture, Feature Engineering, and Model Deployment
AI Algorithm Path
AI Algorithm Path
May 27, 2025 · Artificial Intelligence

Reinforcement Learning Tutorial 8: Building State Feature Representations for Objective Optimization

This tutorial explains how to construct state feature vectors for reinforcement‑learning value‑function approximation, covering linear, polynomial, Fourier, and radial‑basis representations, as well as state aggregation techniques such as coarse coding and tile coding, and discusses non‑parametric approaches like kernel methods.

Reinforcement Learningfeature engineeringfourier basis
0 likes · 16 min read
Reinforcement Learning Tutorial 8: Building State Feature Representations for Objective Optimization
vivo Internet Technology
vivo Internet Technology
May 21, 2025 · Artificial Intelligence

How Vivo’s App Leverages Edge AI to Personalize Product Recommendations

This article details how Vivo’s official app implements edge intelligence to dynamically rank and recommend hardware products on its homepage, covering problem abstraction, data collection, feature engineering, model design, TensorFlow‑Lite conversion, on‑device inference, and monitoring for a personalized user experience.

AndroidModel DeploymentTensorFlow Lite
0 likes · 19 min read
How Vivo’s App Leverages Edge AI to Personalize Product Recommendations
Python Programming Learning Circle
Python Programming Learning Circle
Mar 24, 2025 · Artificial Intelligence

Comprehensive List of Aggregation Functions and Custom Feature Engineering Utilities for Python

This article presents a detailed collection of built‑in pandas aggregation methods and numerous custom Python functions for time‑series feature engineering, offering beginners practical tools to enhance data preprocessing and model performance in machine‑learning projects.

Time Seriesaggregation functionsfeature engineering
0 likes · 10 min read
Comprehensive List of Aggregation Functions and Custom Feature Engineering Utilities for Python
Cognitive Technology Team
Cognitive Technology Team
Mar 17, 2025 · Artificial Intelligence

Leveraging Large Language Models to Optimize Traditional Machine Learning Pipelines

Large language models can assist and enhance each stage of traditional machine learning—including sample generation, data cleaning, feature engineering, model selection, hyper‑parameter tuning, and workflow automation—by generating synthetic data, refining features, selecting models, and orchestrating pipelines, though challenges such as bias, privacy, and noise remain.

Data GenerationLLMfeature engineering
0 likes · 11 min read
Leveraging Large Language Models to Optimize Traditional Machine Learning Pipelines
JavaEdge
JavaEdge
Mar 15, 2025 · Artificial Intelligence

Boost NLP Model Performance with n-gram Feature Engineering

This article explains why feature engineering is crucial for NLP tasks, introduces n‑gram enhancements, provides Python implementations for generating bi‑gram and higher‑order features, demonstrates dynamic padding for text length standardization, and offers practical deployment tips such as feature dimension control and monitoring.

Deep LearningN-gramNLP
0 likes · 7 min read
Boost NLP Model Performance with n-gram Feature Engineering
Alibaba Cloud Developer
Alibaba Cloud Developer
Mar 6, 2025 · Artificial Intelligence

From Linear Regression to Transformers: Mastering Machine Learning Foundations

This comprehensive guide walks readers through the evolution of machine learning, starting with basic linear models and feature engineering, progressing through logistic regression, decision trees, and deep learning architectures like MLPs, CNNs, RNNs, and transformers, and demonstrates practical implementations with code examples and evaluation metrics.

Deep LearningEvaluation Metricsfeature engineering
0 likes · 64 min read
From Linear Regression to Transformers: Mastering Machine Learning Foundations
AI Code to Success
AI Code to Success
Feb 25, 2025 · Artificial Intelligence

Master Logistic Regression: Theory, Practice, and Real‑World Tips

This comprehensive guide explains logistic regression fundamentals, the role of the Sigmoid function, loss and optimization methods, step‑by‑step Python implementation with data preparation, model training, evaluation, hyper‑parameter tuning, handling over‑ and under‑fitting, multi‑class extensions, and diverse application scenarios across medicine, finance, e‑commerce, and text analysis.

Model EvaluationPythonclassification
0 likes · 23 min read
Master Logistic Regression: Theory, Practice, and Real‑World Tips
Airbnb Technology Team
Airbnb Technology Team
Jan 24, 2025 · Artificial Intelligence

Chronon — An Open-Source Framework for Production-Level Feature Engineering in Machine Learning

Chronon is an open‑source framework that centralizes feature definitions to guarantee training‑inference consistency, eliminates complex ETL pipelines, and supports real‑time and batch processing across diverse data sources, cutting feature‑development cycles from months to under a week, as demonstrated by Airbnb’s 40,000‑feature deployment.

ChrononSparkfeature engineering
0 likes · 10 min read
Chronon — An Open-Source Framework for Production-Level Feature Engineering in Machine Learning
Bilibili Tech
Bilibili Tech
Dec 27, 2024 · Big Data

Consistency Architecture for Bilibili Recommendation Model Data Flow

The article outlines Bilibili’s revamped recommendation data‑flow architecture that eliminates timing and calculation inconsistencies by snapshotting online features, unifying feature computation in a single C++ library accessed via JNI, and orchestrating label‑join and sample extraction through near‑line Kafka/Flink pipelines, with further performance gains and Iceberg‑based future extensions.

Data ConsistencyFlinkIceberg
0 likes · 12 min read
Consistency Architecture for Bilibili Recommendation Model Data Flow
Tencent Advertising Technology
Tencent Advertising Technology
Dec 6, 2024 · Big Data

Building a High‑Performance Advertising Feature Data Lake with Apache Iceberg at Tencent

Tencent's advertising team replaced a traditional HDFS‑Hive warehouse with an Apache Iceberg‑based data lake, adding primary‑key tables, multi‑stream merging, adaptive compaction, and Spark SPJ optimizations to achieve minute‑level feature update latency, 10× back‑fill speed, and up to 60% storage savings.

Big DataCDCData Lake
0 likes · 25 min read
Building a High‑Performance Advertising Feature Data Lake with Apache Iceberg at Tencent
Test Development Learning Exchange
Test Development Learning Exchange
Nov 26, 2024 · Artificial Intelligence

Comprehensive Python Tutorial for Data Preprocessing, Feature Engineering, Model Training, Evaluation, and Deployment

This tutorial walks through consolidating the first ten days of learning by covering data preprocessing, feature engineering, model training with linear regression, decision tree, and random forest, model evaluation using cross‑validation, and finally saving and loading the best model, all illustrated with complete Python code examples.

Model TrainingPythondata preprocessing
0 likes · 9 min read
Comprehensive Python Tutorial for Data Preprocessing, Feature Engineering, Model Training, Evaluation, and Deployment
DataFunSummit
DataFunSummit
Nov 20, 2024 · Artificial Intelligence

How Data Lakes Empower AI: Expert Insights on Feature Management, Columnar Storage, and Vector Formats

In a panel discussion, experts explain how data‑lake‑warehouse integration, columnar formats like Apache Iceberg, and emerging variant types enable efficient feature engineering, support large‑language‑model workloads, and provide flexible vector storage, thereby driving the evolution of AI from traditional ML to the GenAI era.

Apache IcebergArtificial IntelligenceData Lake
0 likes · 6 min read
How Data Lakes Empower AI: Expert Insights on Feature Management, Columnar Storage, and Vector Formats
DataFunTalk
DataFunTalk
Nov 6, 2024 · Big Data

How Data Lakes Empower AI: Insights from Industry Experts

In a panel discussion, experts from Kuaishou, Ping An, and Datastrato explain how data lake architectures, columnar storage formats like Apache Iceberg, and vector‑enabled lake formats are enhancing feature management, supporting generative AI workloads, and accelerating machine‑learning pipelines.

AIApache IcebergBig Data
0 likes · 6 min read
How Data Lakes Empower AI: Insights from Industry Experts
DataFunSummit
DataFunSummit
Oct 11, 2024 · Artificial Intelligence

Feature Production and Component Modeling in the Intelligent Era: From Feature Generation to Modular Modeling

This article introduces a cloud‑based feature production platform that simplifies feature engineering for recommendation, risk control and machine learning, explains its component‑based modeling framework, and answers common questions about deployment, performance, and customization, highlighting cross‑platform compatibility and optimization techniques.

Artificial IntelligenceBig DataFeature Store
0 likes · 19 min read
Feature Production and Component Modeling in the Intelligent Era: From Feature Generation to Modular Modeling
Python Programming Learning Circle
Python Programming Learning Circle
Sep 10, 2024 · Artificial Intelligence

Time Series Feature Engineering Techniques in Python

This article explains how to extract a variety of date‑time based features—including date, time, lag, rolling, expanding, and domain‑specific attributes—from a time‑series dataset using pandas, and discusses proper validation strategies for building reliable forecasting models.

Time Seriesfeature engineeringforecasting
0 likes · 14 min read
Time Series Feature Engineering Techniques in Python
DataFunTalk
DataFunTalk
Jul 27, 2024 · Information Security

Classification of Risk Control and Full-Scenario Anti-Cheat Strategies in the Internet

The article outlines how internet and financial risk control are categorized into anti‑cheat, anti‑fraud, and content security, describes full‑scenario cheating types, and presents a three‑step joint defense framework using perception, identification, and mitigation with feature‑based analysis.

Information Securityanti-cheatfeature engineering
0 likes · 7 min read
Classification of Risk Control and Full-Scenario Anti-Cheat Strategies in the Internet
iQIYI Technical Product Team
iQIYI Technical Product Team
Jul 5, 2024 · Big Data

RiskFactor: An Integrated Real‑Time and Offline Feature Platform for Risk Control

RiskFactor unifies iQIYI’s legacy real‑time and offline feature platforms onto Opal’s DAG‑plus‑SQL engine, accelerating feature production fifteen‑fold, cutting latency from hours to minutes, streamlining development, lowering costs, and delivering more reliable, versioned risk‑control capabilities against sophisticated online threats.

Big DataDAGReal-time Streaming
0 likes · 14 min read
RiskFactor: An Integrated Real‑Time and Offline Feature Platform for Risk Control
Python Programming Learning Circle
Python Programming Learning Circle
Jun 21, 2024 · Artificial Intelligence

Using scikit-learn for Data Mining: Feature Engineering, Parallel Processing, Pipelines, and Model Persistence

This article demonstrates how to perform data mining with scikit-learn by detailing the full workflow—from data acquisition and feature engineering, through parallel and pipeline processing, to automated hyper‑parameter tuning and model persistence—using the Iris dataset as an example.

Pipelinedata miningfeature engineering
0 likes · 13 min read
Using scikit-learn for Data Mining: Feature Engineering, Parallel Processing, Pipelines, and Model Persistence
DataFunTalk
DataFunTalk
Jun 13, 2024 · Artificial Intelligence

A/B Testing and Model Grayscale in Credit Risk Control: Concepts, Requirements, and Integrated Solutions

This article explains how A/B testing and model grayscale are applied in credit risk control, discusses the specific requirements for effective testing, compares upstream and risk‑system traffic splitting methods, and proposes an integrated all‑in‑one solution to simplify feature engineering, model evaluation, and deployment.

A/B testingcredit riskfeature engineering
0 likes · 5 min read
A/B Testing and Model Grayscale in Credit Risk Control: Concepts, Requirements, and Integrated Solutions
DataFunTalk
DataFunTalk
Jun 11, 2024 · Artificial Intelligence

Intelligent Risk Control: Concepts, Challenges, and Integrated Operational Architecture for Banking

This article explores the concept of intelligent risk control in banking, detailing its AI‑driven architecture, current challenges such as external data costs and model‑deployment friction, and proposes an integrated operational framework that leverages big data, knowledge graphs, and MLOps to enhance risk detection and decision‑making.

Artificial IntelligenceBankingMLOps
0 likes · 14 min read
Intelligent Risk Control: Concepts, Challenges, and Integrated Operational Architecture for Banking
DataFunSummit
DataFunSummit
Jun 7, 2024 · Artificial Intelligence

Understanding Feature Engineering for Risk Control Systems and Building an Easy-to-Use Feature Platform

Feature engineering, the process of creating input variables for machine learning models, is crucial for banking risk control; this article explains the concepts of features, variables, and metrics, outlines challenges in real‑time feature pipelines, and proposes a practical architecture and best practices for building an efficient, low‑code feature platform.

feature engineeringmachine learningplatform design
0 likes · 10 min read
Understanding Feature Engineering for Risk Control Systems and Building an Easy-to-Use Feature Platform
DataFunSummit
DataFunSummit
Jun 5, 2024 · Fundamentals

User Portrait Tagging: Construction, Feature Processing, and Evaluation

This article provides a comprehensive guide on building user portrait tags—from basic attribute tags to business and strategy tags—detailing data collection methods, feature engineering techniques such as cleaning, time decay, and smoothing, and evaluation metrics for cohesion and stability, aimed at data product managers and analysts.

Evaluation Metricsfeature engineeringproduct-management
0 likes · 12 min read
User Portrait Tagging: Construction, Feature Processing, and Evaluation
DataFunTalk
DataFunTalk
Jun 4, 2024 · Artificial Intelligence

Building an Integrated Intelligent Risk Control System for Banking

The article explores the concept, challenges, and future directions of intelligent banking risk control, emphasizing data integration, AI-driven modeling, feature engineering, MLOps, knowledge graphs, and large‑model applications to create a unified, automated risk management platform.

AIBankingMLOps
0 likes · 10 min read
Building an Integrated Intelligent Risk Control System for Banking
iQIYI Technical Product Team
iQIYI Technical Product Team
May 31, 2024 · Artificial Intelligence

How Opal Turns iQIYI’s ML Workflow into a Unified AI Platform

Opal is iQIYI's end‑to‑end machine‑learning platform that integrates feature production, sample construction, model training, and deployment with big‑data services, addressing duplicated effort, weak data processing, and fragmented pipelines to boost efficiency across recommendation, advertising, and risk‑control scenarios.

AI OperationsBig Data IntegrationDistributed Training
0 likes · 19 min read
How Opal Turns iQIYI’s ML Workflow into a Unified AI Platform
Python Programming Learning Circle
Python Programming Learning Circle
May 23, 2024 · Artificial Intelligence

Comprehensive Collection of Aggregation Functions for Feature Engineering in Python

This article presents a detailed compilation of pandas built‑in aggregation methods and a wide range of custom Python functions for time‑series feature engineering, providing ready‑to‑use code snippets that cover statistical descriptors, drawdown metrics, peak detection, and more for data science practitioners.

PythonTime Seriesaggregation
0 likes · 17 min read
Comprehensive Collection of Aggregation Functions for Feature Engineering in Python
StarRocks
StarRocks
Apr 2, 2024 · Big Data

How We Unified Real‑Time and Batch Financial Risk Features with StarRocks

This article details the challenges of maintaining separate real‑time and batch risk‑control features, evaluates Lambda and Kappa architectures, explores storage‑unified and compute‑unified alternatives, compares Hologres, StarRocks and ClickHouse, and presents a validated StarRocks‑based solution that dramatically reduces feature delivery latency and improves accuracy.

Kappa architectureLambda architecturefeature engineering
0 likes · 19 min read
How We Unified Real‑Time and Batch Financial Risk Features with StarRocks
Didi Tech
Didi Tech
Mar 28, 2024 · Big Data

How We Unified Real‑Time and Batch Features with StarRocks in Financial Risk Control

This article analyzes the challenges of building real‑time and batch risk‑control features, compares Lambda and Kappa architectures, evaluates storage‑unified and compute‑unified solutions, and details how StarRocks was selected, validated, and deployed to achieve high‑performance, low‑latency feature serving in a financial context.

Big DataReal-time analyticsStarRocks
0 likes · 19 min read
How We Unified Real‑Time and Batch Features with StarRocks in Financial Risk Control
DataFunSummit
DataFunSummit
Mar 26, 2024 · Fundamentals

User Profile Tagging: Construction, Feature Processing, and Evaluation

This article systematically explains the fundamentals of user profile tags, covering basic attribute tags, business and strategy-oriented tags, detailed feature processing methods such as anomaly handling, time decay, and smoothing, and provides evaluation metrics for cohesion and stability, supplemented by a practical Q&A session.

KPI alignmentRFM modelfeature engineering
0 likes · 12 min read
User Profile Tagging: Construction, Feature Processing, and Evaluation
HelloTech
HelloTech
Mar 14, 2024 · Artificial Intelligence

Feature Engineering: Concepts, Methods, and Automation

Feature engineering transforms existing data into new predictive variables through manual analysis or automated pipelines, encompassing single‑variable encoding, pairwise arithmetic, group‑statistics, multi‑variable combinations, time‑series and text derivations, with tools like Deep Feature Synthesis and beam‑search to generate and select useful features.

Time Seriesautomated featuresdata preprocessing
0 likes · 17 min read
Feature Engineering: Concepts, Methods, and Automation
Efficient Ops
Efficient Ops
Mar 10, 2024 · Databases

How Machine Learning Can Automate MySQL Index Optimization

This article explains how applying machine learning to database operations—specifically AIOps for MySQL—can automate index recommendation by parsing SQL, extracting semantic and statistical features, generating candidate index combinations, and training an XGBoost model to predict optimal indexes, reducing reliance on manual DBA work.

Index Optimizationaiopsfeature engineering
0 likes · 10 min read
How Machine Learning Can Automate MySQL Index Optimization
DataFunTalk
DataFunTalk
Feb 5, 2024 · Fundamentals

User Portrait Tagging: Construction, Feature Processing, and Evaluation

This article explains how to build user portrait tags—from basic attribute tags to business and strategy tags—covers methods for data collection, anomaly handling, time decay, smoothing, and evaluates tag quality using cohesion, stability, and AUC-related metrics to support data‑driven product decisions.

Data ScienceEvaluation Metricsfeature engineering
0 likes · 12 min read
User Portrait Tagging: Construction, Feature Processing, and Evaluation
Data Thinking Notes
Data Thinking Notes
Jan 16, 2024 · Fundamentals

Building Comprehensive Risk Feature Portraits for All Loan Stages

This article explains how to construct risk control feature portraits across four key scenarios—marketing, pre‑loan, in‑loan, and post‑loan—by selecting appropriate data dimensions, describing usable customer, behavior, and ID‑linked data, and illustrating each portrait with visual examples to guide accurate risk assessment.

Risk Modelingcredit riskdata dimensions
0 likes · 9 min read
Building Comprehensive Risk Feature Portraits for All Loan Stages
Sohu Tech Products
Sohu Tech Products
Jan 10, 2024 · Artificial Intelligence

Baidu's Practices and Insights on Recommendation Ranking

Baidu’s recommendation ranking system handles billions of daily impressions and millions of users by combining discrete and cross features, bias mitigation, and long‑short sequence modeling within a multi‑stage funnel and hierarchical architecture, while planning to integrate large language models for generative, interpretable, and decision‑oriented recommendations.

AIBaidufeature engineering
0 likes · 19 min read
Baidu's Practices and Insights on Recommendation Ranking
DataFunTalk
DataFunTalk
Jan 7, 2024 · Artificial Intelligence

Baidu's Recommendation Ranking: Background, Feature Design, Algorithms, Architecture, and Future Directions

This article presents Baidu's comprehensive approach to feed recommendation ranking, covering business and data background, feature engineering principles, core algorithmic strategies, system architecture design, and upcoming plans to integrate large language models for more intelligent and fair recommendations.

Baidufeature engineeringlarge-scale AI
0 likes · 19 min read
Baidu's Recommendation Ranking: Background, Feature Design, Algorithms, Architecture, and Future Directions
Huolala Tech
Huolala Tech
Dec 19, 2023 · Information Security

How to Build Effective Bot Management: Strategies, Architecture, and Tools

This article explains bot fundamentals, classifies bot types, analyzes the rising threat of malicious bot traffic, compares major vendor solutions, and outlines a four‑layer architecture with data, feature, model, and policy layers for robust bot management in modern web services.

Bot ManagementTraffic analysiscloud security
0 likes · 14 min read
How to Build Effective Bot Management: Strategies, Architecture, and Tools
DataFunSummit
DataFunSummit
Dec 15, 2023 · Artificial Intelligence

Integrating Large Language Models into Recommender Systems: Opportunities, Methods, and Challenges

This article explores how large language models can be incorporated into recommender systems, discussing background challenges, specific integration points across the recommendation pipeline, practical implementation methods, experimental results, and future research directions, while highlighting industrial considerations and potential improvements.

Industrial ApplicationsLLMModel Fusion
0 likes · 20 min read
Integrating Large Language Models into Recommender Systems: Opportunities, Methods, and Challenges
DataFunTalk
DataFunTalk
Oct 10, 2023 · Artificial Intelligence

Integrating Large Language Models into Recommender Systems: Opportunities, Methods, and Challenges

This article surveys how large language models can be incorporated into recommender systems, discussing their strengths and limitations, outlining where and how they can be applied across the recommendation pipeline, presenting recent research examples, and highlighting challenges and future directions for industrial deployment.

LLMfeature engineeringrecommender systems
0 likes · 20 min read
Integrating Large Language Models into Recommender Systems: Opportunities, Methods, and Challenges
政采云技术
政采云技术
Oct 10, 2023 · Artificial Intelligence

Predicting Membership Purchase with Logistic Regression: Feature Engineering, Model Training, Evaluation, and Deployment

This article presents a complete workflow for predicting whether users will purchase a membership using logistic regression, covering data collection, feature selection, handling imbalanced samples, model training, hyper‑parameter tuning, threshold optimization, evaluation metrics such as accuracy, precision, recall, AUC, lift, and finally deployment on a big‑data platform with PySpark.

Big DataModel Evaluationfeature engineering
0 likes · 17 min read
Predicting Membership Purchase with Logistic Regression: Feature Engineering, Model Training, Evaluation, and Deployment
HomeTech
HomeTech
Sep 21, 2023 · Artificial Intelligence

Homepage Pop‑up Recommendation System for Car Purchase Intent: Background, Feature Engineering, Model and Strategy Optimization, and Results

This article details how AutoHome's homepage pop‑up leverages precise targeting, extensive feature engineering, and multi‑stage DeepFM‑based models with attention and LHUC modules to accurately identify car‑buying users, improve vehicle‑series recommendations, and achieve a 355% conversion rate increase.

AIDeep Learningcar buying
0 likes · 7 min read
Homepage Pop‑up Recommendation System for Car Purchase Intent: Background, Feature Engineering, Model and Strategy Optimization, and Results
HelloTech
HelloTech
Sep 13, 2023 · Artificial Intelligence

AI Platform‑Powered Automated Ticket Routing: Modeling Workflow, Feature Engineering, and Intent Recognition

The Haro AI platform automates customer‑service ticket routing by applying a four‑step pipeline—feature processing, model training, evaluation, and deployment—using BERT/ALBERT‑based intent recognition, configurable feature storage, AutoML or expert modes, and Faas‑style deployment, as demonstrated in the Universal Ticket System case study, dramatically improving accuracy and efficiency.

AI PlatformALBERTBERT
0 likes · 11 min read
AI Platform‑Powered Automated Ticket Routing: Modeling Workflow, Feature Engineering, and Intent Recognition
Bilibili Tech
Bilibili Tech
Jul 25, 2023 · Artificial Intelligence

Bilibili Game Center Recommendation System: Architecture, Core Technologies, and Experimental Results

The Bilibili Game Center recommendation system combines a unified feature platform, multi‑stage recall, ranking and re‑ranking models, online services, and AB experimentation to deliver personalized game suggestions, resulting in up to 78% higher click‑through, 76% higher conversion, and substantial increases in user engagement and revenue.

AB testingfeature engineeringgame-platform
0 likes · 26 min read
Bilibili Game Center Recommendation System: Architecture, Core Technologies, and Experimental Results
GuanYuan Data Tech Team
GuanYuan Data Tech Team
May 25, 2023 · Artificial Intelligence

How to Build a Comprehensive ML Model Quality Assessment Framework

This article explains why and how to evaluate machine learning model quality through a structured framework that covers data validation, feature checks, and algorithm testing, helping ensure accuracy, reliability, and maintainability before deployment.

AI GovernanceModel Evaluationdata validation
0 likes · 19 min read
How to Build a Comprehensive ML Model Quality Assessment Framework
HelloTech
HelloTech
May 8, 2023 · Artificial Intelligence

One‑Stop AI Platform for Cloud, Edge, Mobile, Flink, and Application Intelligence: Architecture, Challenges, and Solutions

The article presents a comprehensive one‑stop AI platform that unifies training, model, feature, and decision services across cloud, edge, mobile, Flink, and application environments, detailing its architecture, the limitations of cloud‑centric inference, the advantages of localized inference, and the challenges and solutions for model and feature localization, SDK design, and future AutoML enhancements.

AI PlatformDistributed SystemsFlink
0 likes · 17 min read
One‑Stop AI Platform for Cloud, Edge, Mobile, Flink, and Application Intelligence: Architecture, Challenges, and Solutions
HelloTech
HelloTech
Apr 12, 2023 · Artificial Intelligence

Integrating Machine Learning Ranking into Elasticsearch: Architecture, Components, and Performance

The team embedded a full machine‑learning ranking pipeline as an Elasticsearch plug‑in—combining real‑time and offline feature stores, hot‑loadable model jars via Dragonfly, an MLeap execution engine, and a DSL for feature definition—replacing the coarse‑ranking logistic‑regression with a tree model that adds ~10 ms latency but yields a 1.2 % AB‑test lift, while maintaining high throughput, low CPU usage, and supporting future batch deep‑learning rescoring.

Model Deploymentfeature engineeringonline prediction
0 likes · 16 min read
Integrating Machine Learning Ranking into Elasticsearch: Architecture, Components, and Performance
Meituan Technology Team
Meituan Technology Team
Apr 6, 2023 · Databases

AI-Driven Index Recommendation for Slow Queries at Meituan

This article details a joint research effort between Meituan and East China Normal University that combines cost‑based methods with AI‑driven, data‑centric models to automatically generate and evaluate missing indexes for billions of daily slow queries, improving recommendation accuracy and query performance.

AICost ModelIndex Recommendation
0 likes · 16 min read
AI-Driven Index Recommendation for Slow Queries at Meituan
Tencent Advertising Technology
Tencent Advertising Technology
Mar 30, 2023 · Artificial Intelligence

Tencent's Taiji Machine Learning Platform: End-to-End MLOps for Advertising

Tencent’s Taiji machine learning platform, a cloud‑native, distributed parameter‑server system, provides end‑to‑end MLOps for advertising by integrating data ingestion, feature engineering, model training, evaluation, deployment, and monitoring, supporting massive models up to billions of parameters while improving efficiency, scalability, and resource management.

Distributed TrainingMLOpsMachine Learning Platform
0 likes · 18 min read
Tencent's Taiji Machine Learning Platform: End-to-End MLOps for Advertising
DataFunTalk
DataFunTalk
Mar 28, 2023 · Artificial Intelligence

FeatHub: An Open‑Source Feature Store for Real‑Time and Offline Feature Engineering

This article introduces FeatHub, an open‑source feature‑store project from Alibaba Cloud that provides a Python SDK, flexible architecture, and execution engines such as Flink and Spark to simplify the development, deployment, monitoring, and sharing of real‑time and offline machine‑learning features across multi‑cloud environments.

Feature StoreFlinkPython SDK
0 likes · 21 min read
FeatHub: An Open‑Source Feature Store for Real‑Time and Offline Feature Engineering
DaTaobao Tech
DaTaobao Tech
Mar 13, 2023 · Artificial Intelligence

AI‑Driven 3D Content Creation and Optimization for E‑commerce

The article presents an AI‑driven pipeline that creates, delivers, and optimizes 3D e‑commerce content by leveraging diffusion‑based generation, txt2img/img2img style transfer, Shapley‑value interpretability, and a multi‑level traffic amplification framework to overcome modeling efficiency, asset scarcity, and production cost challenges.

3D contentAI Generationfeature engineering
0 likes · 12 min read
AI‑Driven 3D Content Creation and Optimization for E‑commerce
Tencent Cloud Developer
Tencent Cloud Developer
Mar 8, 2023 · Artificial Intelligence

Building a Scalable Recommendation System for WeChat Games: Architecture and Implementation

The article describes WeChat Games’ scalable recommendation system, detailing its four‑component architecture—offline ML platform, unified management, online DAG‑based engine, and peripheral services—along with a hybrid algorithm library, feature engineering, real‑time monitoring, and solutions that boost engagement across diverse game recommendation scenarios.

Data ManagementDeep LearningReal-time Processing
0 likes · 28 min read
Building a Scalable Recommendation System for WeChat Games: Architecture and Implementation
DataFunSummit
DataFunSummit
Mar 3, 2023 · Artificial Intelligence

Intelligent Risk Control System Architecture and Development Trends

This article introduces the architecture of intelligent risk control, detailing its four-layer structure, the underlying data, feature, model, and decision components, platform interactions, and future development trends, highlighting how AI and big data enhance risk management efficiency and accuracy.

Big DataDecision Systemsfeature engineering
0 likes · 12 min read
Intelligent Risk Control System Architecture and Development Trends
DataFunSummit
DataFunSummit
Jan 25, 2023 · Artificial Intelligence

Expert Insights on Recommendation System Architecture, Data, Features, Recall, Ranking and Evaluation

This interview compiles expert opinions on the end‑to‑end recommendation system pipeline—including architecture, data collection, user profiling, content structuring, feature engineering, recall strategies, ranking algorithms, multi‑objective optimization, multi‑modal fusion, re‑ranking, cold‑start solutions, evaluation metrics and real‑world applications—highlighting the technical challenges and practical solutions.

Evaluation Metricscold startfeature engineering
0 likes · 15 min read
Expert Insights on Recommendation System Architecture, Data, Features, Recall, Ranking and Evaluation
DataFunSummit
DataFunSummit
Jan 11, 2023 · Artificial Intelligence

Intelligent Financial Risk Control Platform Architecture and Expert Insights

This article outlines the architecture of an intelligent financial risk control platform, detailing data sources, big‑data processing, feature engineering, decision engines, model types, and real‑world application scenarios, while highlighting expert‑identified challenges such as compliance, data quality, real‑time performance, and fraud detection.

Financial AIdecision enginefeature engineering
0 likes · 11 min read
Intelligent Financial Risk Control Platform Architecture and Expert Insights
Xianyu Technology
Xianyu Technology
Dec 21, 2022 · Artificial Intelligence

Xianyu Recommendation System: Architecture, Challenges, and Deployment

The Xianyu recommendation system, built by backend expert Wan Xiaoyong, evolved from offline scoring to a full‑graph, serverless recall‑ranking pipeline that tackles C2C uncertainties through centralized feature engineering, model compression, staged deployment, flexible experimentation, robust governance, and plans for automated attribution and interpretability.

AIBig DataModel Deployment
0 likes · 10 min read
Xianyu Recommendation System: Architecture, Challenges, and Deployment
Hulu Beijing
Hulu Beijing
Dec 2, 2022 · Artificial Intelligence

How Disney+ Designs a Multi‑Task Video Search Ranking Model

This article explains the architecture of a video search ranking system that combines a deep encoding network, multi‑task expert networks, and a bias‑correction module to jointly optimize relevance, click‑through rate, and watch time for streaming platforms.

Bias CorrectionDeep Learningfeature engineering
0 likes · 15 min read
How Disney+ Designs a Multi‑Task Video Search Ranking Model
Alipay Experience Technology
Alipay Experience Technology
Nov 29, 2022 · Mobile Development

How Ant's Mobile Edge Computing Container Powers Real‑Time AI on Devices

This article explains the challenges of deploying intelligent features on mobile clients and describes Ant Group's edge computing container, its three‑layer architecture, real‑time compute, feature, and decision engines, and the low‑code platform that enables fast, stable, and scalable AI solutions on devices.

Real-time analyticsdecision enginefeature engineering
0 likes · 15 min read
How Ant's Mobile Edge Computing Container Powers Real‑Time AI on Devices
Python Programming Learning Circle
Python Programming Learning Circle
Nov 22, 2022 · Artificial Intelligence

Using tsfresh for Automated Time Series Feature Extraction in Python

This article introduces the tsfresh Python package, explains why traditional machine‑learning models struggle with time‑series data, and demonstrates how tsfresh can automatically generate and select hundreds of useful features—including statistical, nonlinear, and signal‑processing metrics—while supporting big‑data frameworks such as Dask and Spark.

PythonTime Seriesfeature engineering
0 likes · 5 min read
Using tsfresh for Automated Time Series Feature Extraction in Python
Model Perspective
Model Perspective
Oct 24, 2022 · Artificial Intelligence

Understanding RuleFit: Combining Tree Rules with Linear Models for Interpretable AI

RuleFit, introduced by Friedman and Popescu in 2008, integrates decision‑tree‑derived rules with linear regression to boost predictive accuracy while maintaining strong interpretability, and this article explains its definition, rule extraction, algorithmic implementation, code example, advantages, limitations, and practical insights.

RuleFitensemble treesfeature engineering
0 likes · 10 min read
Understanding RuleFit: Combining Tree Rules with Linear Models for Interpretable AI
Kuaishou Tech
Kuaishou Tech
Oct 21, 2022 · Artificial Intelligence

Real-time Short Video Recommendation on Mobile Devices: System Design, Model Architecture, and Experimental Evaluation

The paper presents a lightweight on‑device re‑ranking system for short‑video recommendation that leverages real‑time user feedback and context‑aware generative ranking, detailing its architecture, feature engineering, beam‑search optimization, and both offline and online experimental results showing significant performance gains.

Beam SearchContext-Awarefeature engineering
0 likes · 12 min read
Real-time Short Video Recommendation on Mobile Devices: System Design, Model Architecture, and Experimental Evaluation
vivo Internet Technology
vivo Internet Technology
Oct 9, 2022 · Artificial Intelligence

vivo Machine Learning Platform: Architecture Design and Practice

vivo’s machine‑learning platform, built for its massive app‑store and e‑commerce ecosystem, streamlines data processing, model training, and deployment through quota‑based resource management, a custom ultra‑large‑scale TensorFlow‑vlps framework, OpenAPI‑driven training, and Jupyter‑integrated interactive development, boosting efficiency for billions of samples and features.

Distributed TrainingMLOpsMachine Learning Platform
0 likes · 12 min read
vivo Machine Learning Platform: Architecture Design and Practice
ITPUB
ITPUB
Sep 15, 2022 · Artificial Intelligence

Why Precise Feature Engineering Still Matters in Recommendation Systems

In the era of deep learning, feature engineering remains crucial for recommendation and search advertising because it bridges raw relational data and models, improves performance, reduces complexity, and handles high‑cardinality, large‑scale, and time‑sensitive scenarios with robust transformations and statistical encoding.

AIdata preprocessingfeature engineering
0 likes · 20 min read
Why Precise Feature Engineering Still Matters in Recommendation Systems
Model Perspective
Model Perspective
Aug 31, 2022 · Fundamentals

How to Build a Watermelon Sweetness Dataset: From Field to Features

This article describes how the author collected a watermelon dataset, defined measurable features such as size, color, sugar content, seed count, and texture, and documented the process with photos, tables, and a brief discussion of data characteristics for future machine‑learning analysis.

data analysisdata collectionfeature engineering
0 likes · 12 min read
How to Build a Watermelon Sweetness Dataset: From Field to Features
DataFunTalk
DataFunTalk
Aug 30, 2022 · Artificial Intelligence

Feature Engineering for Recommendation and Search Advertising

This article explains why meticulous feature engineering remains crucial in recommendation and search advertising, outlines what constitutes good features, describes common transformation techniques such as scaling, binning, and encoding, and provides practical examples and Q&A for practitioners.

AIdata preprocessingfeature engineering
0 likes · 18 min read
Feature Engineering for Recommendation and Search Advertising
NetEase Yanxuan Technology Product Team
NetEase Yanxuan Technology Product Team
Aug 29, 2022 · Artificial Intelligence

Building Yanxuan Machine Learning Platform: Architecture and Implementation

Yanxuan built a Kubeflow‑based machine‑learning platform that unifies data preprocessing, feature engineering, model training, validation, and deployment, using Smart‑jobs, Smart‑Infer, Smart‑backend, Airflow pipelines, Jupyter notebooks, and Istio‑enhanced inference services to boost algorithm engineers’ efficiency and integrate with Kubernetes, HDFS, and Hive.

Airflow orchestrationAlgorithm DevelopmentInference Service
0 likes · 14 min read
Building Yanxuan Machine Learning Platform: Architecture and Implementation
DataFunSummit
DataFunSummit
Aug 25, 2022 · Big Data

Managing the Full Lifecycle of Risk Features: Pitfalls, Solutions, and Future Directions

The talk by Tang Gengyang from Citic Baixin Bank details the challenges faced in risk feature engineering, presents two solution frameworks (1.0 and 2.0) for accelerating deployment, improving reuse, handling offline/online consistency, and outlines future enhancements for a more efficient, automated feature pipeline.

Flinkasynchronous processingdata pipelines
0 likes · 14 min read
Managing the Full Lifecycle of Risk Features: Pitfalls, Solutions, and Future Directions
DataFunSummit
DataFunSummit
Aug 22, 2022 · Big Data

Design and Practice of 360ShuKe Risk Control System Architecture

This presentation details 360ShuKe's risk control system architecture, covering its layered design, credit data lifecycle management, real‑time indicator computation, feature platform evolution, and solutions to challenges such as data loss, rapid model iteration, and feature drift.

Credit Scoringdata pipelinefeature engineering
0 likes · 12 min read
Design and Practice of 360ShuKe Risk Control System Architecture
DataFunSummit
DataFunSummit
Aug 10, 2022 · Artificial Intelligence

Leveraging Cross-Industry Data and Quantum-Inspired Feature Engineering for SME Supply Chain Finance

This article presents Huace Data Science's practical approaches to digital supply‑chain finance for SMEs, detailing challenges of cross‑industry data, the SME engine for authentic business assessment, graph‑based fraud detection, and quantum‑inspired feature‑engineering methods that enhance credit‑risk models.

Big DataQuantum-Inspired Algorithmsfeature engineering
0 likes · 15 min read
Leveraging Cross-Industry Data and Quantum-Inspired Feature Engineering for SME Supply Chain Finance
Zhuanzhuan Tech
Zhuanzhuan Tech
Aug 3, 2022 · Backend Development

Rule Engine Definition, Execution, and Its Implementation in ZhiZhuan Risk Control

This article explains the concept and execution methods of rule engines, describes the forward‑link inference approach used by ZhiZhuan’s risk‑control engine, and details the evolution from hard‑coded rules to feature‑engineered expressions using the Aviator expression engine, highlighting configuration, data sources, and practical examples.

Expression Enginebackend-developmentfeature engineering
0 likes · 13 min read
Rule Engine Definition, Execution, and Its Implementation in ZhiZhuan Risk Control
Meituan Technology Team
Meituan Technology Team
Jul 6, 2022 · Artificial Intelligence

Engineering Practices for Large-Scale Deep Learning Models in Meituan Takeaway Advertising

The article details Meituan's engineering journey from small DNNs to hundred‑gigabyte deep learning models for food‑delivery ads, analyzing online latency and offline efficiency challenges and presenting distributed storage, CPU/GPU acceleration, OpenVINO, TensorRT, CodeGen, and data‑pipeline optimizations that dramatically improve throughput, memory usage, and sample‑building speed.

CPU accelerationDeep LearningGPU Acceleration
0 likes · 45 min read
Engineering Practices for Large-Scale Deep Learning Models in Meituan Takeaway Advertising