Tagged articles

feature engineering

290 articles · Page 1 of 3

Jun 22, 2026 · Artificial Intelligence

Who Won the 2026 Big Data Challenge Monthly Star Awards? Winners Share Their Competition Insights

The 2026 China University Big Data Challenge announced its Monthly Star winners, each receiving a prize, and the top three teams detailed their data processing, feature engineering, model design, training strategies, and post‑processing techniques for cross‑sectional stock ranking.

big data competitionfeature engineeringmachine learning

0 likes · 10 min read

Who Won the 2026 Big Data Challenge Monthly Star Awards? Winners Share Their Competition Insights

AI Large-Model Wave and Transformation Guide

Jun 17, 2026 · Artificial Intelligence

Don’t Mix Prediction, Reasoning, Inference, and Decision in the Ontology Era

The article explains how prediction, reasoning, and inference differ, why a pure prediction model leaves the decision chain broken, and how a dynamic ontology‑driven feature framework—temporal, functional, and relational features—creates explainable, verifiable, and iterative decision loops.

Ontologydecision makingdynamic ontology

0 likes · 9 min read

Don’t Mix Prediction, Reasoning, Inference, and Decision in the Ontology Era

Zhihu Tech Column

Jun 8, 2026 · Industry Insights

Unifying Global User Portrait Data to Power Business Growth

The project consolidates fragmented user data across departments into a unified OneID system, builds a comprehensive tagging and feature platform, and leverages real‑time and offline signals for algorithmic push, custom models, and crowd selection, enabling precise, data‑driven growth initiatives.

Data IntegrationID mappingfeature engineering

0 likes · 21 min read

Unifying Global User Portrait Data to Power Business Growth

Data Party THU

May 19, 2026 · Artificial Intelligence

Model Performance Lagging? Master Feature Engineering with a Complete Step‑by‑Step Guide

This article walks through the entire feature‑engineering pipeline—data cleaning, missing‑value imputation, encoding, outlier handling, scaling, feature construction, and selection—using Pandas and Scikit‑learn, and shows how to wrap the steps into a reproducible Scikit‑learn Pipeline.

Data preprocessingPandasScikit-learn

0 likes · 9 min read

Model Performance Lagging? Master Feature Engineering with a Complete Step‑by‑Step Guide

Data Party THU

May 15, 2026 · Artificial Intelligence

2026 Big Data Challenge Announces Monthly Star Winners and Shares Winning Teams’ Insights

The 2026 China University Computer Competition – Big Data Challenge reveals the Monthly Star award winners, each receiving 800 RMB, and presents detailed experience reports from the top teams covering feature engineering, model selection, training validation, and ensemble strategies for stock prediction.

Big DataModel FusionTime Series Validation

0 likes · 7 min read

2026 Big Data Challenge Announces Monthly Star Winners and Shares Winning Teams’ Insights

DeepHub IMBA

May 12, 2026 · Artificial Intelligence

Hands‑On Feature Engineering with Pandas and Scikit‑Learn: Complete Code Walkthrough

This article walks through a full feature‑engineering pipeline using Pandas and Scikit‑Learn, covering data inspection, missing‑value imputation, categorical encoding, outlier handling, scaling, feature construction, selection, and a final Pipeline that prepares clean, predictive features for a logistic‑regression model.

Data preprocessingPandasScikit-learn

0 likes · 9 min read

Hands‑On Feature Engineering with Pandas and Scikit‑Learn: Complete Code Walkthrough

PMTalk Product Manager Community

May 5, 2026 · Product Management

What Kind of Product Manager Drives Algorithm Engineers Crazy?

The article explains why algorithm engineers resent product managers who treat models as black‑boxes, make vague data‑blind demands, and ignore experimental cycles, and it offers three concrete practices—feature‑focused communication, metric quantification, and respecting experiment timelines—to become a trusted teammate.

Product Managementalgorithm collaborationcross‑functional communication

0 likes · 8 min read

What Kind of Product Manager Drives Algorithm Engineers Crazy?

Chen Tian Universe

Apr 28, 2026 · Industry Insights

A Beginner’s Guide to Payment Risk Control: All You Need in One Article

This article systematically introduces payment risk‑control fundamentals, covering core terminology, risk categories, black‑market attack methods, system architecture, data and feature pipelines, list management, rule engines, modeling techniques, decision flows, key performance metrics, compliance requirements, and operational best practices.

black market attackscompliancefeature engineering

0 likes · 34 min read

A Beginner’s Guide to Payment Risk Control: All You Need in One Article

Data Party THU

Apr 21, 2026 · Artificial Intelligence

Can LLM Attack Detection Work Without Storing Any Conversation Text?

This article experimentally evaluates a privacy‑preserving LLM security pipeline that discards raw dialogue after extracting 28 telemetry features, showing that using only 11 text‑independent signals retains about 98.5% of detection performance while reducing false‑positive rates.

LLM securityfeature engineeringjailbreak detection

0 likes · 10 min read

Can LLM Attack Detection Work Without Storing Any Conversation Text?

Huolala Safety Emergency Response Center

Apr 15, 2026 · Information Security

How to Auto‑Label 10K APIs with 95% Confidence Using Self‑Learning Feature Engineering

This article presents a detailed case study of how a large‑scale API security team built an automated, self‑learning classification system that tags tens of thousands of APIs with business labels, improves model accuracy by five points, and maintains high precision through a confidence‑driven feedback loop.

API SecurityCatBoostSHAP

0 likes · 13 min read

How to Auto‑Label 10K APIs with 95% Confidence Using Self‑Learning Feature Engineering

Huolala Tech

Apr 15, 2026 · Information Security

How We Built a Self‑Learning API Classification System for Security

This article details a real‑world case study of how a large logistics platform created an automated, self‑evolving API asset‑classification pipeline—covering data collection, feature engineering, model training with CatBoost, confidence‑based label feedback, and lessons learned—to improve API security monitoring and reduce manual labeling effort.

API SecurityCatBoostSHAP

0 likes · 13 min read

How We Built a Self‑Learning API Classification System for Security

DeepHub IMBA

Apr 6, 2026 · Artificial Intelligence

Mastering Machine Learning Feature Engineering: Scaling, Encoding, Aggregation, Embedding, and Automation

The article explains why good features matter more than fancy algorithms and walks through practical techniques—scaling, log transforms, binning, interaction, various encoding schemes, datetime extraction, text statistics, geospatial distances, aggregation, feature selection, and automated feature generation—illustrated with concrete pandas and scikit‑learn code examples.

AutomationPandasScikit-learn

0 likes · 16 min read

Mastering Machine Learning Feature Engineering: Scaling, Encoding, Aggregation, Embedding, and Automation

Woodpecker Software Testing

Apr 5, 2026 · Artificial Intelligence

How Open‑Source Test Prediction Analytics Makes Testing Smarter: A Practical Guide

This article explains how open‑source test prediction analytics can transform limited testing resources into smarter, data‑driven test selection, detailing the end‑to‑end stack, real‑world case study results, common pitfalls, and steps toward autonomous testing.

CI/CDOpen-sourceautonomous testing

0 likes · 8 min read

How Open‑Source Test Prediction Analytics Makes Testing Smarter: A Practical Guide

PMTalk Product Manager Community

Mar 2, 2026 · Product Management

The Product Manager Traits That Drive Algorithm Engineers Crazy

Algorithm engineers often resent product managers who treat models as black‑box wish‑lists, ignore data feasibility, and issue vague demands, leading to broken trust, wasted effort, and stalled projects, but adopting feature‑focused communication, concrete metrics, and realistic experiment timelines can turn a problematic partnership into a productive collaboration.

Product Managementalgorithm collaborationexperiment planning

0 likes · 8 min read

The Product Manager Traits That Drive Algorithm Engineers Crazy

Data Party THU

Jan 25, 2026 · Big Data

How Tsinghua’s Big Data Initiative Boosted Refinery Energy Forecasts with GRU

The Tsinghua University Big Data Capability Project applied GRU‑based deep learning, pulse‑event encoding, and advanced feature engineering to transform discrete refinery energy data into continuous sequences, achieving prediction accuracies of 84.2%, 82.7% and 81.6% for fuel gas, medium‑pressure and low‑pressure steam respectively.

GRUenergy predictionfeature engineering

0 likes · 9 min read

How Tsinghua’s Big Data Initiative Boosted Refinery Energy Forecasts with GRU

Past Memory Big Data

Dec 9, 2025 · Artificial Intelligence

A Decade of Evolution: Inside Pinterest’s AI Platform Journey

Over ten years Pinterest transformed a fragmented machine‑learning stack into a unified AI platform, iterating through stages from early ad‑hoc pipelines to scalable GPU‑accelerated services, while learning that timing, organization alignment, and efficiency are crucial for lasting impact.

AI platformGPU inferenceML Ops

0 likes · 25 min read

A Decade of Evolution: Inside Pinterest’s AI Platform Journey

Kuaishou Tech

Nov 12, 2025 · Artificial Intelligence

How KaiFG Lets Python Feature Engineering Run at C++ Speed

KaiFG, Kuaishou's self‑built AI Feature Generator, unifies fragmented feature extraction frameworks, replaces slow C++ compilation cycles with Python‑level development, and achieves near‑C++ performance through Codon‑based compilation, reference‑counted memory management, and aggressive LLVM optimizations, dramatically shortening iteration time.

AI InfrastructureHigh-performance computingfeature engineering

0 likes · 14 min read

How KaiFG Lets Python Feature Engineering Run at C++ Speed

Instant Consumer Technology Team

Oct 29, 2025 · Big Data

Revolutionizing Feature Engineering with Distributed Tech & Configurable Services

Facing PB‑scale user behavior data and millions of feature dimensions, the platform transformed its search, advertising, and recommendation pipelines by adopting a distributed, configurable‑service architecture that delivers high‑throughput streaming, elastic storage, rapid feature iteration, and robust fault‑tolerance for AI‑driven personalization.

Big DataData ArchitectureReal-time Processing

0 likes · 17 min read

Revolutionizing Feature Engineering with Distributed Tech & Configurable Services

Instant Consumer Technology Team

Oct 28, 2025 · Artificial Intelligence

Can Data Virtualization Deliver Millisecond Real‑Time Features Across Stores?

This article shares a three‑year journey of building a data‑virtualization‑based, multi‑environment feature management framework for real‑time risk decision platforms, detailing challenges like heterogeneous storage, cold‑start, and operational stability, and presenting a unified architecture that decouples physical storage from business logic.

Big DataData Virtualizationfeature engineering

0 likes · 16 min read

Can Data Virtualization Deliver Millisecond Real‑Time Features Across Stores?

Data STUDIO

Oct 28, 2025 · Artificial Intelligence

8 Proven Ways to Boost Machine Learning Model Accuracy

This article outlines eight practical techniques—including data augmentation, handling missing values, feature engineering, algorithm selection, hyperparameter tuning, ensemble methods, and cross‑validation—to systematically improve the accuracy of Python machine‑learning models, supported by explanations, examples, and code snippets.

Data preprocessingcross-validationensemble methods

0 likes · 16 min read

8 Proven Ways to Boost Machine Learning Model Accuracy

IT Services Circle

Sep 28, 2025 · Artificial Intelligence

How to Build a Python AI Model for Predicting User Behavior

This article walks through the complete machine‑learning workflow for predicting user actions—covering core concepts, data collection, preprocessing, feature engineering, model training, evaluation, hyper‑parameter tuning, deployment, and future directions—using Python and popular AI libraries.

Pythonfeature engineeringmodel evaluation

0 likes · 11 min read

How to Build a Python AI Model for Predicting User Behavior

Data Party THU

Sep 12, 2025 · Big Data

Key Lessons from Winning the 2025 China University Big Data Competition

The author shares a detailed account of their experience in the 2025 China University Big Data Competition, describing the team’s top national ranking, the shift from absolute stock price prediction to robust ranking learning, extensive feature engineering, and reflections on balancing technical ambition with real‑world constraints.

Big Datadata competitionfeature engineering

0 likes · 5 min read

Key Lessons from Winning the 2025 China University Big Data Competition

Data Party THU

Sep 11, 2025 · Big Data

How We Conquered the 2025 Chinese University Big Data Challenge: Financial Time‑Series Lessons

Our team "Stay Overnight" from Chongqing University of Posts and Telecommunications placed second nationally in the 2025 China University Computer Competition Big Data Challenge, navigating volatile financial data, shifting from time‑series to supervised learning, and emphasizing feature engineering to boost model performance.

Big Datacompetition reportfeature engineering

0 likes · 4 min read

How We Conquered the 2025 Chinese University Big Data Challenge: Financial Time‑Series Lessons

Data Party THU

Sep 8, 2025 · Big Data

What We Learned from the 2025 China University Big Data Competition

The article shares a top‑5 team's experience in the 2025 China University Big Data Challenge, detailing their roster, competition rules, four key technical insights on data pitfalls, model alignment, generalization, and leveraging SOTA models, plus reflections on the event's excellent support and collaborative atmosphere.

Big Datafeature engineeringmodel generalization

0 likes · 6 min read

What We Learned from the 2025 China University Big Data Competition

Instant Consumer Technology Team

Aug 29, 2025 · Fundamentals

Master Variable Feature Engineering: IV, PSI, WOE & Dimensionality Reduction

Learn how to transform raw data into model-ready features through comprehensive preprocessing, variable derivation, dimensionality reduction, and the application of IV, PSI, and WOE metrics, with practical guidelines, thresholds, and visual examples to enhance risk model performance.

IVPSIWOE

0 likes · 16 min read

Tencent Advertising Technology

Aug 27, 2025 · Artificial Intelligence

How Three Undergrads Turned a Last‑Minute Team into a Top‑4 Finish in Tencent’s AI Competition

A trio of freshly graduated undergraduates formed a team just two days before the deadline, overcame early setbacks by reordering feature engineering, leveraged score‑driven decisions, and ultimately surged to fourth place in Tencent's highly competitive algorithm contest, illustrating resilience and practical AI teamwork.

AI competitionTencentfeature engineering

0 likes · 7 min read

How Three Undergrads Turned a Last‑Minute Team into a Top‑4 Finish in Tencent’s AI Competition

Alibaba Cloud Developer

Jul 17, 2025 · Artificial Intelligence

How to Build a House Price Prediction Model with Python: A Step‑by‑Step Guide

This tutorial walks developers through the complete workflow of building a house‑price regression model—from problem definition, data collection and preprocessing, feature engineering, and model selection, to training, hyper‑parameter tuning, evaluation, optimization, deployment as a Flask service, and ongoing monitoring—using Python, pandas, scikit‑learn, and visualisation libraries.

Model DeploymentPythonRegression

0 likes · 29 min read

How to Build a House Price Prediction Model with Python: A Step‑by‑Step Guide

JD Retail Technology

Jun 10, 2025 · Artificial Intelligence

How JD Builds a Scalable AI‑Powered Recommendation Data System with Flink

This article explains JD's complex recommendation system data pipeline—from indexing, sampling, and feature engineering to explainability and real‑time metrics—highlighting challenges such as data consistency, latency, and the use of Flink for massive, low‑latency processing.

FlinkReal-time Dataexplainability

0 likes · 23 min read

How JD Builds a Scalable AI‑Powered Recommendation Data System with Flink

Architect

May 31, 2025 · Artificial Intelligence

Edge Intelligence Implementation in the Vivo Official App: Architecture, Feature Engineering, and Model Deployment

The article details how edge intelligence is applied to the Vivo official app to improve product recommendation on the smart‑hardware floor by abstracting the problem, designing feature engineering pipelines, training TensorFlow models, converting them to TFLite, and deploying inference on mobile devices, while also covering monitoring and performance considerations.

Mobile AppModel DeploymentTensorFlow Lite

0 likes · 19 min read

Edge Intelligence Implementation in the Vivo Official App: Architecture, Feature Engineering, and Model Deployment

AI Algorithm Path

May 27, 2025 · Artificial Intelligence

Reinforcement Learning Tutorial 8: Building State Feature Representations for Objective Optimization

This tutorial explains how to construct state feature vectors for reinforcement‑learning value‑function approximation, covering linear, polynomial, Fourier, and radial‑basis representations, as well as state aggregation techniques such as coarse coding and tile coding, and discusses non‑parametric approaches like kernel methods.

feature engineeringfourier basisfunction approximation

0 likes · 16 min read

Reinforcement Learning Tutorial 8: Building State Feature Representations for Objective Optimization

vivo Internet Technology

May 21, 2025 · Artificial Intelligence

How Vivo’s App Leverages Edge AI to Personalize Product Recommendations

This article details how Vivo’s official app implements edge intelligence to dynamically rank and recommend hardware products on its homepage, covering problem abstraction, data collection, feature engineering, model design, TensorFlow‑Lite conversion, on‑device inference, and monitoring for a personalized user experience.

AndroidModel DeploymentTensorFlow Lite

0 likes · 19 min read

How Vivo’s App Leverages Edge AI to Personalize Product Recommendations

Past Memory Big Data

Apr 19, 2025 · Artificial Intelligence

Databricks Acquires Fennel: Is Real-Time Computing + AI the Ultimate Data Platform?

The article examines Databricks' acquisition of the incremental computation engine Fennel, detailing how its unified batch‑stream processing, incremental updates, Python‑native development, and built‑in data governance can eliminate data silos, cut costs by up to 90 % and accelerate real‑time feature engineering for AI models, while also discussing industry impact and future roadmap.

AI InfrastructureDatabricksFennel

0 likes · 6 min read

Databricks Acquires Fennel: Is Real-Time Computing + AI the Ultimate Data Platform?

Python Programming Learning Circle

Mar 24, 2025 · Artificial Intelligence

Comprehensive List of Aggregation Functions and Custom Feature Engineering Utilities for Python

This article presents a detailed collection of built‑in pandas aggregation methods and numerous custom Python functions for time‑series feature engineering, offering beginners practical tools to enhance data preprocessing and model performance in machine‑learning projects.

Pandasaggregation functionsfeature engineering

0 likes · 10 min read

Comprehensive List of Aggregation Functions and Custom Feature Engineering Utilities for Python

Cognitive Technology Team

Mar 17, 2025 · Artificial Intelligence

Leveraging Large Language Models to Optimize Traditional Machine Learning Pipelines

Large language models can assist and enhance each stage of traditional machine learning—including sample generation, data cleaning, feature engineering, model selection, hyper‑parameter tuning, and workflow automation—by generating synthetic data, refining features, selecting models, and orchestrating pipelines, though challenges such as bias, privacy, and noise remain.

Data GenerationLLMfeature engineering

0 likes · 11 min read

Leveraging Large Language Models to Optimize Traditional Machine Learning Pipelines

JavaEdge

Mar 15, 2025 · Artificial Intelligence

Boost NLP Model Performance with n-gram Feature Engineering

This article explains why feature engineering is crucial for NLP tasks, introduces n‑gram enhancements, provides Python implementations for generating bi‑gram and higher‑order features, demonstrates dynamic padding for text length standardization, and offers practical deployment tips such as feature dimension control and monitoring.

Deep LearningN-gramNLP

0 likes · 7 min read

Boost NLP Model Performance with n-gram Feature Engineering

Alibaba Cloud Developer

Mar 6, 2025 · Artificial Intelligence

From Linear Regression to Transformers: Mastering Machine Learning Foundations

This comprehensive guide walks readers through the evolution of machine learning, starting with basic linear models and feature engineering, progressing through logistic regression, decision trees, and deep learning architectures like MLPs, CNNs, RNNs, and transformers, and demonstrates practical implementations with code examples and evaluation metrics.

Deep LearningEvaluation MetricsRecommendation Systems

0 likes · 64 min read

From Linear Regression to Transformers: Mastering Machine Learning Foundations

AI Code to Success

Feb 25, 2025 · Artificial Intelligence

Master Logistic Regression: Theory, Practice, and Real‑World Tips

This comprehensive guide explains logistic regression fundamentals, the role of the Sigmoid function, loss and optimization methods, step‑by‑step Python implementation with data preparation, model training, evaluation, hyper‑parameter tuning, handling over‑ and under‑fitting, multi‑class extensions, and diverse application scenarios across medicine, finance, e‑commerce, and text analysis.

PythonScikit-learnclassification

0 likes · 23 min read

Master Logistic Regression: Theory, Practice, and Real‑World Tips

Airbnb Technology Team

Jan 24, 2025 · Artificial Intelligence

Chronon — An Open-Source Framework for Production-Level Feature Engineering in Machine Learning

Chronon is an open‑source framework that centralizes feature definitions to guarantee training‑inference consistency, eliminates complex ETL pipelines, and supports real‑time and batch processing across diverse data sources, cutting feature‑development cycles from months to under a week, as demonstrated by Airbnb’s 40,000‑feature deployment.

ChrononHiveReal-time Data

0 likes · 10 min read

Chronon — An Open-Source Framework for Production-Level Feature Engineering in Machine Learning

Bilibili Tech

Dec 27, 2024 · Big Data

Consistency Architecture for Bilibili Recommendation Model Data Flow

The article outlines Bilibili’s revamped recommendation data‑flow architecture that eliminates timing and calculation inconsistencies by snapshotting online features, unifying feature computation in a single C++ library accessed via JNI, and orchestrating label‑join and sample extraction through near‑line Kafka/Flink pipelines, with further performance gains and Iceberg‑based future extensions.

Data ConsistencyFlinkIceberg

0 likes · 12 min read

Consistency Architecture for Bilibili Recommendation Model Data Flow

Tencent Advertising Technology

Dec 6, 2024 · Big Data

Building a High‑Performance Advertising Feature Data Lake with Apache Iceberg at Tencent

Tencent's advertising team replaced a traditional HDFS‑Hive warehouse with an Apache Iceberg‑based data lake, adding primary‑key tables, multi‑stream merging, adaptive compaction, and Spark SPJ optimizations to achieve minute‑level feature update latency, 10× back‑fill speed, and up to 60% storage savings.

Big DataCDCCompaction

0 likes · 25 min read

Building a High‑Performance Advertising Feature Data Lake with Apache Iceberg at Tencent

Test Development Learning Exchange

Nov 26, 2024 · Artificial Intelligence

Comprehensive Python Tutorial for Data Preprocessing, Feature Engineering, Model Training, Evaluation, and Deployment

This tutorial walks through consolidating the first ten days of learning by covering data preprocessing, feature engineering, model training with linear regression, decision tree, and random forest, model evaluation using cross‑validation, and finally saving and loading the best model, all illustrated with complete Python code examples.

Data preprocessingModel TrainingPython

0 likes · 9 min read

Comprehensive Python Tutorial for Data Preprocessing, Feature Engineering, Model Training, Evaluation, and Deployment

DataFunSummit

Nov 20, 2024 · Artificial Intelligence

How Data Lakes Empower AI: Expert Insights on Feature Management, Columnar Storage, and Vector Formats

In a panel discussion, experts explain how data‑lake‑warehouse integration, columnar formats like Apache Iceberg, and emerging variant types enable efficient feature engineering, support large‑language‑model workloads, and provide flexible vector storage, thereby driving the evolution of AI from traditional ML to the GenAI era.

Apache IcebergArtificial IntelligenceData Lake

0 likes · 6 min read

How Data Lakes Empower AI: Expert Insights on Feature Management, Columnar Storage, and Vector Formats

DataFunTalk

Nov 6, 2024 · Big Data

How Data Lakes Empower AI: Insights from Industry Experts

In a panel discussion, experts from Kuaishou, Ping An, and Datastrato explain how data lake architectures, columnar storage formats like Apache Iceberg, and vector‑enabled lake formats are enhancing feature management, supporting generative AI workloads, and accelerating machine‑learning pipelines.

AIApache IcebergBig Data

0 likes · 6 min read

How Data Lakes Empower AI: Insights from Industry Experts

Test Development Learning Exchange

Oct 28, 2024 · Big Data

Data Preprocessing with Pandas: A Comprehensive Guide

This article provides a comprehensive guide to data preprocessing using Pandas, covering essential steps like data cleaning, feature engineering, and data transformation for machine learning projects.

Dataset SplittingPandascategorical encoding

0 likes · 5 min read

Data Preprocessing with Pandas: A Comprehensive Guide

DataFunSummit

Oct 11, 2024 · Artificial Intelligence

Feature Production and Component Modeling in the Intelligent Era: From Feature Generation to Modular Modeling

This article introduces a cloud‑based feature production platform that simplifies feature engineering for recommendation, risk control and machine learning, explains its component‑based modeling framework, and answers common questions about deployment, performance, and customization, highlighting cross‑platform compatibility and optimization techniques.

Artificial IntelligenceBig DataFeature Store

0 likes · 19 min read

Feature Production and Component Modeling in the Intelligent Era: From Feature Generation to Modular Modeling

Python Programming Learning Circle

Sep 10, 2024 · Artificial Intelligence

Time Series Feature Engineering Techniques in Python

This article explains how to extract a variety of date‑time based features—including date, time, lag, rolling, expanding, and domain‑specific attributes—from a time‑series dataset using pandas, and discusses proper validation strategies for building reliable forecasting models.

Pandasfeature engineeringforecasting

0 likes · 14 min read

Time Series Feature Engineering Techniques in Python

DataFunTalk

Jul 27, 2024 · Information Security

Classification of Risk Control and Full-Scenario Anti-Cheat Strategies in the Internet

The article outlines how internet and financial risk control are categorized into anti‑cheat, anti‑fraud, and content security, describes full‑scenario cheating types, and presents a three‑step joint defense framework using perception, identification, and mitigation with feature‑based analysis.

anti-cheatfeature engineeringfraud detection

0 likes · 7 min read

Classification of Risk Control and Full-Scenario Anti-Cheat Strategies in the Internet

iQIYI Technical Product Team

Jul 5, 2024 · Big Data

RiskFactor: An Integrated Real‑Time and Offline Feature Platform for Risk Control

RiskFactor unifies iQIYI’s legacy real‑time and offline feature platforms onto Opal’s DAG‑plus‑SQL engine, accelerating feature production fifteen‑fold, cutting latency from hours to minutes, streamlining development, lowering costs, and delivering more reliable, versioned risk‑control capabilities against sophisticated online threats.

Big DataDAGReal-time Streaming

0 likes · 14 min read

RiskFactor: An Integrated Real‑Time and Offline Feature Platform for Risk Control

Ops Development & AI Practice

Jun 30, 2024 · Fundamentals

What Are Vector Components and Why They Matter in Machine Learning

This article explains the concept of vector components, how they are represented as column or row vectors, zero and unit vectors, basic operations, and their crucial role in machine‑learning tasks such as feature representation, distance measurement, PCA, and gradient descent.

PCAfeature engineeringlinear algebra

0 likes · 4 min read

What Are Vector Components and Why They Matter in Machine Learning

Python Programming Learning Circle

Jun 21, 2024 · Artificial Intelligence

Using scikit-learn for Data Mining: Feature Engineering, Parallel Processing, Pipelines, and Model Persistence

This article demonstrates how to perform data mining with scikit-learn by detailing the full workflow—from data acquisition and feature engineering, through parallel and pipeline processing, to automated hyper‑parameter tuning and model persistence—using the Iris dataset as an example.

Scikit-learndata miningfeature engineering

0 likes · 13 min read

Using scikit-learn for Data Mining: Feature Engineering, Parallel Processing, Pipelines, and Model Persistence

DataFunTalk

Jun 13, 2024 · Artificial Intelligence

A/B Testing and Model Grayscale in Credit Risk Control: Concepts, Requirements, and Integrated Solutions

This article explains how A/B testing and model grayscale are applied in credit risk control, discusses the specific requirements for effective testing, compares upstream and risk‑system traffic splitting methods, and proposes an integrated all‑in‑one solution to simplify feature engineering, model evaluation, and deployment.

A/B testingcredit riskfeature engineering

0 likes · 5 min read

A/B Testing and Model Grayscale in Credit Risk Control: Concepts, Requirements, and Integrated Solutions

DataFunTalk

Jun 11, 2024 · Artificial Intelligence

Intelligent Risk Control: Concepts, Challenges, and Integrated Operational Architecture for Banking

This article explores the concept of intelligent risk control in banking, detailing its AI‑driven architecture, current challenges such as external data costs and model‑deployment friction, and proposes an integrated operational framework that leverages big data, knowledge graphs, and MLOps to enhance risk detection and decision‑making.

Artificial IntelligenceMLOpsbanking

0 likes · 14 min read

Intelligent Risk Control: Concepts, Challenges, and Integrated Operational Architecture for Banking

DataFunSummit

Jun 7, 2024 · Artificial Intelligence

Understanding Feature Engineering for Risk Control Systems and Building an Easy-to-Use Feature Platform

Feature engineering, the process of creating input variables for machine learning models, is crucial for banking risk control; this article explains the concepts of features, variables, and metrics, outlines challenges in real‑time feature pipelines, and proposes a practical architecture and best practices for building an efficient, low‑code feature platform.

Platform designReal-time Datafeature engineering

0 likes · 10 min read

Understanding Feature Engineering for Risk Control Systems and Building an Easy-to-Use Feature Platform

DataFunSummit

Jun 5, 2024 · Fundamentals

User Portrait Tagging: Construction, Feature Processing, and Evaluation

This article provides a comprehensive guide on building user portrait tags—from basic attribute tags to business and strategy tags—detailing data collection methods, feature engineering techniques such as cleaning, time decay, and smoothing, and evaluation metrics for cohesion and stability, aimed at data product managers and analysts.

Evaluation MetricsProduct Managementfeature engineering

0 likes · 12 min read

User Portrait Tagging: Construction, Feature Processing, and Evaluation

DataFunTalk

Jun 4, 2024 · Artificial Intelligence

Building an Integrated Intelligent Risk Control System for Banking

The article explores the concept, challenges, and future directions of intelligent banking risk control, emphasizing data integration, AI-driven modeling, feature engineering, MLOps, knowledge graphs, and large‑model applications to create a unified, automated risk management platform.

AIMLOpsbanking

0 likes · 10 min read

Building an Integrated Intelligent Risk Control System for Banking

iQIYI Technical Product Team

May 31, 2024 · Artificial Intelligence

How Opal Turns iQIYI’s ML Workflow into a Unified AI Platform

Opal is iQIYI's end‑to‑end machine‑learning platform that integrates feature production, sample construction, model training, and deployment with big‑data services, addressing duplicated effort, weak data processing, and fragmented pipelines to boost efficiency across recommendation, advertising, and risk‑control scenarios.

AI OperationsData ValidationMachine Learning Platform

0 likes · 19 min read

How Opal Turns iQIYI’s ML Workflow into a Unified AI Platform

Python Programming Learning Circle

May 23, 2024 · Artificial Intelligence

Comprehensive Collection of Aggregation Functions for Feature Engineering in Python

This article presents a detailed compilation of pandas built‑in aggregation methods and a wide range of custom Python functions for time‑series feature engineering, providing ready‑to‑use code snippets that cover statistical descriptors, drawdown metrics, peak detection, and more for data science practitioners.

AggregationPandasPython

0 likes · 17 min read

Comprehensive Collection of Aggregation Functions for Feature Engineering in Python

StarRocks

Apr 2, 2024 · Big Data

How We Unified Real‑Time and Batch Financial Risk Features with StarRocks

This article details the challenges of maintaining separate real‑time and batch risk‑control features, evaluates Lambda and Kappa architectures, explores storage‑unified and compute‑unified alternatives, compares Hologres, StarRocks and ClickHouse, and presents a validated StarRocks‑based solution that dramatically reduces feature delivery latency and improves accuracy.

Kappa architectureLambda architecturefeature engineering

0 likes · 19 min read

How We Unified Real‑Time and Batch Financial Risk Features with StarRocks

Didi Tech

Mar 28, 2024 · Big Data

How We Unified Real‑Time and Batch Features with StarRocks in Financial Risk Control

This article analyzes the challenges of building real‑time and batch risk‑control features, compares Lambda and Kappa architectures, evaluates storage‑unified and compute‑unified solutions, and details how StarRocks was selected, validated, and deployed to achieve high‑performance, low‑latency feature serving in a financial context.

Big DataData WarehouseStarRocks

0 likes · 19 min read

How We Unified Real‑Time and Batch Features with StarRocks in Financial Risk Control

DataFunSummit

Mar 26, 2024 · Fundamentals

User Profile Tagging: Construction, Feature Processing, and Evaluation

This article systematically explains the fundamentals of user profile tags, covering basic attribute tags, business and strategy-oriented tags, detailed feature processing methods such as anomaly handling, time decay, and smoothing, and provides evaluation metrics for cohesion and stability, supplemented by a practical Q&A session.

KPI alignmentRFM modelfeature engineering

0 likes · 12 min read

HelloTech

Mar 14, 2024 · Artificial Intelligence

Feature Engineering: Concepts, Methods, and Automation

Feature engineering transforms existing data into new predictive variables through manual analysis or automated pipelines, encompassing single‑variable encoding, pairwise arithmetic, group‑statistics, multi‑variable combinations, time‑series and text derivations, with tools like Deep Feature Synthesis and beam‑search to generate and select useful features.

Data preprocessingautomated featuresfeature derivation

0 likes · 17 min read

Feature Engineering: Concepts, Methods, and Automation

Efficient Ops

Mar 10, 2024 · Databases

How Machine Learning Can Automate MySQL Index Optimization

This article explains how applying machine learning to database operations—specifically AIOps for MySQL—can automate index recommendation by parsing SQL, extracting semantic and statistical features, generating candidate index combinations, and training an XGBoost model to predict optimal indexes, reducing reliance on manual DBA work.

AIOpsIndex OptimizationMySQL

0 likes · 10 min read

How Machine Learning Can Automate MySQL Index Optimization

DataFunTalk

Feb 5, 2024 · Fundamentals

User Portrait Tagging: Construction, Feature Processing, and Evaluation

This article explains how to build user portrait tags—from basic attribute tags to business and strategy tags—covers methods for data collection, anomaly handling, time decay, smoothing, and evaluates tag quality using cohesion, stability, and AUC-related metrics to support data‑driven product decisions.

Evaluation Metricsdata sciencefeature engineering

0 likes · 12 min read

Data Thinking Notes

Jan 16, 2024 · Fundamentals

Building Comprehensive Risk Feature Portraits for All Loan Stages

This article explains how to construct risk control feature portraits across four key scenarios—marketing, pre‑loan, in‑loan, and post‑loan—by selecting appropriate data dimensions, describing usable customer, behavior, and ID‑linked data, and illustrating each portrait with visual examples to guide accurate risk assessment.

credit riskdata dimensionsfeature engineering

0 likes · 9 min read

Building Comprehensive Risk Feature Portraits for All Loan Stages

Sohu Tech Products

Jan 10, 2024 · Artificial Intelligence

Baidu's Practices and Insights on Recommendation Ranking

Baidu’s recommendation ranking system handles billions of daily impressions and millions of users by combining discrete and cross features, bias mitigation, and long‑short sequence modeling within a multi‑stage funnel and hierarchical architecture, while planning to integrate large language models for generative, interpretable, and decision‑oriented recommendations.

AIBaiduRecommendation Systems

0 likes · 19 min read

Baidu's Practices and Insights on Recommendation Ranking

DataFunTalk

Jan 7, 2024 · Artificial Intelligence

Baidu's Recommendation Ranking: Background, Feature Design, Algorithms, Architecture, and Future Directions

This article presents Baidu's comprehensive approach to feed recommendation ranking, covering business and data background, feature engineering principles, core algorithmic strategies, system architecture design, and upcoming plans to integrate large language models for more intelligent and fair recommendations.

BaiduRecommendation Systemsfeature engineering

0 likes · 19 min read

Baidu's Recommendation Ranking: Background, Feature Design, Algorithms, Architecture, and Future Directions

Huolala Tech

Dec 19, 2023 · Information Security

How to Build Effective Bot Management: Strategies, Architecture, and Tools

This article explains bot fundamentals, classifies bot types, analyzes the rising threat of malicious bot traffic, compares major vendor solutions, and outlines a four‑layer architecture with data, feature, model, and policy layers for robust bot management in modern web services.

Bot ManagementTraffic analysiscloud security

0 likes · 14 min read

How to Build Effective Bot Management: Strategies, Architecture, and Tools

DataFunSummit

Dec 15, 2023 · Artificial Intelligence

Integrating Large Language Models into Recommender Systems: Opportunities, Methods, and Challenges

This article explores how large language models can be incorporated into recommender systems, discussing background challenges, specific integration points across the recommendation pipeline, practical implementation methods, experimental results, and future research directions, while highlighting industrial considerations and potential improvements.

Industrial ApplicationsLLMModel Fusion

0 likes · 20 min read

Integrating Large Language Models into Recommender Systems: Opportunities, Methods, and Challenges

Test Development Learning Exchange

Oct 10, 2023 · Artificial Intelligence

Feature Engineering Techniques for Various Business Scenarios with Python Code Examples

This article presents practical feature‑engineering methods for ten common business domains, explaining the purpose of each feature, the extraction technique, and providing ready‑to‑run Python code snippets to help build more accurate predictive models.

Business AnalyticsData preprocessingfeature engineering

0 likes · 7 min read

Feature Engineering Techniques for Various Business Scenarios with Python Code Examples

DataFunTalk

Oct 10, 2023 · Artificial Intelligence

Integrating Large Language Models into Recommender Systems: Opportunities, Methods, and Challenges

This article surveys how large language models can be incorporated into recommender systems, discussing their strengths and limitations, outlining where and how they can be applied across the recommendation pipeline, presenting recent research examples, and highlighting challenges and future directions for industrial deployment.

LLMfeature engineeringrecommender systems

0 likes · 20 min read

政采云技术

Oct 10, 2023 · Artificial Intelligence

Predicting Membership Purchase with Logistic Regression: Feature Engineering, Model Training, Evaluation, and Deployment

This article presents a complete workflow for predicting whether users will purchase a membership using logistic regression, covering data collection, feature selection, handling imbalanced samples, model training, hyper‑parameter tuning, threshold optimization, evaluation metrics such as accuracy, precision, recall, AUC, lift, and finally deployment on a big‑data platform with PySpark.

Big Datafeature engineeringlogistic regression

0 likes · 17 min read

Predicting Membership Purchase with Logistic Regression: Feature Engineering, Model Training, Evaluation, and Deployment

Test Development Learning Exchange

Sep 29, 2023 · Fundamentals

Master Essential Data Preprocessing Techniques for Accurate Analysis

This guide walks through ten core data preprocessing methods—including handling missing values, type conversion, standardization, encoding, smoothing, outlier treatment, text cleaning, word‑frequency counting, sentiment analysis, and topic modeling—each illustrated with concise Python code examples.

Data preprocessingPandasPython

0 likes · 9 min read

Master Essential Data Preprocessing Techniques for Accurate Analysis

HomeTech

Sep 21, 2023 · Artificial Intelligence

Homepage Pop‑up Recommendation System for Car Purchase Intent: Background, Feature Engineering, Model and Strategy Optimization, and Results

This article details how AutoHome's homepage pop‑up leverages precise targeting, extensive feature engineering, and multi‑stage DeepFM‑based models with attention and LHUC modules to accurately identify car‑buying users, improve vehicle‑series recommendations, and achieve a 355% conversion rate increase.

AIDeep Learningcar buying

0 likes · 7 min read

Homepage Pop‑up Recommendation System for Car Purchase Intent: Background, Feature Engineering, Model and Strategy Optimization, and Results

HelloTech

Sep 13, 2023 · Artificial Intelligence

AI Platform‑Powered Automated Ticket Routing: Modeling Workflow, Feature Engineering, and Intent Recognition

The Haro AI platform automates customer‑service ticket routing by applying a four‑step pipeline—feature processing, model training, evaluation, and deployment—using BERT/ALBERT‑based intent recognition, configurable feature storage, AutoML or expert modes, and Faas‑style deployment, as demonstrated in the Universal Ticket System case study, dramatically improving accuracy and efficiency.

AI platformALBERTBERT

0 likes · 11 min read

AI Platform‑Powered Automated Ticket Routing: Modeling Workflow, Feature Engineering, and Intent Recognition

Bilibili Tech

Jul 25, 2023 · Artificial Intelligence

Bilibili Game Center Recommendation System: Architecture, Core Technologies, and Experimental Results

The Bilibili Game Center recommendation system combines a unified feature platform, multi‑stage recall, ranking and re‑ranking models, online services, and AB experimentation to deliver personalized game suggestions, resulting in up to 78% higher click‑through, 76% higher conversion, and substantial increases in user engagement and revenue.

AB testingfeature engineeringgame-platform

0 likes · 26 min read

Bilibili Game Center Recommendation System: Architecture, Core Technologies, and Experimental Results

GuanYuan Data Tech Team

May 25, 2023 · Artificial Intelligence

How to Build a Comprehensive ML Model Quality Assessment Framework

This article explains why and how to evaluate machine learning model quality through a structured framework that covers data validation, feature checks, and algorithm testing, helping ensure accuracy, reliability, and maintainability before deployment.

AI GovernanceData Validationfeature engineering

0 likes · 19 min read

How to Build a Comprehensive ML Model Quality Assessment Framework

HelloTech

May 8, 2023 · Artificial Intelligence

One‑Stop AI Platform for Cloud, Edge, Mobile, Flink, and Application Intelligence: Architecture, Challenges, and Solutions

The article presents a comprehensive one‑stop AI platform that unifies training, model, feature, and decision services across cloud, edge, mobile, Flink, and application environments, detailing its architecture, the limitations of cloud‑centric inference, the advantages of localized inference, and the challenges and solutions for model and feature localization, SDK design, and future AutoML enhancements.

AI platformFlinkdistributed systems

0 likes · 17 min read

One‑Stop AI Platform for Cloud, Edge, Mobile, Flink, and Application Intelligence: Architecture, Challenges, and Solutions

HelloTech

Apr 12, 2023 · Artificial Intelligence

Integrating Machine Learning Ranking into Elasticsearch: Architecture, Components, and Performance

The team embedded a full machine‑learning ranking pipeline as an Elasticsearch plug‑in—combining real‑time and offline feature stores, hot‑loadable model jars via Dragonfly, an MLeap execution engine, and a DSL for feature definition—replacing the coarse‑ranking logistic‑regression with a tree model that adds ~10 ms latency but yields a 1.2 % AB‑test lift, while maintaining high throughput, low CPU usage, and supporting future batch deep‑learning rescoring.

Model Deploymentfeature engineeringonline prediction

0 likes · 16 min read

Integrating Machine Learning Ranking into Elasticsearch: Architecture, Components, and Performance

dbaplus Community

Apr 8, 2023 · Big Data

How Zhihu Built a Scalable DMP: Architecture, Data Pipelines, and Real‑Time Targeting

This article details Zhihu's Data Management Platform (DMP), covering the business problems it solves, the end‑to‑end workflow, feature taxonomy, system architecture, data pipelines for batch and streaming, audience targeting processes, performance challenges, and future technical directions.

Big DataDMPData Platform

0 likes · 8 min read

How Zhihu Built a Scalable DMP: Architecture, Data Pipelines, and Real‑Time Targeting

Meituan Technology Team

Apr 6, 2023 · Databases

AI-Driven Index Recommendation for Slow Queries at Meituan

This article details a joint research effort between Meituan and East China Normal University that combines cost‑based methods with AI‑driven, data‑centric models to automatically generate and evaluate missing indexes for billions of daily slow queries, improving recommendation accuracy and query performance.

AICost ModelIndex Recommendation

0 likes · 16 min read

AI-Driven Index Recommendation for Slow Queries at Meituan

Tencent Advertising Technology

Mar 30, 2023 · Artificial Intelligence

Tencent's Taiji Machine Learning Platform: End-to-End MLOps for Advertising

Tencent’s Taiji machine learning platform, a cloud‑native, distributed parameter‑server system, provides end‑to‑end MLOps for advertising by integrating data ingestion, feature engineering, model training, evaluation, deployment, and monitoring, supporting massive models up to billions of parameters while improving efficiency, scalability, and resource management.

MLOpsMachine Learning PlatformModel Deployment

0 likes · 18 min read

Tencent's Taiji Machine Learning Platform: End-to-End MLOps for Advertising

DataFunTalk

Mar 28, 2023 · Artificial Intelligence

FeatHub: An Open‑Source Feature Store for Real‑Time and Offline Feature Engineering

This article introduces FeatHub, an open‑source feature‑store project from Alibaba Cloud that provides a Python SDK, flexible architecture, and execution engines such as Flink and Spark to simplify the development, deployment, monitoring, and sharing of real‑time and offline machine‑learning features across multi‑cloud environments.

Feature StoreFlinkPython SDK

0 likes · 21 min read

FeatHub: An Open‑Source Feature Store for Real‑Time and Offline Feature Engineering

Smart Era Software Development

Mar 16, 2023 · Artificial Intelligence

10 Essential Elements of Machine Learning System Architecture

The article outlines ten core components—data and feature pipelines, feature store, training and retraining pipelines, metadata store, serving infrastructure, production monitoring, reusable ML pipelines, workflow orchestration, CI/CT/CD, and end‑to‑end quality control—that together form a scalable, reliable architecture for modern machine‑learning systems.

MLOpsfeature engineeringmachine learning

0 likes · 7 min read

10 Essential Elements of Machine Learning System Architecture

DaTaobao Tech

Mar 13, 2023 · Artificial Intelligence

AI‑Driven 3D Content Creation and Optimization for E‑commerce

The article presents an AI‑driven pipeline that creates, delivers, and optimizes 3D e‑commerce content by leveraging diffusion‑based generation, txt2img/img2img style transfer, Shapley‑value interpretability, and a multi‑level traffic amplification framework to overcome modeling efficiency, asset scarcity, and production cost challenges.

3D contentAI generationOptimization

0 likes · 12 min read

AI‑Driven 3D Content Creation and Optimization for E‑commerce

Tencent Cloud Developer

Mar 8, 2023 · Artificial Intelligence

Building a Scalable Recommendation System for WeChat Games: Architecture and Implementation

The article describes WeChat Games’ scalable recommendation system, detailing its four‑component architecture—offline ML platform, unified management, online DAG‑based engine, and peripheral services—along with a hybrid algorithm library, feature engineering, real‑time monitoring, and solutions that boost engagement across diverse game recommendation scenarios.

Data ManagementDeep LearningReal-time Processing

0 likes · 28 min read

Building a Scalable Recommendation System for WeChat Games: Architecture and Implementation

DataFunSummit

Mar 3, 2023 · Artificial Intelligence

Intelligent Risk Control System Architecture and Development Trends

This article introduces the architecture of intelligent risk control, detailing its four-layer structure, the underlying data, feature, model, and decision components, platform interactions, and future development trends, highlighting how AI and big data enhance risk management efficiency and accuracy.

Big DataDecision Systemsfeature engineering

0 likes · 12 min read

Intelligent Risk Control System Architecture and Development Trends

DataFunSummit

Jan 25, 2023 · Artificial Intelligence

Expert Insights on Recommendation System Architecture, Data, Features, Recall, Ranking and Evaluation

This interview compiles expert opinions on the end‑to‑end recommendation system pipeline—including architecture, data collection, user profiling, content structuring, feature engineering, recall strategies, ranking algorithms, multi‑objective optimization, multi‑modal fusion, re‑ranking, cold‑start solutions, evaluation metrics and real‑world applications—highlighting the technical challenges and practical solutions.

Evaluation MetricsRankingcold-start

0 likes · 15 min read

Expert Insights on Recommendation System Architecture, Data, Features, Recall, Ranking and Evaluation

DataFunSummit

Jan 11, 2023 · Artificial Intelligence

Intelligent Financial Risk Control Platform Architecture and Expert Insights

This article outlines the architecture of an intelligent financial risk control platform, detailing data sources, big‑data processing, feature engineering, decision engines, model types, and real‑world application scenarios, while highlighting expert‑identified challenges such as compliance, data quality, real‑time performance, and fraud detection.

decision-enginefeature engineeringfinancial AI

0 likes · 11 min read

Intelligent Financial Risk Control Platform Architecture and Expert Insights

Xianyu Technology

Dec 21, 2022 · Artificial Intelligence

Xianyu Recommendation System: Architecture, Challenges, and Deployment

The Xianyu recommendation system, built by backend expert Wan Xiaoyong, evolved from offline scoring to a full‑graph, serverless recall‑ranking pipeline that tackles C2C uncertainties through centralized feature engineering, model compression, staged deployment, flexible experimentation, robust governance, and plans for automated attribution and interpretability.

AIBig DataModel Deployment

0 likes · 10 min read

Xianyu Recommendation System: Architecture, Challenges, and Deployment

Python Crawling & Data Mining

Dec 9, 2022 · Artificial Intelligence

Can Machine Learning Predict the World Cup Winner? A Complete Data Science Walkthrough

This article details how to gather FIFA and match data, engineer predictive features, build and evaluate Random Forest and Gradient Boosting models, and simulate the 2022 World Cup to forecast match outcomes and the eventual champion using Python and scikit‑learn.

Random ForestScikit-learndata simulation

0 likes · 22 min read

Can Machine Learning Predict the World Cup Winner? A Complete Data Science Walkthrough

Hulu Beijing

Dec 2, 2022 · Artificial Intelligence

How Disney+ Designs a Multi‑Task Video Search Ranking Model

This article explains the architecture of a video search ranking system that combines a deep encoding network, multi‑task expert networks, and a bias‑correction module to jointly optimize relevance, click‑through rate, and watch time for streaming platforms.

Bias CorrectionDeep LearningMulti-Task Learning

0 likes · 15 min read

How Disney+ Designs a Multi‑Task Video Search Ranking Model

Alipay Experience Technology

Nov 29, 2022 · Mobile Development

How Ant's Mobile Edge Computing Container Powers Real‑Time AI on Devices

This article explains the challenges of deploying intelligent features on mobile clients and describes Ant Group's edge computing container, its three‑layer architecture, real‑time compute, feature, and decision engines, and the low‑code platform that enables fast, stable, and scalable AI solutions on devices.

decision-enginefeature engineeringreal-time analytics

0 likes · 15 min read

How Ant's Mobile Edge Computing Container Powers Real‑Time AI on Devices

Python Programming Learning Circle

Nov 22, 2022 · Artificial Intelligence

Using tsfresh for Automated Time Series Feature Extraction in Python

This article introduces the tsfresh Python package, explains why traditional machine‑learning models struggle with time‑series data, and demonstrates how tsfresh can automatically generate and select hundreds of useful features—including statistical, nonlinear, and signal‑processing metrics—while supporting big‑data frameworks such as Dask and Spark.

Pythonfeature engineeringtime series

0 likes · 5 min read

Using tsfresh for Automated Time Series Feature Extraction in Python

Dada Group Technology

Nov 18, 2022 · Artificial Intelligence

JD Daojia Machine Learning Platform: Architecture and Implementation

This article introduces JD Daojia's machine learning platform, detailing its architecture, implementation principles, and practical applications in various business scenarios, achieving significant improvements in recommendation and search systems.

Deep LearningGraph Neural NetworksKubernetes

0 likes · 28 min read

JD Daojia Machine Learning Platform: Architecture and Implementation

Model Perspective

Oct 24, 2022 · Artificial Intelligence

Understanding RuleFit: Combining Tree Rules with Linear Models for Interpretable AI

RuleFit, introduced by Friedman and Popescu in 2008, integrates decision‑tree‑derived rules with linear regression to boost predictive accuracy while maintaining strong interpretability, and this article explains its definition, rule extraction, algorithmic implementation, code example, advantages, limitations, and practical insights.

RuleFitensemble treesfeature engineering

0 likes · 10 min read

Understanding RuleFit: Combining Tree Rules with Linear Models for Interpretable AI

Kuaishou Tech

Oct 21, 2022 · Artificial Intelligence

Real-time Short Video Recommendation on Mobile Devices: System Design, Model Architecture, and Experimental Evaluation

The paper presents a lightweight on‑device re‑ranking system for short‑video recommendation that leverages real‑time user feedback and context‑aware generative ranking, detailing its architecture, feature engineering, beam‑search optimization, and both offline and online experimental results showing significant performance gains.

Beam SearchContext-Awarefeature engineering

0 likes · 12 min read

Real-time Short Video Recommendation on Mobile Devices: System Design, Model Architecture, and Experimental Evaluation

vivo Internet Technology

Oct 9, 2022 · Artificial Intelligence

vivo Machine Learning Platform: Architecture Design and Practice

vivo’s machine‑learning platform, built for its massive app‑store and e‑commerce ecosystem, streamlines data processing, model training, and deployment through quota‑based resource management, a custom ultra‑large‑scale TensorFlow‑vlps framework, OpenAPI‑driven training, and Jupyter‑integrated interactive development, boosting efficiency for billions of samples and features.

MLOpsMachine Learning PlatformModel Deployment

0 likes · 12 min read

vivo Machine Learning Platform: Architecture Design and Practice

ITPUB

Sep 15, 2022 · Artificial Intelligence

Why Precise Feature Engineering Still Matters in Recommendation Systems

In the era of deep learning, feature engineering remains crucial for recommendation and search advertising because it bridges raw relational data and models, improves performance, reduces complexity, and handles high‑cardinality, large‑scale, and time‑sensitive scenarios with robust transformations and statistical encoding.

AIData preprocessingRecommendation Systems

0 likes · 20 min read

Why Precise Feature Engineering Still Matters in Recommendation Systems

NetEase LeiHuo UX Big Data Technology

Sep 5, 2022 · Artificial Intelligence

Feature Engineering in Game Data: Types, Missing Value and Outlier Handling

This article explains how feature engineering in game data involves classifying structured and unstructured, quantitative and qualitative features, and details practical methods for handling missing values and outliers to improve machine‑learning model performance.

Data preprocessingfeature engineeringgame data

0 likes · 9 min read

Feature Engineering in Game Data: Types, Missing Value and Outlier Handling

Model Perspective

Aug 31, 2022 · Fundamentals

How to Build a Watermelon Sweetness Dataset: From Field to Features

This article describes how the author collected a watermelon dataset, defined measurable features such as size, color, sugar content, seed count, and texture, and documented the process with photos, tables, and a brief discussion of data characteristics for future machine‑learning analysis.

data analysisdata collectionfeature engineering

0 likes · 12 min read

How to Build a Watermelon Sweetness Dataset: From Field to Features