Artificial Intelligence 18 min read

Algorithmic Optimization of Information‑Flow Advertising for Hallo Mobility

This presentation details how Hallo Mobility tackles the challenges of information‑flow ad modeling by describing the ad ecosystem, the company’s business evolution, and the advertiser‑side algorithmic solutions—including plan‑level quality detection, creative‑level uplift modeling, feature‑cross engineering, and pre‑bid user screening—while outlining future directions for automated, data‑driven ad delivery.

DataFunSummit
DataFunSummit
DataFunSummit
Algorithmic Optimization of Information‑Flow Advertising for Hallo Mobility

Guest: Zhou Bingqian, Senior Algorithm Engineer at Hallo Mobility (DataFunTalk).

1. Information‑flow ad landscape – Information‑flow ads embed native content (images, text, video) into media streams, offering value‑added experiences. Since 2006 they have grown from Facebook to TikTok, Weibo, Toutiao, etc., reaching ~40% market share by 2022 and becoming a primary channel for user acquisition.

2. Hallo business background – The ad‑tech stack evolved through four stages: exploration, initial scaling, cost‑reduction, and mature automation. The current system integrates a unified API gateway, storage (Redis, MySQL, HBase, Elasticsearch), and an application layer handling decision‑making, automated operation, and attribution.

3. Advertiser‑side algorithmic optimization

3.1 Plan dimension – The goal is to predict whether a user who submits certification will complete an order ("completion"). Due to limited exposure data, the problem is reframed as low‑quality traffic detection and completion‑rate estimation using LightGBM. Samples are built from users who complete within 7 days; data augmentation creates order‑level samples.

Features include user, environment, ad, and time attributes; the model uses LightGBM for binary classification.

3.2 Creative dimension – To avoid waste from similar low‑performing creatives, a model predicts whether a new creative can generate volume. Feature engineering combines ID embeddings (word2vec on creative/plan IDs), numeric feature cross‑features, and configuration parameters, feeding a multi‑class classifier.

3.3 Pre‑bid uplift mechanism – An uplift model (causal inference) scores users from 0‑5 to decide exposure or bidding adjustments, separating naturally converting users from those needing ad incentives, thereby reducing cost per acquisition.

4. Future directions – Plans include more precise pre‑exposure screening using exposure data, full‑automation of online delivery with algorithm‑driven creative/plan generation, budget allocation, and integration of advanced AI techniques (CTR, reinforcement learning, CV‑based creative generation).

5. Q&A highlights – Discussed handling multi‑business competition, why similar creatives behave differently across accounts, and the relationship between RTB and RTA in future pipelines.

Thank you for attending.

advertisingalgorithmmachine learningfeature engineeringAIinformation flowuplift modeling
DataFunSummit
Written by

DataFunSummit

Official account of the DataFun community, dedicated to sharing big data and AI industry summit news and speaker talks, with regular downloadable resource packs.

0 followers
Reader feedback

How this landed with the community

login Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.