Building Fast-Iterating Machine Learning Systems at Tubi: A/B Testing, Simple Models, and Embedding Strategies

This article shares Tubi's practical experience in rapidly iterating machine‑learning systems, emphasizing the early importance of simple end‑to‑end A/B testing platforms, clear launch plans, heat‑based and embedding‑based ranking models, and a culture of fast experimentation over complex deep‑learning research.

DataFunTalk
DataFunTalk
DataFunTalk
Building Fast-Iterating Machine Learning Systems at Tubi: A/B Testing, Simple Models, and Embedding Strategies

Tubi TV, one of the largest free‑video platforms, leverages machine learning across recommendation, search, advertising, and content acquisition, making ML the core of its business.

Measurement First – Before diving into advanced deep or reinforcement learning, the team stresses building a simple, end‑to‑end experiment system (A/B testing) to validate ideas, because complex models can behave inconsistently across user segments.

Experiment Planning – Successful experiments require a concise launch plan: define one or two core metrics for go/no‑go decisions, supplement with secondary metrics for iteration, and agree on a clear rule (e.g., core metrics positive → launch).

Simple Models – Start with lightweight models such as heat‑based ranking (using recent watch time) and similarity‑based ranking via embeddings. Heat scores should exclude artificial boosts from promotions. Embeddings can be generated with Spark's word2vec or ALS, then used for cosine similarity across videos, users, or other entities.

Iteration Is King – After establishing basic algorithms, the team focuses on rapid iteration rather than pursuing the latest research. By running many small experiments (up to 50 per year in early stages), they continuously refine models, prioritize ROI, and keep the system adaptable.

Conclusion – Tubi’s experience shows that a fast, experiment‑driven ML pipeline, clear launch criteria, and simple yet effective models enable rapid product improvements and scalable growth.

Recruitment Invitation – Tubi is hiring senior engineers for data, ML, backend (Elixir, Scala), Android, iOS, frontend, and QA roles; details are on their website.

Event Announcement – An upcoming DataFunTalk big‑data architecture forum will feature a talk by Tubi senior data engineer on data‑quality architecture; registration via QR code.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Sign in to view source
Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactadmin@besthub.devand we will review it promptly.

data engineeringartificial intelligencemachine learningA/B testingEmbeddingiteration
DataFunTalk
Written by

DataFunTalk

Dedicated to sharing and discussing big data and AI technology applications, aiming to empower a million data scientists. Regularly hosts live tech talks and curates articles on big data, recommendation/search algorithms, advertising algorithms, NLP, intelligent risk control, autonomous driving, and machine learning/deep learning.

0 followers
Reader feedback

How this landed with the community

Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.