Tag

exploration

1 views collected around this technical thread.

Bitu Technology
Bitu Technology
May 18, 2022 · Artificial Intelligence

Mitigating Exposure Bias in Tubi’s Recommendation System

This article explains how Tubi’s machine‑learning team reduces exposure bias in its video recommendation pipeline by normalizing popularity features, incorporating additional signals such as search behavior, and applying exploration techniques like bandit algorithms to diversify content exposure.

Banditsexplorationexposure bias
0 likes · 10 min read
Mitigating Exposure Bias in Tubi’s Recommendation System
HomeTech
HomeTech
Jun 10, 2020 · Artificial Intelligence

Exploitation & Exploration Algorithms in Recommender Systems: ε‑Greedy, UCB, and Thompson Sampling Applications

This article introduces recommender systems and the exploitation‑exploration dilemma, explains common E&E algorithms such as ε‑greedy, Upper‑Confidence‑Bound, and Thompson Sampling, and details their practical deployment for interest‑point eviction, selection, and adaptive recall count optimization in an automotive recommendation platform.

Bandit algorithmsThompson SamplingUCB
0 likes · 10 min read
Exploitation & Exploration Algorithms in Recommender Systems: ε‑Greedy, UCB, and Thompson Sampling Applications
Ctrip Technology
Ctrip Technology
May 28, 2020 · Mobile Development

Intelligent Android Exploration Tool (IAET): UI‑Driven Automated Testing, Algorithms, Implementation, and Evaluation

This article presents IAET, an intelligent Android exploration tool that detects UI elements, applies graph‑based traversal algorithms with similarity optimizations, implements a bridge using UiAutomator and app_process, and demonstrates superior crash‑detection and activity‑coverage performance compared with the APE benchmark across major Chinese apps.

AndroidUI automationalgorithm
0 likes · 15 min read
Intelligent Android Exploration Tool (IAET): UI‑Driven Automated Testing, Algorithms, Implementation, and Evaluation
DataFunTalk
DataFunTalk
Mar 20, 2019 · Artificial Intelligence

Addressing Sparse Reward Problems in Model-Free Reinforcement Learning

This article reviews the challenges of model‑free reinforcement learning, especially sparse reward issues exemplified by Montezuma’s Revenge, and surveys recent approaches such as expert demonstrations, curriculum learning, self‑play, hierarchical reinforcement learning, and count‑based exploration to mitigate these problems.

curriculum learningexplorationhierarchical RL
0 likes · 12 min read
Addressing Sparse Reward Problems in Model-Free Reinforcement Learning
Qunar Tech Salon
Qunar Tech Salon
May 16, 2016 · Artificial Intelligence

Improving A/B Testing with a 20‑Line Multi‑Armed Bandit Algorithm

This article explains how a simple 20‑line multi‑armed bandit implementation can replace traditional A/B testing by continuously balancing exploration and exploitation to automatically discover the most effective UI variant, reducing manual analysis and improving conversion rates.

A/B testingMulti-armed banditexploitation
0 likes · 8 min read
Improving A/B Testing with a 20‑Line Multi‑Armed Bandit Algorithm