Tag

TFServing

0 views collected around this technical thread.

iQIYI Technical Product Team
iQIYI Technical Product Team
Nov 26, 2021 · Backend Development

Analysis and Solutions for Load‑Balancing Issues in QLB‑4 Based TFServing Service Calls

The investigation of QLB‑4‑based TFServing calls revealed uneven traffic, stale routing after scaling, and idle servers due to layer‑4 hash routing, leading the team to replace QLB‑4 with a Consul‑driven client‑side load‑balancer that dynamically pools servers, eliminates restarts, and cuts GPU waste.

ConsulQLB-4TFServing
0 likes · 11 min read
Analysis and Solutions for Load‑Balancing Issues in QLB‑4 Based TFServing Service Calls