Tagged articles
Baobao Algorithm Notes
Mar 18, 2024 · Industry Insights

Inside the 2024 KDD Cup ShopBench Challenge: Tasks, Data, and Evaluation Metrics

The 2024 KDD Cup introduces ShopBench, a large-scale LLM benchmark that simulates real-world online shopping with 57 tasks and over 20,000 questions. The competition runs five tracks: concept understanding, knowledge reasoning, user-behavior alignment, multilingual ability, and an all-round track. Submissions are scored with task-specific metrics on a hidden test set.

Benchmark · Dataset · Evaluation Metrics
11 min read