Qwen1.5-110B vs Llama‑3‑70B: Performance Insights of Alibaba’s 110B Model

Alibaba unveiled the 110‑billion‑parameter Qwen1.5‑110B model, featuring GQA, 32k context and multilingual support, and benchmark results show it matches or surpasses Llama‑3‑70B and Mixtral‑8x22B on a range of tasks, with notable gains in chat evaluations.

AILLMModel Scaling

0 likes · 7 min read

Qwen1.5-110B vs Llama‑3‑70B: Performance Insights of Alibaba’s 110B Model