Alibaba Cloud Developer
Dec 4, 2024 · Artificial Intelligence

How Alibaba’s GTE‑Multilingual Models Boost RAG with Long‑Doc and Multi‑Language Support

Alibaba's Tongyi Lab introduces the GTE‑Multilingual series, a family of high‑performance encoder‑only models that accept inputs of up to 8k tokens, cover 75 languages, and offer both elastic and sparse embeddings. The models deliver superior retrieval‑augmented generation (RAG) performance across multilingual and long‑document benchmarks.

AI model training · Sparse Embedding · Elastic Embedding
18 min read