Baidu ERNIE 3.0: Knowledge-Enhanced 10B-Parameter Model Sets New Chinese NLP Benchmarks and Tops SuperGLUE

Baidu's ERNIE 3.0 is a 10-billion-parameter language model enhanced with a knowledge graph. It sets new state-of-the-art results on 54 Chinese NLP benchmarks, surpasses the human baseline on SuperGLUE, and shows strong generation and zero-shot capabilities. The research paper and a public demo are both available.

DataFunTalk

Baidu has launched ERNIE 3.0, a 10-billion-parameter knowledge-enhanced large language model. It learns jointly from massive unsupervised text and a large-scale knowledge graph through a parallel Knowledge-Text Prediction pre-training method, and it was trained on PaddlePaddle's distributed training platform.
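To make the idea of learning from text and a knowledge graph at the same time more concrete, here is a minimal sketch of how one training sample for such an objective could be built. It assumes that a knowledge-graph triple is paired with a sentence that mentions it, and that either the relation in the triple or some words in the sentence are masked for the model to predict. The function name `build_ktp_sample`, the `[MASK]` and `[SEP]` tokens, and the 15% masking rate are illustrative assumptions, not details taken from this article or from Baidu's code.

```python
import random
from typing import Dict, List, Tuple

MASK = "[MASK]"  # assumed placeholder token
SEP = "[SEP]"    # assumed separator between triple and sentence


def build_ktp_sample(
    triple: Tuple[str, str, str],
    sentence: List[str],
    mask_relation: bool,
    rng: random.Random,
    mask_prob: float = 0.15,  # assumed masking rate, for illustration only
) -> Tuple[List[str], Dict[int, str]]:
    """Return (input tokens, {position: original token}) for one sample."""
    head, relation, tail = triple
    triple_tokens = [head, relation, tail]
    text_tokens = list(sentence)
    labels: Dict[int, str] = {}

    if mask_relation:
        # Hide the relation: the model must infer it from the sentence.
        triple_tokens[1] = MASK
        labels[1] = relation
    else:
        # Hide words in the sentence: the triple serves as a knowledge hint.
        offset = len(triple_tokens) + 1  # skip the triple and the separator
        for i, token in enumerate(text_tokens):
            if rng.random() < mask_prob:
                labels[offset + i] = token
                text_tokens[i] = MASK

    return triple_tokens + [SEP] + text_tokens, labels


rng = random.Random(0)
tokens, targets = build_ktp_sample(
    ("Beijing", "capital_of", "China"),
    ["Beijing", "is", "the", "capital", "of", "China"],
    mask_relation=True,
    rng=rng,
)
print(tokens, targets)
```

In this sketch the two masking modes push information in opposite directions: masking the relation makes the model read the text to recover knowledge, while masking words makes it use the knowledge to recover text.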

The model has a two-level architecture. At the bottom is a general semantic representation network that captures universal knowledge and is shared across tasks. On top of it sit task-specific semantic networks that build on the general network's representations. These task-specific networks can use either an auto-encoding or an auto-regressive structure, so a single model covers both language understanding and generation.
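The practical difference between an auto-encoding branch and an auto-regressive branch can be illustrated with attention masks. The sketch below is a toy, not ERNIE 3.0's implementation: `shared_representation` is a stand-in for the general network, and the `TASK_BRANCH` routing table and all function names are hypothetical. It only shows that every task reuses the same shared features, while the understanding branch lets each token see the whole sequence and the generation branch lets each token see only itself and earlier tokens.

```python
from typing import Callable, Dict, List

Mask = List[List[bool]]  # mask[i][j] is True if position i may attend to j


def bidirectional_mask(n: int) -> Mask:
    """Auto-encoding style: every position sees every other position."""
    return [[True] * n for _ in range(n)]


def causal_mask(n: int) -> Mask:
    """Auto-regressive style: position i sees only positions 0..i."""
    return [[j <= i for j in range(n)] for i in range(n)]


# Hypothetical routing from task family to the structure of its branch.
TASK_BRANCH: Dict[str, Callable[[int], Mask]] = {
    "understanding": bidirectional_mask,
    "generation": causal_mask,
}


def shared_representation(tokens: List[str]) -> List[int]:
    """Toy stand-in for the general network: one feature per token."""
    return [len(token) for token in tokens]


def run(task: str, tokens: List[str]) -> Dict[str, object]:
    features = shared_representation(tokens)  # identical for every task
    mask = TASK_BRANCH[task](len(tokens))     # differs per task branch
    return {"features": features, "mask": mask}


for task in ("understanding", "generation"):
    print(task, run(task, ["ERNIE", "reads", "text"]))
```

Keeping the shared part separate from the branches means a new task family only needs a new branch, not a new model.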

ERNIE 3.0 sets new state-of-the-art results on 54 Chinese NLP tasks, taking the top score on every dataset evaluated. The tasks include sentiment analysis, opinion extraction, reading comprehension, summarization, dialogue generation, and mathematical reasoning, and on many of them the model improves on the previous best result by more than 3%.

The English version of the model also tops the SuperGLUE leaderboard, ahead of Google's T5 and OpenAI's GPT-3, and exceeds the human baseline by 0.8 percentage points. SuperGLUE tests abilities such as commonsense reasoning, causal inference, and coreference resolution.

The model supports zero‑shot learning and fine‑tuning, showing significant gains even with limited labeled data, and can generate literary content such as novels, lyrics, poems, and couplets without task‑specific training.

Both the research paper (https://arxiv.org/pdf/2107.02137.pdf) and an interactive demo (https://wenxin.baidu.com/wenxin/ernie) are publicly available, and the ERNIE series is already deployed in search, information feeds, smart speakers, and Baidu Cloud services across various industries.
