Why Do Large Language Models Hallucinate and How to Reduce It?
The article explains why large language models hallucinate—data errors, conflicting training signals, and inference‑time uncertainty—and surveys mitigations: data cleaning, model‑level feedback, knowledge augmentation, decoding constraints, and post‑processing methods such as the “Truth‑seeking” algorithm.
Why Large Models Hallucinate
When using large models, users often encounter nonsensical or factually incorrect answers, a phenomenon called “hallucination.” It arises from both data‑level problems—such as erroneous or biased pre‑training data and conflicting alignment data—and algorithmic issues during training and inference.
For example, if pre‑training data says Apple’s CEO is Tim Cook while alignment data says it is Bill Gates, the model becomes confused and may produce a hallucinated answer when asked about Apple’s CEO.
Additionally, models forget knowledge acquired during pre‑training and decode stochastically during inference, both of which further increase the chance of incorrect outputs.
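To make the stochastic‑decoding point concrete, here is a minimal sketch of temperature sampling over a toy three‑token vocabulary. The logits and temperature values are illustrative, not from the article; the point is that a higher temperature flattens the distribution and raises the chance of low‑probability (possibly hallucinated) continuations.

```python
import numpy as np

def sample_token(logits, temperature=1.0, rng=None):
    """Sample a token id; higher temperature flattens the distribution,
    making unlikely (and potentially wrong) tokens more probable."""
    rng = rng or np.random.default_rng(0)
    scaled = np.asarray(logits, dtype=float) / max(temperature, 1e-6)
    probs = np.exp(scaled - scaled.max())  # softmax, numerically stable
    probs /= probs.sum()
    return int(rng.choice(len(probs), p=probs))

logits = [5.0, 2.0, 0.5]                 # toy vocabulary of 3 tokens
greedy = int(np.argmax(logits))          # deterministic: always token 0
sampled = sample_token(logits, temperature=2.0)  # may pick token 1 or 2
```

Greedy decoding always returns the top token, while sampling at high temperature occasionally selects lower‑ranked ones—the source of inference‑time randomness the text describes.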
How to Effectively Reduce Hallucinations
Solutions are grouped into data‑level, model‑level, and post‑processing methods.
Data‑level: Clean and filter training data to remove noise, bias, duplication, and factual errors; use manual annotation or heuristic rules to improve data quality.
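A heuristic‑rule filter of the kind described above can be sketched as follows. The thresholds and rules are illustrative placeholders, not values from the article:

```python
import re

def clean_corpus(docs, min_len=20):
    """Toy heuristic filter: normalize whitespace, drop snippets too
    short to be informative, and drop exact duplicates."""
    seen, kept = set(), []
    for doc in docs:
        text = re.sub(r"\s+", " ", doc).strip()
        if len(text) < min_len:      # too short to carry a fact
            continue
        key = text.lower()
        if key in seen:              # verbatim duplicate
            continue
        seen.add(key)
        kept.append(text)
    return kept

corpus = [
    "Tim Cook has been Apple's CEO since 2011 and leads the company today.",
    "Tim  Cook has been Apple's CEO since 2011 and leads the company today.",
    "ok",
]
cleaned = clean_corpus(corpus)  # duplicate and short snippet are dropped
```

Real pipelines combine such rules with manual annotation and learned quality classifiers, but the structure—normalize, filter, deduplicate—is the same.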
Model‑level: Incorporate feedback (e.g., reinforcement learning with hallucination‑aware rewards), inject external knowledge via retrieval or knowledge bases, and apply constraints during decoding to force inclusion of required entities.
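The decoding‑constraint idea can be illustrated with a simple rejection filter: a draft answer is accepted only if every required entity appears. This is a lexical stand‑in for constrained decoding, not an actual decoder modification; the entity list and drafts are invented for the example.

```python
def passes_constraints(candidate, required_entities):
    """Accept a draft only if every required entity string appears
    (case-insensitive substring match)."""
    return all(ent.lower() in candidate.lower() for ent in required_entities)

drafts = [
    "The company is led by its longtime chief executive.",
    "Apple is led by CEO Tim Cook.",
]
required = ["Apple", "Tim Cook"]
chosen = next(d for d in drafts if passes_constraints(d, required))
```

Production systems enforce such constraints inside beam search rather than by post‑hoc filtering, but the goal is identical: force the output to include the entities grounded in retrieved knowledge.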
Post‑processing: Detect and correct hallucinations after generation by training a verifier on synthetic error data, especially for entities, numbers, and dates.
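A rule‑based sketch of such a verifier, focused on the numbers and dates the article singles out: extract numeric spans from the answer and flag any the source does not support. A real verifier would be a model trained on synthetic error data; this check only illustrates the interface.

```python
import re

def extract_facts(text):
    """Pull numeric spans (counts, years) from the text."""
    return set(re.findall(r"\b\d[\d,.]*\b", text))

def verify(answer, source):
    """Return (ok, unsupported): flag numbers in the answer that the
    source text does not contain."""
    unsupported = extract_facts(answer) - extract_facts(source)
    return len(unsupported) == 0, unsupported

source = "Apple was founded in 1976; Tim Cook became CEO in 2011."
ok, bad = verify("Tim Cook became CEO in 2011.", source)    # supported
ok2, bad2 = verify("Tim Cook became CEO in 2013.", source)  # 2013 flagged
```

Flagged spans can then be corrected against the source or trigger regeneration.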
“Truth‑seeking” Algorithm
Tencent’s Hunyuan model introduces a “Truth‑seeking” algorithm during pre‑training that intervenes in the model’s hidden states when a potential hallucination is detected. A classifier inside the algorithm identifies patterns associated with hallucinations and adjusts the hidden variables during inference, achieving a 30%‑50% reduction in hallucination rate on the TruthfulQA benchmark without causing catastrophic forgetting.
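The published details of Hunyuan’s algorithm are not reproduced here, but the general idea—a classifier scores each hidden state for hallucination‑like patterns and flagged states are nudged along a corrective direction—can be sketched with placeholder parameters. The probe weights and “truthful direction” below are random stand‑ins, not Hunyuan’s actual components:

```python
import numpy as np

rng = np.random.default_rng(0)
d = 8                                  # toy hidden-state dimension

probe_w = rng.normal(size=d)           # hypothetical hallucination probe
truth_dir = rng.normal(size=d)
truth_dir /= np.linalg.norm(truth_dir) # unit steering direction

def intervene(hidden, threshold=0.0, alpha=0.5):
    """If the probe flags the state, steer it along the corrective
    direction in proportion to the probe score; else leave it alone."""
    score = float(probe_w @ hidden)    # > threshold => likely hallucination
    if score > threshold:
        hidden = hidden - alpha * score * truth_dir
    return hidden

h = rng.normal(size=d)
h_fixed = intervene(h)                 # adjusted only if flagged
```

The appeal of this family of methods is that they act at inference time on internal activations, so the base weights—and hence prior knowledge—are left untouched, which is consistent with the claim that catastrophic forgetting is avoided.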
Although hallucinations cannot be completely eliminated due to the inherent nature of transformer architectures, ongoing research—including reinforcement‑learning‑based trap detection, chain‑of‑thought prompting, and the “Truth‑seeking” technique—aims to further lower their occurrence.
Tencent Tech
Tencent's official tech account. Delivering quality technical content to serve developers.