Tagged articles

hallucination reduction

9 articles · Page 1 of 1

Jun 3, 2026 · Artificial Intelligence

GPT‑5.5 Instant Goes Free: Hallucinations Cut 52%, Math Scores Jump to 81%, and Personalized Memory Arrives

OpenAI has rolled out GPT‑5.5 Instant as the new default ChatGPT model, delivering 52.5% fewer hallucinations, a rise in math benchmark scores from 65% to 81%, 30% shorter replies, and a memory system that surfaces past context for personalized answers, all available for free to every user.

AI benchmarksChatGPTGPT-5.5

0 likes · 10 min read

GPT‑5.5 Instant Goes Free: Hallucinations Cut 52%, Math Scores Jump to 81%, and Personalized Memory Arrives

SuanNi

May 7, 2026 · Artificial Intelligence

GPT-5.5 Instant Cuts Hallucinations by 52.5% and Delivers More Concise Answers

OpenAI's free GPT-5.5 Instant replaces GPT-5.3 as the default model, slashing hallucinations by 52.5% in high‑risk domains, improving factual accuracy, providing shorter yet precise responses, adding memory‑controlled personalization, and rolling out to all ChatGPT users via the chat‑latest API.

AIGPT-5.5OpenAI

0 likes · 6 min read

GPT-5.5 Instant Cuts Hallucinations by 52.5% and Delivers More Concise Answers

AI Engineering

May 6, 2026 · Artificial Intelligence

GPT-5.5 Instant Launch Cuts Hallucinations by 52.5% and Eliminates Fluff

OpenAI silently upgraded its default ChatGPT model to GPT-5.5 Instant, delivering self-correcting math reasoning, a 52.5% drop in hallucinations across medical and legal tests, 37.3% fewer user-marked errors, higher benchmark scores, shorter, fluff-free answers, and a new traceable memory feature, with a staged rollout to free and paid users.

AI model upgradeGPT-5.5OpenAI

0 likes · 4 min read

GPT-5.5 Instant Launch Cuts Hallucinations by 52.5% and Eliminates Fluff

Wu Shixiong's Large Model Academy

Mar 29, 2026 · Artificial Intelligence

Mastering RAG Prompt Engineering: Prevent Hallucinations and Boost Accuracy

This article dissects the unique challenges of RAG prompting, presents a systematic System/User Prompt design with strong constraints and citation requirements, compares constraint strengths with quantitative hallucination rates, and offers long‑context compression strategies and rigorous testing methods to ensure reliable LLM answers.

Context CompressionLLMRAG

0 likes · 19 min read

Mastering RAG Prompt Engineering: Prevent Hallucinations and Boost Accuracy

Machine Learning Algorithms & Natural Language Processing

Mar 11, 2026 · Industry Insights

GPT‑5.3 Cuts Hallucinations 27% and Gemini Flash‑Lite Slashes Costs – What It Means for AI’s Future

OpenAI and Google released GPT‑5.3 Instant and Gemini 3.1 Flash‑Lite on the same day, both emphasizing lower cost and smoother user experience rather than raw intelligence, with Google pricing its model at one‑eighth of flagship rates and OpenAI reporting a 27% hallucination reduction, signaling a shift in AI competition toward scalability and usability.

AI IndustryAI pricingGPT-5.3

0 likes · 5 min read

GPT‑5.3 Cuts Hallucinations 27% and Gemini Flash‑Lite Slashes Costs – What It Means for AI’s Future

Data Party THU

Mar 8, 2026 · Artificial Intelligence

6 Practical Context‑Engineering Techniques to Tame RAG Hallucinations

This article explains why retrieval‑augmented generation (RAG) models often hallucinate, introduces the concept of context engineering, and details six practical techniques—including selective retrieval, context compression, hierarchical layout, dynamic query rewriting, memory management, and tool‑aware context—along with their trade‑offs and real‑world impact.

AILLMRAG

0 likes · 23 min read

6 Practical Context‑Engineering Techniques to Tame RAG Hallucinations

ShiZhen AI

Mar 4, 2026 · Artificial Intelligence

OpenAI’s GPT‑5.3 Instant: More Accurate, Less Cringe with Hallucination Rate Down 26.8%

OpenAI’s GPT‑5.3 Instant launch trims unnecessary refusals, drops preachy tone, boosts web‑search integration and cuts hallucinations by up to 26.8% in high‑risk domains, while sparking fierce community debate over forced migrations and hinting at an imminent GPT‑5.4.

AI TrustGPT-5.3OpenAI

0 likes · 9 min read

OpenAI’s GPT‑5.3 Instant: More Accurate, Less Cringe with Hallucination Rate Down 26.8%

PaperAgent

Dec 12, 2025 · Artificial Intelligence

What Makes GPT‑5.2 and Gemini‑3‑Pro So Fast? Inside Their Key Features and Real‑World Tests

Gemini‑3‑pro’s surprise debut and OpenAI’s emergency release of GPT‑5.2 highlight a shift toward faster inference, deeper reasoning, and lower hallucination rates, with detailed performance metrics, three‑tier model options, extended context windows, and mixed community test results that reveal both strengths and shortcomings.

AI Model PerformanceGPT-5.2Gemini 3 Pro

0 likes · 4 min read

What Makes GPT‑5.2 and Gemini‑3‑Pro So Fast? Inside Their Key Features and Real‑World Tests

Tencent Advertising Technology

Dec 4, 2025 · Artificial Intelligence

How POPEN Boosts LVLM Reasoning Segmentation with Preference Optimization and Ensemble

The paper introduces POPEN, a new framework that uses preference‑based optimization and ensemble methods to reduce hallucinations and improve segmentation accuracy in large visual language models, achieving state‑of‑the‑art results on multiple benchmarks.

LVLMMultimodal ModelsPreference Optimization

0 likes · 14 min read

How POPEN Boosts LVLM Reasoning Segmentation with Preference Optimization and Ensemble