Tagged articles

knowledge injection

9 articles · Page 1 of 1

Jan 15, 2026 · Artificial Intelligence

How GAG Enables Zero‑Retrieval, Single‑Token Private Knowledge Injection in LLMs

The article presents GAG, a third‑generation framework that injects proprietary domain knowledge into frozen large language models using a single token, eliminating retrieval, avoiding base model updates, and maintaining constant inference budget while delivering strong performance on private QA and public benchmarks.

AI alignmentGAGLLM

0 likes · 8 min read

How GAG Enables Zero‑Retrieval, Single‑Token Private Knowledge Injection in LLMs

DataFunTalk

Jul 4, 2025 · Artificial Intelligence

How to Edit Large Language Models: Techniques, Metrics, and Challenges

This article explains model editing—injecting or updating knowledge in AI models—distinguishes it from post‑training, outlines reliability, generalization and locality metrics, and surveys both parameter‑free (e.g., IKE) and parameter‑based methods such as ROME, hypernetworks, and MEND, highlighting practical challenges.

MENDRomehypernetwork

0 likes · 10 min read

How to Edit Large Language Models: Techniques, Metrics, and Challenges

Ops Development & AI Practice

Mar 19, 2025 · Artificial Intelligence

How to Fine‑Tune Large Language Models: From PEFT to Knowledge Injection

This article provides a comprehensive guide to customizing pre‑trained large language models through fine‑tuning techniques—including parameter‑efficient methods, data preparation, knowledge injection, and robust evaluation—offering practical steps, best practices, and domain‑specific considerations for achieving superior task performance.

LLM fine-tuningdata preparationknowledge injection

0 likes · 18 min read

How to Fine‑Tune Large Language Models: From PEFT to Knowledge Injection

Tencent Advertising Technology

Jan 9, 2025 · Artificial Intelligence

Applying Large Language Models to Search Advertising: End‑to‑End Generative Recall and System Optimizations

This report details how large language models (LLMs) were integrated into Tencent's search advertising pipeline—from early extraction‑distillation experiments in 2023 to a 2024 end‑to‑end generative recall architecture—showing significant improvements in relevance, diversity, and revenue through knowledge injection, supervised fine‑tuning, constrained beam‑search decoding, and high‑performance inference services.

AIBeam SearchLLM

0 likes · 11 min read

Applying Large Language Models to Search Advertising: End‑to‑End Generative Recall and System Optimizations

JD Cloud Developers

Jul 16, 2024 · Artificial Intelligence

How Task‑Aware Decoding and RAG Reduce Hallucinations in Large Language Models

This article reviews the hallucination problem in large language models, analyzes its data, training, and inference sources, and presents Task‑aware Decoding (TaD) and Retrieval‑Augmented Generation (RAG) as effective, plug‑and‑play solutions demonstrated through extensive experiments.

AIDoLaHallucination

0 likes · 16 min read

How Task‑Aware Decoding and RAG Reduce Hallucinations in Large Language Models

NewBeeNLP

Apr 7, 2024 · Artificial Intelligence

Can Large Language Models Learn Recommendation Knowledge? A NL‑Simulated Auxiliary Task

This article reviews a recent study that bridges the knowledge gap between large language models and recommendation systems by generating natural‑language auxiliary tasks, fine‑tuning the models, and achieving notable performance gains on Amazon domain benchmarks.

AI researchfine-tuningknowledge injection

0 likes · 4 min read

Can Large Language Models Learn Recommendation Knowledge? A NL‑Simulated Auxiliary Task

DataFunTalk

Jul 11, 2023 · Artificial Intelligence

Sunshine Insurance Group's Zhèngyán Large Model Open Platform: Architecture, Tools, and Business Applications

The article describes Sunshine Insurance Group's Zhèngyán Large Model Open Platform, detailing its three‑layer architecture, AutoTrain tool, self‑developed LLM, smart routing, plugin marketplace, intelligent review, and how these capabilities empower insurance marketing, sales, service, and management through AI‑driven solutions.

AI platformInsurance TechnologyModel Deployment

0 likes · 13 min read

Sunshine Insurance Group's Zhèngyán Large Model Open Platform: Architecture, Tools, and Business Applications

Alibaba Cloud Big Data AI Platform

Dec 8, 2022 · Artificial Intelligence

KECP: Enhancing Few-Shot Machine Reading Comprehension via Knowledge-Driven Prompt Tuning

KECP, a Knowledge‑Enhanced Contrastive Prompt‑tuning model, achieves strong few‑shot extractive question answering by converting questions to masked statements, injecting external knowledge via gated fusion, and leveraging contrastive learning alongside masked language modeling, as demonstrated on EMNLP‑2022 benchmarks.

NLPcontrastive learningknowledge injection

0 likes · 9 min read

KECP: Enhancing Few-Shot Machine Reading Comprehension via Knowledge-Driven Prompt Tuning

DataFunTalk

Oct 30, 2022 · Artificial Intelligence

SPACE and Proton: Semi‑Supervised Knowledge Injection and Probing‑Tuning for Pretrained Conversational AI Models

This article reviews Alibaba DAMO‑ConvAI’s work on large‑scale conversational AI, comparing pretrained language and dialogue models, introducing the SPACE semi‑supervised knowledge‑injection framework and the Proton probing‑tuning method for extracting and applying model knowledge to downstream tasks.

Pretrained Dialogue ModelProbing TuningSemi-supervised Learning

0 likes · 21 min read

SPACE and Proton: Semi‑Supervised Knowledge Injection and Probing‑Tuning for Pretrained Conversational AI Models