Tagged articles
9 articles
PaperAgent
Jan 15, 2026 · Artificial Intelligence

How GAG Enables Zero‑Retrieval, Single‑Token Private Knowledge Injection in LLMs

The article presents GAG, a third‑generation framework that injects proprietary domain knowledge into frozen large language models using a single token. It eliminates retrieval, avoids base model updates, and maintains a constant inference budget while delivering strong performance on private QA and public benchmarks.

AI Alignment · GAG · LLM
8 min read
DataFunTalk
Jul 4, 2025 · Artificial Intelligence

How to Edit Large Language Models: Techniques, Metrics, and Challenges

This article explains model editing—injecting or updating knowledge in AI models—distinguishes it from post‑training, outlines reliability, generalization and locality metrics, and surveys both parameter‑free (e.g., IKE) and parameter‑based methods such as ROME, hypernetworks, and MEND, highlighting practical challenges.

MEND · Rome · hypernetwork
10 min read
Ops Development & AI Practice
Mar 19, 2025 · Artificial Intelligence

How to Fine‑Tune Large Language Models: From PEFT to Knowledge Injection

This article provides a comprehensive guide to customizing pre‑trained large language models through fine‑tuning techniques—including parameter‑efficient methods, data preparation, knowledge injection, and robust evaluation—offering practical steps, best practices, and domain‑specific considerations for achieving superior task performance.

LLM fine-tuning · data preparation · knowledge injection
18 min read
Tencent Advertising Technology
Jan 9, 2025 · Artificial Intelligence

Applying Large Language Models to Search Advertising: End‑to‑End Generative Recall and System Optimizations

This report details how large language models (LLMs) were integrated into Tencent's search advertising pipeline—from early extraction‑distillation experiments in 2023 to a 2024 end‑to‑end generative recall architecture—showing significant improvements in relevance, diversity, and revenue through knowledge injection, supervised fine‑tuning, constrained beam‑search decoding, and high‑performance inference services.

AI · Beam Search · LLM
11 min read
DataFunTalk
Jul 11, 2023 · Artificial Intelligence

Sunshine Insurance Group's Zhèngyán Large Model Open Platform: Architecture, Tools, and Business Applications

The article describes Sunshine Insurance Group's Zhèngyán Large Model Open Platform, detailing its three‑layer architecture, AutoTrain tool, self‑developed LLM, smart routing, plugin marketplace, intelligent review, and how these capabilities empower insurance marketing, sales, service, and management through AI‑driven solutions.

AI Platform · Insurance Technology · Model Deployment
13 min read
Alibaba Cloud Big Data AI Platform
Dec 8, 2022 · Artificial Intelligence

KECP: Enhancing Few-Shot Machine Reading Comprehension via Knowledge-Driven Prompt Tuning

KECP, a Knowledge‑Enhanced Contrastive Prompt‑tuning model, achieves strong few‑shot extractive question answering by converting questions to masked statements, injecting external knowledge via gated fusion, and leveraging contrastive learning alongside masked language modeling, as demonstrated on EMNLP‑2022 benchmarks.

NLP · contrastive learning · knowledge injection
9 min read
DataFunTalk
Oct 30, 2022 · Artificial Intelligence

SPACE and Proton: Semi‑Supervised Knowledge Injection and Probing‑Tuning for Pretrained Conversational AI Models

This article reviews Alibaba DAMO‑ConvAI’s work on large‑scale conversational AI, comparing pretrained language and dialogue models, introducing the SPACE semi‑supervised knowledge‑injection framework and the Proton probing‑tuning method for extracting and applying model knowledge to downstream tasks.

Pretrained Dialogue Model · Probing Tuning · Semi-supervised Learning
21 min read