AntTech
Apr 2, 2025 · Artificial Intelligence
PEAR: Position-Embedding-Agnostic Attention Re-weighting Enhances Retrieval-Augmented Generation with Zero Inference Overhead
The PEAR framework introduces a position-embedding-agnostic attention re-weighting method that detects and suppresses attention heads detrimental to context use in large language models. It substantially improves retrieval-augmented generation performance without adding any inference overhead, as demonstrated across multiple RAG benchmarks and LLM families.
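The "zero inference overhead" claim rests on the fact that per-head re-weighting coefficients can be folded directly into a layer's existing weights rather than applied at runtime. A minimal sketch of that idea, assuming a standard multi-head output projection `W_O` and hypothetical learned coefficients (names and shapes are illustrative, not the paper's actual implementation):

```python
import torch

def fold_head_coeffs(w_o: torch.Tensor, coeffs: torch.Tensor, head_dim: int) -> torch.Tensor:
    """Fold learned per-head re-weighting coefficients into the attention
    output projection, so suppressing a detrimental head costs nothing at
    inference time.

    w_o:    [num_heads * head_dim, d_model] output projection matrix
    coeffs: [num_heads] learned scales (values near 0 suppress a head)
    """
    # Each head owns a contiguous block of head_dim rows in W_O;
    # broadcast its coefficient across that block.
    scale = coeffs.repeat_interleave(head_dim).unsqueeze(1)  # [num_heads*head_dim, 1]
    return w_o * scale
```

Because the scaled matrix replaces the original `W_O` once after training, the forward pass is unchanged and incurs no extra computation.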
Attention Re-weighting · Context Awareness · LLM