Tagged articles
4 articles
Page 1 of 1
SuanNi
SuanNi
Mar 9, 2026 · Artificial Intelligence

How Hypernetworks Turn Documents into Instant LLM Skills

This article analyzes the memory and adaptation limits of large language models and presents a hypernetwork‑based approach that instantly converts documents or task descriptions into low‑rank LoRA modules, enabling cheap, on‑demand model updates and cross‑modal knowledge transfer.

AILLMLoRA
0 likes · 9 min read
How Hypernetworks Turn Documents into Instant LLM Skills
Efficient Ops
Efficient Ops
May 29, 2025 · Artificial Intelligence

DeepSeek R1 0528 Update: New Features, Performance Gains Over OpenAI o3

DeepSeek quietly launched the R1 0528 model, which early testers report matches OpenAI’s o3 in benchmarks and style, while adding deeper chain‑of‑thought reasoning, better writing output, and extended thinking windows, and the announcement is followed by a promotion for the GOPS Global Ops Conference.

AI PerformanceDeepSeekModel Update
0 likes · 3 min read
DeepSeek R1 0528 Update: New Features, Performance Gains Over OpenAI o3
WeChat Backend Team
WeChat Backend Team
Aug 5, 2022 · Artificial Intelligence

How WeChat’s Ekko Achieves Ultra‑Low‑Latency Model Updates for Billion‑User Recommendations

At the 16th OSDI conference, Tencent’s WeChat team presented the award‑winning Ekko system—a groundbreaking, ultra‑low‑latency model‑update solution for massive recommendation workloads that dramatically speeds up updates, supports over a trillion‑scale models, and has already boosted user engagement across billions of daily users.

Low latencyModel UpdateRecommendation Systems
0 likes · 5 min read
How WeChat’s Ekko Achieves Ultra‑Low‑Latency Model Updates for Billion‑User Recommendations