Tagged articles
4 articles
Page 1 of 1
HyperAI Super Neural
HyperAI Super Neural
Jan 14, 2026 · Artificial Intelligence

How OpenAI’s Circuit Sparsity Makes Large Language Model Reasoning Transparent

The article explains OpenAI’s 0.4B‑parameter Circuit Sparsity model, which zeros 99.9% of weights and uses dynamic forced sparsity, activation sparsity, and custom components to turn a dense transformer into an interpretable sparse circuit, and also highlights recent multilingual, portrait‑enhancement, and instruction‑tuned models with online demos.

Circuit SparsityLoRA portrait enhancementOpenAI
0 likes · 8 min read
How OpenAI’s Circuit Sparsity Makes Large Language Model Reasoning Transparent
DataFunSummit
DataFunSummit
Dec 3, 2024 · Artificial Intelligence

Applying Large Language Models to NPC Role‑Playing and Game Localization at Tencent

This article details Tencent's practical exploration of large language model deployment in overseas game scenarios, covering the design of customized NPC role‑playing models, multilingual localization pipelines, data construction, training, evaluation frameworks, multi‑agent improvement loops, and insights from a comprehensive Q&A session.

AI EvaluationNPC AITencent
0 likes · 17 min read
Applying Large Language Models to NPC Role‑Playing and Game Localization at Tencent
DataFunSummit
DataFunSummit
Jan 13, 2022 · Artificial Intelligence

DeltaLM: A Multilingual Pretrained Encoder‑Decoder Model for Neural Machine Translation

DeltaLM is a multilingual pretrained encoder‑decoder model that leverages cross‑lingual transfer from a pretrained encoder and novel decoder architecture, employs span‑corruption and translation‑pair pretraining tasks, and uses a two‑stage fine‑tuning strategy to achieve strong zero‑shot and supervised translation performance across over 100 languages.

Cross-Lingual TransferDeltaLMNeural Machine Translation
0 likes · 12 min read
DeltaLM: A Multilingual Pretrained Encoder‑Decoder Model for Neural Machine Translation
DataFunTalk
DataFunTalk
Apr 7, 2021 · Artificial Intelligence

Alibaba's Advances in Multilingual Neural Machine Translation: Research and Practice

This article presents Alibaba's comprehensive research on multilingual neural machine translation, covering motivations, model architectures, intermediate language modules, data‑augmentation strategies such as repair translation, integration of pre‑trained models with adapters, and engineering optimizations that enable a production‑ready system supporting over 200 languages.

AdapterAlibabaNeural Machine Translation
0 likes · 21 min read
Alibaba's Advances in Multilingual Neural Machine Translation: Research and Practice