Tagged articles
4 articles
Page 1 of 1
Alibaba Cloud Developer
Alibaba Cloud Developer
Dec 23, 2025 · Artificial Intelligence

How Hybrid Transformer‑Mamba Architectures Overcome KVCache Challenges in Large‑Model Inference

This article explains how SGLang’s hybrid model design combines Transformer attention with Mamba state‑space layers, introduces a dual‑pool memory architecture and elastic allocation, and presents specialized prefix‑cache and speculative‑decoding techniques that together enable efficient, scalable inference for long‑context large language models.

Inference OptimizationKVCacheLarge Language Models
0 likes · 22 min read
How Hybrid Transformer‑Mamba Architectures Overcome KVCache Challenges in Large‑Model Inference
AI Algorithm Path
AI Algorithm Path
Jun 8, 2025 · Artificial Intelligence

Autoregressive vs Diffusion Language Models: Principles, Trade‑offs, and Future Directions

The article compares autoregressive and diffusion language models, detailing their mathematical foundations, training and inference pipelines, performance trade‑offs such as speed, coherence and diversity, and explores hybrid approaches and emerging research directions for more efficient and controllable text generation.

AI researchText GenerationTransformer
0 likes · 17 min read
Autoregressive vs Diffusion Language Models: Principles, Trade‑offs, and Future Directions
Alimama Tech
Alimama Tech
Aug 25, 2021 · Artificial Intelligence

Advertising Creative Optimization Using Hybrid Bandit Models

The article describes Alibaba Moments’ advertising creative optimization platform, which uses hybrid bandit models that combine visual‑aware ranking priors with exploration‑exploitation algorithms such as Thompson Sampling and LinUCB to dynamically select whole creatives or individual elements, improving click‑through rates and mitigating cold‑start challenges.

Algorithmic Optimizationadvertising creativesbandit models
0 likes · 14 min read
Advertising Creative Optimization Using Hybrid Bandit Models
Tencent Cloud Developer
Tencent Cloud Developer
Sep 14, 2018 · Artificial Intelligence

Top 6 Notable Trends in Deep Learning and Neural Networks

The article surveys six emerging deep‑learning trends—capsule networks that retain spatial hierarchies, data‑efficient deep reinforcement and transfer learning, supervised models, memory‑augmented architectures such as long‑term and progressive networks, and hybrid Bayesian‑GAN approaches—highlighting how these advances expand AI capabilities beyond traditional fully‑connected networks.

AI trendsCapsule NetworksDeep Learning
0 likes · 11 min read
Top 6 Notable Trends in Deep Learning and Neural Networks