Tagged articles

hybrid models

4 articles

Dec 23, 2025 · Artificial Intelligence

How Hybrid Transformer‑Mamba Architectures Overcome KVCache Challenges in Large‑Model Inference

This article explains how SGLang’s hybrid model design combines Transformer attention with Mamba state‑space layers, introduces a dual‑pool memory architecture and elastic allocation, and presents specialized prefix‑cache and speculative‑decoding techniques that together enable efficient, scalable inference for long‑context large language models.

Inference Optimization · KVCache · Large Language Models

0 likes · 22 min read

AI Algorithm Path

Jun 8, 2025 · Artificial Intelligence

Autoregressive vs Diffusion Language Models: Principles, Trade‑offs, and Future Directions

The article compares autoregressive and diffusion language models, detailing their mathematical foundations, training and inference pipelines, performance trade‑offs such as speed, coherence and diversity, and explores hybrid approaches and emerging research directions for more efficient and controllable text generation.

AI research · Language Models · Text Generation

0 likes · 17 min read

Alimama Tech

Aug 25, 2021 · Artificial Intelligence

Advertising Creative Optimization Using Hybrid Bandit Models

The article describes Alibaba Moments’ advertising creative optimization platform, which uses hybrid bandit models that combine visual‑aware ranking priors with exploration‑exploitation algorithms such as Thompson Sampling and LinUCB to dynamically select whole creatives or individual elements, improving click‑through rates and mitigating cold‑start challenges.

Algorithmic Optimization · advertising creatives · bandit models

0 likes · 14 min read

Tencent Cloud Developer

Sep 14, 2018 · Artificial Intelligence

Top 6 Notable Trends in Deep Learning and Neural Networks

The article surveys six emerging deep‑learning trends: capsule networks that retain spatial hierarchies, deep reinforcement learning, data‑efficient learning such as transfer learning, supervised models, memory‑augmented architectures such as long‑term memory networks and progressive neural networks, and hybrid Bayesian‑GAN approaches. It highlights how these advances expand AI capabilities beyond traditional fully‑connected networks.

AI trends · Capsule Networks · Deep Learning

0 likes · 11 min read
