Tagged articles

1M context

10 articles · Page 1 of 1

Jun 24, 2026 · Artificial Intelligence

Why 1M Context Length Matters: Inside GLM 5.2’s New Techniques

The article examines how 1‑million‑token context has become a standard feature in modern LLMs, explains the compute and memory challenges it brings, reviews the sparse‑attention and token‑selection tricks (including GLM 5.2’s IndexShare and LayerSplit), and outlines practical evaluation methods for measuring long‑context effectiveness.

1M contextGLM-5.2IndexShare

0 likes · 10 min read

Why 1M Context Length Matters: Inside GLM 5.2’s New Techniques

AI Engineering

Jun 17, 2026 · Artificial Intelligence

How GLM-5.2 Surpassed Claude Fable 5 to Top Design Arena Rankings

GLM-5.2, the new open‑source LLM from Zhipu, offers a stable 1 M token context, adjustable coding inference strength, and an IndexShare architecture that cuts FLOPs per token by 2.9×, achieving the highest Elo score on Design Arena and leading multiple coding benchmarks against both open‑source and proprietary models.

1M contextGLM-5.2LLM

0 likes · 10 min read

How GLM-5.2 Surpassed Claude Fable 5 to Top Design Arena Rankings

Machine Heart

Jun 1, 2026 · Artificial Intelligence

MiniMax M3: First Open‑Source Model to Achieve the Frontier Trio – Our Three‑Task Evaluation

MiniMax M3 claims to be the first open‑source LLM that simultaneously delivers top‑tier coding/agentic ability, a 1‑million‑token context window, and native multimodal understanding, and our benchmarks on coding suites, long‑context efficiency, and multimodal tasks confirm it exceeds expectations.

1M contextMiniMax M3Open-source LLM

0 likes · 15 min read

MiniMax M3: First Open‑Source Model to Achieve the Frontier Trio – Our Three‑Task Evaluation

Amazon Cloud Developers

May 25, 2026 · Artificial Intelligence

Configure Claude Cowork with Amazon Bedrock in 10 Minutes (No Anthropic Account Required)

This step‑by‑step guide shows how to set up Claude Cowork to use Amazon Bedrock as its inference backend—without registering an Anthropic account—by installing the desktop client, generating a Bedrock API key, enabling Developer Mode, configuring third‑party inference, editing the config file to unlock 1 M‑token context, and handling common issues on macOS and Windows.

1M contextAI assistantAWS

0 likes · 16 min read

Configure Claude Cowork with Amazon Bedrock in 10 Minutes (No Anthropic Account Required)

Full-Stack DevOps & Kubernetes

Apr 30, 2026 · Artificial Intelligence

DeepSeek‑V4 Launch: Open‑Source Model Matching Top Closed‑Source Performance with Dual Versions

DeepSeek‑V4, released on April 24 2026, offers open‑source Pro and Flash versions with 1 M‑token context, benchmark‑leading performance, advanced agent capabilities, sparse‑attention efficiency, competitive pricing, and flexible deployment options for developers, enterprises, and content creators.

1M contextDeepSeek V4Open-source LLM

0 likes · 7 min read

DeepSeek‑V4 Launch: Open‑Source Model Matching Top Closed‑Source Performance with Dual Versions

Architect

Apr 25, 2026 · Artificial Intelligence

DeepSeek V4: 1M‑Token Context’s Impact on Model, Inference, Cache & Agents

The DeepSeek V4 technical report shows how a 1 million‑token context forces a redesign of attention, KV‑cache, optimizer, quantization and inference budgeting, turning long‑context capability from a costly showcase into a production‑ready feature for agents, search and Chinese professional tasks.

1M contextAgentic SearchAttention optimization

0 likes · 28 min read

DeepSeek V4: 1M‑Token Context’s Impact on Model, Inference, Cache & Agents

Machine Learning Algorithms & Natural Language Processing

Apr 25, 2026 · Artificial Intelligence

DeepSeek V4 Unveiled: 1M‑Token Context and New Architecture Challenge Closed‑Source LLMs

DeepSeek V4 introduces two flagship models—V4‑Pro with 1.6 T parameters and V4‑Flash with 284 B parameters—offering million‑token context, mixed attention (CSA + HCA), manifold‑constrained residuals, and the Muon optimizer, delivering open‑source performance that rivals top closed‑source LLMs while cutting inference cost dramatically.

1M contextDeepSeekMuon Optimizer

0 likes · 10 min read

DeepSeek V4 Unveiled: 1M‑Token Context and New Architecture Challenge Closed‑Source LLMs

ITPUB

Apr 24, 2026 · Artificial Intelligence

DeepSeek V4 Unleashed: 1M‑Token Context Becomes Commodity, Teams with Ascend to Challenge Compute Dominance

DeepSeek released two V4 models—Pro and Flash—both supporting 1‑million‑token context as a standard feature, showcasing top‑tier agentic coding, world‑knowledge, and inference performance, while introducing DSA sparse attention and announcing upcoming large‑scale deployment on Huawei Ascend hardware.

1M contextAI inferenceDSA sparse attention

0 likes · 6 min read

DeepSeek V4 Unleashed: 1M‑Token Context Becomes Commodity, Teams with Ascend to Challenge Compute Dominance

IT Services Circle

Apr 24, 2026 · Artificial Intelligence

DeepSeek V4 Released: Open-Source LLM Challenges Closed-Source Leaders and Partners with Huawei Chips

DeepSeek V4 launches in two versions—Pro and Flash—offering 1 M token context, enhanced agent capabilities, world‑knowledge and reasoning performance, a new token‑compression attention mechanism with DSA sparse attention, Huawei compute support, updated APIs, and a migration plan for legacy models.

1M contextAPI IntegrationDSA sparse attention

0 likes · 8 min read

DeepSeek V4 Released: Open-Source LLM Challenges Closed-Source Leaders and Partners with Huawei Chips

Tech Musings

Apr 24, 2026 · Artificial Intelligence

DeepSeek-V4 Unveiled: 1M Context Length and Ascend Compute Power

DeepSeek has launched the open‑source DeepSeek‑V4 series, offering Pro and Flash models with a 1 million token context window, a novel sparse attention mechanism, performance that rivals Opus 4.6 on coding and knowledge benchmarks, tiered pricing, and future cost reductions once Ascend 950 supernodes become widely available.

1M contextAI benchmarkingDeepSeek V4

0 likes · 5 min read

DeepSeek-V4 Unveiled: 1M Context Length and Ascend Compute Power