Tagged articles
42 articles
Page 1 of 1
Architecture & Thinking
Architecture & Thinking
Apr 26, 2026 · Artificial Intelligence

DeepSeek V4: How Million‑Token Context and Open‑Source Design Redefine AI Ecosystems

DeepSeek V4, released on April 24, 2026, introduces a 1‑million‑token context via DSA sparse attention, offers Pro and Flash variants, adapts to domestic AI chips, cuts compute costs dramatically, and leverages open‑source weights to challenge the dominance of closed‑source LLMs, reshaping the global AI landscape.

AI hardware adaptationAgentic AIDeepSeek-V4
0 likes · 9 min read
DeepSeek V4: How Million‑Token Context and Open‑Source Design Redefine AI Ecosystems
ZhiKe AI
ZhiKe AI
Apr 24, 2026 · Artificial Intelligence

DeepSeek V4 Launch: Open‑Source Model Beats Closed‑Source Leaders in Coding & Math, 1.6 T Params, 1 M Context

DeepSeek V4, released today, offers two open‑source models (Pro and Flash) with up to 1.6 T parameters and a 1‑million‑token context, achieving top‑tier programming and mathematics benchmark scores that surpass the three major closed‑source competitors, while cutting API costs to a fraction of the price.

APIBenchmarkDeepSeek
0 likes · 7 min read
DeepSeek V4 Launch: Open‑Source Model Beats Closed‑Source Leaders in Coding & Math, 1.6 T Params, 1 M Context
IT Services Circle
IT Services Circle
Apr 24, 2026 · Artificial Intelligence

DeepSeek V4 Released: Open-Source LLM Challenges Closed-Source Leaders and Partners with Huawei Chips

DeepSeek V4 launches in two versions—Pro and Flash—offering 1 M token context, enhanced agent capabilities, world‑knowledge and reasoning performance, a new token‑compression attention mechanism with DSA sparse attention, Huawei compute support, updated APIs, and a migration plan for legacy models.

1M contextDSA sparse attentionDeepSeek-V4
0 likes · 8 min read
DeepSeek V4 Released: Open-Source LLM Challenges Closed-Source Leaders and Partners with Huawei Chips
PaperAgent
PaperAgent
Apr 22, 2026 · Artificial Intelligence

Alibaba Unveils Four New Open‑Source Qwen3.6 Models: 27B Dense and 35B‑A3B MoE

Alibaba has added four new open‑source weight versions to its Qwen3.6 series, featuring the 27‑billion‑parameter dense multimodal model Qwen3.6‑27B and the 35‑billion‑parameter sparse expert model Qwen3.6‑35B‑A3B, both designed for stable, real‑world coding tasks and outperforming their Qwen3.5 predecessors.

AI agentsAlibabaDense Model
0 likes · 4 min read
Alibaba Unveils Four New Open‑Source Qwen3.6 Models: 27B Dense and 35B‑A3B MoE
ZhiKe AI
ZhiKe AI
Apr 21, 2026 · Artificial Intelligence

Open-Source Kimi K2.6 Beats GPT‑5.4 and Claude Opus 4.6 in Code Generation

Kimi K2.6, an open‑source Chinese LLM, outperforms GPT‑5.4 and Claude Opus 4.6 on SWE‑Bench Pro code tests, delivers 13‑hour uninterrupted coding, runs 300 parallel agents, and costs only one‑twentieth of comparable closed‑source models, while offering a trillion‑parameter MoE architecture and Apache 2.0 licensing.

AI model benchmarksApache 2.0Code Generation
0 likes · 9 min read
Open-Source Kimi K2.6 Beats GPT‑5.4 and Claude Opus 4.6 in Code Generation
HyperAI Super Neural
HyperAI Super Neural
Apr 16, 2026 · Artificial Intelligence

Open-Source Small LLMs Reach GPT‑5‑Level Intelligence: One‑Stop Evaluation of Qwen 3.5, Gemma 4 and Other Top Models

A recent Artificial Analysis report finds that the 27‑billion‑parameter Qwen 3.5 and 31‑billion‑parameter Gemma 4 models achieve Intelligence Index scores comparable to GPT‑5, and the article details their benchmark results, multimodal capabilities, deployment on a single NVIDIA H100, and provides one‑click notebook tutorials for several open‑source LLMs.

DeploymentGemma 4Intelligence Index
0 likes · 8 min read
Open-Source Small LLMs Reach GPT‑5‑Level Intelligence: One‑Stop Evaluation of Qwen 3.5, Gemma 4 and Other Top Models
SuanNi
SuanNi
Apr 11, 2026 · Industry Insights

Why Chinese Open‑Source LLMs Overtook the US in 2025: Data‑Driven Insights

A data‑driven report reveals that by summer 2025 Chinese open‑source language models surpassed U.S. counterparts in both download volume and real‑world inference usage, reshaping the global AI ecosystem through rapid adoption, aggressive model release strategies, and shifting developer preferences.

AI model trendsChina AIModel downloads
0 likes · 12 min read
Why Chinese Open‑Source LLMs Overtook the US in 2025: Data‑Driven Insights
AI Engineering
AI Engineering
Feb 12, 2026 · Artificial Intelligence

GLM-5 Unveiled: 744B‑Parameter Model Takes on Claude in Complex Tasks

GLM-5, the new 744‑billion‑parameter open‑source LLM, expands on GLM‑4.5 with GlmMoeDsa architecture, achieves higher HLE benchmark scores than Claude Opus 4.5, demonstrates strong long‑context and agent capabilities, supports vLLM/SGLang, runs on various Chinese chips, and can directly generate Office documents.

AI benchmarksChinese chipsClaude
0 likes · 5 min read
GLM-5 Unveiled: 744B‑Parameter Model Takes on Claude in Complex Tasks
Old Zhang's AI Learning
Old Zhang's AI Learning
Feb 5, 2026 · Artificial Intelligence

Distilling GLM‑4.7‑Flash with Claude‑Opus‑4.5 for Easy Consumer‑GPU Deployment

The article explains how TeichAI used Claude‑Opus‑4.5 to generate a high‑quality 250‑sample reasoning dataset and distill the GLM‑4.7‑Flash model into a compact GGUF version that runs on a single consumer‑grade GPU via llama.cpp, detailing the workflow, quantization options, and practical considerations.

AI datasetsGGUFUnsloth
0 likes · 6 min read
Distilling GLM‑4.7‑Flash with Claude‑Opus‑4.5 for Easy Consumer‑GPU Deployment
Old Zhang's AI Learning
Old Zhang's AI Learning
Feb 3, 2026 · Artificial Intelligence

Step‑3.5‑Flash: Lightning‑Fast Inference with 196B Params, Only 11B Active (vLLM)

Step‑3.5‑Flash, a 196‑billion‑parameter open‑source LLM that activates only 11 B per token via a Mixture‑of‑Experts design, delivers 3‑plus‑times faster inference, matches top‑tier closed‑source models on SWE‑bench and other benchmarks, supports 256 K context, runs on consumer‑grade hardware, and is already integrated into vLLM, SGLang, and Claude Code, though it has known token‑efficiency and domain‑stability limitations.

LLM BenchmarkMoEMulti-token Prediction
0 likes · 11 min read
Step‑3.5‑Flash: Lightning‑Fast Inference with 196B Params, Only 11B Active (vLLM)
AI Insight Log
AI Insight Log
Jan 20, 2026 · Artificial Intelligence

Is GLM-4.7-Flash the New 30B‑Level LLM King? Open‑Source and Ollama‑Ready

GLM‑4.7‑Flash, a 30B‑parameter MoE LLM released as fully open‑source and free, delivers 30B‑class performance across six benchmarks, runs locally with a single Ollama command, and offers a faster cloud‑hosted version with modest token‑based pricing, though hardware costs still apply.

Anthropic APIBenchmarkGLM-4.7-Flash
0 likes · 7 min read
Is GLM-4.7-Flash the New 30B‑Level LLM King? Open‑Source and Ollama‑Ready
ShiZhen AI
ShiZhen AI
Jan 13, 2026 · Artificial Intelligence

Can a 30B Open‑Source Model Match Closed‑Source Giants? MiroThinker 1.5 Review

MiroThinker 1.5 adopts a "scientist" mode with Interactive Scaling, runs a hypothesis‑evidence loop, scores 56.1 on the BrowseComp benchmark—close to Gemini DeepSearch’s 59.2—while supporting up to 400 tool calls, 256K context, and delivers detailed research reports, all as an open‑source project on GitHub.

BenchmarkMiroThinkerSearch AI
0 likes · 8 min read
Can a 30B Open‑Source Model Match Closed‑Source Giants? MiroThinker 1.5 Review
Programmer's Advance
Programmer's Advance
Jan 12, 2026 · Artificial Intelligence

DeepSeek V4 Review: Open‑Source 1‑Trillion‑Parameter Model That Beats Claude & GPT for Developers

DeepSeek V4, the upcoming open‑source 1‑trillion‑parameter coding model, claims to surpass Claude and GPT with innovations like mHC, DSA and MoE, offering 1 M‑plus token context, 10× faster inference, and dramatically lower API costs—making it a game‑changer for most developers while reserving local deployment for only a few large enterprises.

AI coding modelAPI vs local deploymentDeepSeek-V4
0 likes · 19 min read
DeepSeek V4 Review: Open‑Source 1‑Trillion‑Parameter Model That Beats Claude & GPT for Developers
PaperAgent
PaperAgent
Dec 31, 2025 · Industry Insights

Who Will Lead China’s Open‑Source LLM Race in 2025? A Deep Dive

The 2025 review reveals how Chinese open‑source large language models shifted from a single‑dominant player to a fierce top‑ten battle, highlighting DeepSeek R1’s breakout, the rise of Qwen, MiniMax, Kimi, upcoming IPOs, a detailed release schedule, and bold predictions for 2026.

2025 reviewChina AIIndustry analysis
0 likes · 6 min read
Who Will Lead China’s Open‑Source LLM Race in 2025? A Deep Dive
PaperAgent
PaperAgent
Dec 4, 2025 · Artificial Intelligence

Mistral 3 Unveiled: How Its New Open‑Source Models Redefine Performance and Cost

Mistral AI’s latest open‑source release, Mistral 3, introduces three compact dense models and the powerful Mistral Large 3 MoE model, outperforming domestic rivals in benchmarks, offering strong multilingual and multimodal capabilities, and delivering the lowest cost‑performance ratio among open‑source LLMs.

Mistral 3Mixture of ExpertsModel Benchmark
0 likes · 4 min read
Mistral 3 Unveiled: How Its New Open‑Source Models Redefine Performance and Cost
PaperAgent
PaperAgent
Dec 2, 2025 · Artificial Intelligence

How DeepSeek‑V3.2’s New Agent Architecture Bridges the Gap to Closed‑Source LLMs

DeepSeek‑V3.2 introduces a reinforced‑agent framework that combines a synthetic task factory, scaling reinforcement learning, and advanced context management, achieving the highest open‑source agent scores and narrowing the performance gap with leading closed‑source models such as Claude‑4.5‑Sonnet, GPT‑5‑High, and Gemini‑3.0‑Pro.

AI agentsAgent ArchitectureDeepSeek
0 likes · 7 min read
How DeepSeek‑V3.2’s New Agent Architecture Bridges the Gap to Closed‑Source LLMs
HyperAI Super Neural
HyperAI Super Neural
Nov 28, 2025 · Artificial Intelligence

Weekly AI paper roundup: protein design, open‑source agent, HunyuanOCR, Olmo 3

This weekly roundup highlights five recent AI papers—including HumanSense for multimodal LLM evaluation, JAM‑2 for de novo antibody design, the open‑source Olmo 3 language models, the Lumine generalist 3D agent, and the lightweight HunyuanOCR vision‑language model—summarizing their core contributions, results, and links.

OCRgeneralist agentsmultimodal LLM
0 likes · 6 min read
Weekly AI paper roundup: protein design, open‑source agent, HunyuanOCR, Olmo 3
Data Party THU
Data Party THU
Aug 23, 2025 · Artificial Intelligence

How MiroMind‑M1 Sets New Benchmarks in Open‑Source Math Reasoning

The article presents MiroMind‑M1, an open‑source math‑reasoning language model that combines a 719K high‑quality SFT dataset, a novel CAMPO reinforcement‑learning algorithm, and extensive evaluations on AIME24, AIME25, and MATH‑500, demonstrating state‑of‑the‑art performance while reducing token usage.

CAMPOevaluationmath reasoning
0 likes · 11 min read
How MiroMind‑M1 Sets New Benchmarks in Open‑Source Math Reasoning
Instant Consumer Technology Team
Instant Consumer Technology Team
Aug 20, 2025 · Artificial Intelligence

Nvidia Unveils Nemotron‑Nano‑9B‑v2: Tiny Open‑Source LLM with Switchable Reasoning

Nvidia’s newly released Nemotron‑Nano‑9B‑v2, a 9‑billion‑parameter open‑source LLM optimized for a single Nvidia A10 GPU, introduces a toggleable reasoning mode and budget controls, delivering up to six‑fold speed gains, multilingual support, and strong benchmark results across various tasks.

AI inferenceMambaNvidia
0 likes · 5 min read
Nvidia Unveils Nemotron‑Nano‑9B‑v2: Tiny Open‑Source LLM with Switchable Reasoning
Programmer DD
Programmer DD
Aug 6, 2025 · Artificial Intelligence

What Is GPT-OSS? Inside OpenAI’s New Open‑Source Large Language Models

OpenAI has unveiled GPT‑OSS, an open‑source large language model series featuring a 120‑billion‑parameter version for high‑throughput production and a 20‑billion‑parameter version for low‑latency consumer hardware, both using Mixture‑of‑Experts architecture, 4‑bit quantization, and released under the permissive Apache 2.0 license.

4-bit quantizationApache 2.0 licenseGPT-OSS
0 likes · 3 min read
What Is GPT-OSS? Inside OpenAI’s New Open‑Source Large Language Models
JavaEdge
JavaEdge
Jul 28, 2025 · Artificial Intelligence

Why Kimi K2 Is the Next Open-Source LLM Challenging DeepSeek

The article examines Kimi K2, Moonshot AI’s open‑source large language model, detailing its MoE architecture, low‑cost pricing, agentic capabilities, performance comparisons with Claude and DeepSeek, and real‑world developer experiences, while discussing its potential impact on the AI landscape.

AI costDeveloper ExperienceKimi K2
0 likes · 8 min read
Why Kimi K2 Is the Next Open-Source LLM Challenging DeepSeek
DataFunTalk
DataFunTalk
Jun 17, 2025 · Artificial Intelligence

MiniMax M1: Open‑Source LLM That Rivals Gemini 2.5 Pro in Long‑Context Benchmarks

MiniMax’s newly released open‑source M1 model, built on the Lightning Attention‑enhanced MiniMax‑01 base, delivers up to 1 million token context, achieves near‑state‑of‑the‑art performance on MRCR and other long‑context benchmarks, and showcases impressive multilingual translation, code completion, and creative applications.

Lightning AttentionMiniMaxbenchmark evaluation
0 likes · 11 min read
MiniMax M1: Open‑Source LLM That Rivals Gemini 2.5 Pro in Long‑Context Benchmarks
AI Algorithm Path
AI Algorithm Path
Mar 26, 2025 · Artificial Intelligence

DeepSeek V3-0324 Upgrade Delivers Smarter Coding and Higher Code Quality

The DeepSeek V3-0324 model, released on March 24, 2025 with 6.85 trillion parameters and a Mixture‑of‑Experts architecture, is fully open‑source on Hugging Face and brings notable upgrades in coding ability, structured responses, stability, generation length, and speed, while offering performance comparable to leading closed‑source models such as Claude 3.7.

AI code generationCoding AIDeepSeek
0 likes · 10 min read
DeepSeek V3-0324 Upgrade Delivers Smarter Coding and Higher Code Quality
Architect
Architect
Feb 22, 2025 · Artificial Intelligence

How Open‑Source Projects Reproduced DeepSeek‑R1 and Pushed LLM Limits

This article reviews the most notable open‑source reproductions of DeepSeek‑R1—including Open R1, OpenThoughts, LIMO and DeepScaleR—detailing their data pipelines, training steps, reinforcement‑learning strategies, dataset constructions, and benchmark results that demonstrate how small, high‑quality data can rival massive‑scale models.

AI researchDeepSeek-R1Model Scaling
0 likes · 26 min read
How Open‑Source Projects Reproduced DeepSeek‑R1 and Pushed LLM Limits
21CTO
21CTO
Jul 24, 2024 · Artificial Intelligence

Meta’s Llama 3.1 405B: How the Open‑Source Giant Stands Up to GPT‑4 and Claude 3.5

Meta’s newly released Llama 3.1 series, highlighted by the 405B model trained on 150 trillion tokens, claims state‑of‑the‑art performance in coding, mathematics, and multilingual summarization while offering an open‑source alternative to GPT‑4o and Claude 3.5 Sonnet.

AI competitionLlama 3.1large language models
0 likes · 6 min read
Meta’s Llama 3.1 405B: How the Open‑Source Giant Stands Up to GPT‑4 and Claude 3.5
21CTO
21CTO
Apr 8, 2024 · Artificial Intelligence

Download and Run Ollama with LLaMA 2 and LLaVA Locally

This tutorial walks you through downloading Ollama, an open‑source LLM platform, and demonstrates how to run the Meta LLaMA 2 text model and the multimodal LLaVA model on your own computer, including command‑line usage and image‑based queries.

AI TutorialLLaVALlama-2
0 likes · 7 min read
Download and Run Ollama with LLaMA 2 and LLaVA Locally
21CTO
21CTO
Mar 29, 2024 · Artificial Intelligence

Why Databricks’ Open‑Source DBRX LLM Is Outpacing GPT‑3.5 and Llama 2

Databricks unveiled the open‑source DBRX large language model, which leverages a mixed‑expert architecture to deliver faster, more cost‑effective inference and beats leading open‑source and proprietary models like Llama 2, Mixtral‑8x7B, and GPT‑3.5 on multiple benchmarks.

AIDBRXDatabricks
0 likes · 7 min read
Why Databricks’ Open‑Source DBRX LLM Is Outpacing GPT‑3.5 and Llama 2
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Feb 23, 2024 · Artificial Intelligence

Google’s Open‑Source Gemma Large Language Model: Architecture, Performance, and Community Reception

Google has released the open‑source Gemma LLM series (2B and 7B parameters) built on Gemini‑style architecture, offering free, commercial‑ready models that run on notebooks, support JAX/PyTorch/TensorFlow, outperform many open‑source peers, and have quickly sparked extensive community testing and discussion.

GemmaGoogleJAX
0 likes · 5 min read
Google’s Open‑Source Gemma Large Language Model: Architecture, Performance, and Community Reception
21CTO
21CTO
Feb 22, 2024 · Artificial Intelligence

How Google’s Open‑Source Gemma Model Brings LLM Power to Your Laptop

Google’s newly released open‑source Gemma models let developers run powerful large‑language‑model workloads on notebooks, workstations, or cloud platforms, offering competitive performance, extensive tooling, and built‑in safety measures for responsible AI deployment.

AI SafetyGemmaGoogle AI
0 likes · 6 min read
How Google’s Open‑Source Gemma Model Brings LLM Power to Your Laptop
Programmer DD
Programmer DD
Feb 22, 2024 · Artificial Intelligence

Google Unveils Gemma: Open‑Source LLM Matching Gemini’s Power

Google has launched Gemma, an open‑source large language model available in 2B and 7B parameter versions, built on the same technology as Gemini, outperforming many existing models and capable of running on ordinary laptops, with a detailed technical report and quick‑start guide provided online.

AIGemmaGoogle
0 likes · 3 min read
Google Unveils Gemma: Open‑Source LLM Matching Gemini’s Power
21CTO
21CTO
Dec 31, 2023 · Artificial Intelligence

2023’s Leading Open-Source LLMs: LLaMA, Pythia, MPT, Falcon, BLOOM, Mistral

Since ChatGPT’s debut, interest in large language models has surged, prompting the AI community to explore open‑source alternatives such as LLaMA, Pythia, MPT, Falcon, BLOOM, and Mistral, which together illustrate the rapid diversification and growing competitiveness of open‑source LLMs in 2023.

2023AIlarge language model
0 likes · 9 min read
2023’s Leading Open-Source LLMs: LLaMA, Pythia, MPT, Falcon, BLOOM, Mistral
21CTO
21CTO
Sep 14, 2023 · Artificial Intelligence

Unlocking Falcon 180B: The World’s Most Powerful Open‑Source LLM

Falcon 180B, the newly released 180‑billion‑parameter open‑source LLM from TII, outperforms Llama 2 and rivals top commercial models across numerous benchmarks, offers free commercial use, and comes with detailed hardware requirements, prompt formats, and ready‑to‑run code examples for developers.

AI modelFalcon 180BHardware Requirements
0 likes · 9 min read
Unlocking Falcon 180B: The World’s Most Powerful Open‑Source LLM
Tencent Cloud Developer
Tencent Cloud Developer
Aug 14, 2023 · Artificial Intelligence

Overview of Open‑Source Large Language Models: Llama 2, ChatGLM 2, Usage, Fine‑Tuning and Comparison

The article reviews the rapid evolution of open‑source large language models, detailing Meta’s Llama 2 series and Tsinghua’s ChatGLM 2, their enhanced capabilities such as RLHF, larger context windows, safety‑usefulness trade‑offs, performance gains, download and fine‑tuning procedures, and how they increasingly rival proprietary models like GPT‑4.

AIChatGLM2Llama2
0 likes · 10 min read
Overview of Open‑Source Large Language Models: Llama 2, ChatGLM 2, Usage, Fine‑Tuning and Comparison
Programmer DD
Programmer DD
May 6, 2023 · Artificial Intelligence

Can Open‑Source LLMs Overtake Google and OpenAI in the AI Arms Race?

An analysis of a leaked Google internal document reveals how open‑source large language models, low‑cost fine‑tuning techniques like LoRA, and rapid community innovation are reshaping the AI competition, challenging the dominance of both Google and OpenAI and prompting a strategic rethink.

AI competitionAI strategyGoogle
0 likes · 15 min read
Can Open‑Source LLMs Overtake Google and OpenAI in the AI Arms Race?
Architect
Architect
Apr 27, 2023 · Artificial Intelligence

Survey of Large Language Model Research: From GPT‑1 to ChatGPT and Open‑Source Alternatives

This article provides a comprehensive overview of the development of large language models, reviewing classic papers from GPT‑1 through GPT‑4, discussing open‑source implementations such as LLaMA, Alpaca, GLM, and ChatGLM, and analyzing training methods, datasets, and future research directions.

AI researchGPTlarge language models
0 likes · 36 min read
Survey of Large Language Model Research: From GPT‑1 to ChatGPT and Open‑Source Alternatives
21CTO
21CTO
Apr 9, 2023 · Artificial Intelligence

8 Open-Source ChatGPT Alternatives You Can Deploy Today

This article surveys eight popular open‑source ChatGPT alternatives, detailing each model’s size, training data, performance relative to proprietary systems, and providing links to code repositories, demos, and papers for developers interested in building or researching large language models.

AI researchChatGPT alternativesmodel comparison
0 likes · 8 min read
8 Open-Source ChatGPT Alternatives You Can Deploy Today
21CTO
21CTO
Mar 17, 2023 · Artificial Intelligence

Exploring OpenChatKit: The Open-Source Alternative to ChatGPT

OpenChatKit, released by Together Computer in March 2023, is an open‑source ChatGPT‑like large language model built on GPT‑NeoX‑20B, offering developers fine‑tuned conversational abilities, a modular architecture, retrieval system, and content‑filtering, while also outlining its current limitations and future potential.

AIChatGPT alternativeOpenChatKit
0 likes · 7 min read
Exploring OpenChatKit: The Open-Source Alternative to ChatGPT