Tagged articles
43 articles
Page 1 of 1
SuanNi
SuanNi
May 19, 2026 · Artificial Intelligence

Qwen 3.7 Debuts: Ranks 13th Globally and Tops China’s Model Leaderboard

Qwen 3.7‑Max‑Preview secures the 13th spot worldwide and the top position among Chinese models, while Qwen 3.7‑Plus‑Preview ranks 16th in vision, highlighting an accelerated release cadence, deeper technical depth across sub‑tasks, and a shift in China’s large‑model competition toward ecosystem control.

AI competitionChina AIModel Ranking
0 likes · 9 min read
Qwen 3.7 Debuts: Ranks 13th Globally and Tops China’s Model Leaderboard
DataFunTalk
DataFunTalk
May 19, 2026 · Artificial Intelligence

Qwen 3.7 Max Preview Lands: Rapid Dual‑Model Iteration Keeps China’s Lead in Text and Vision

The Qwen 3.7‑Max and Qwen 3.7‑Plus preview models debut with top‑15 global rankings in Arena, the only Chinese models in text and vision leaderboards, while a timeline analysis shows the Qwen series accelerating from 4‑6‑month releases to a 2‑3‑month cadence and introducing dense and MoE variants up to 235 B parameters.

AI BenchmarkChinese AIModel Iteration
0 likes · 6 min read
Qwen 3.7 Max Preview Lands: Rapid Dual‑Model Iteration Keeps China’s Lead in Text and Vision
Old Zhang's AI Learning
Old Zhang's AI Learning
May 13, 2026 · Artificial Intelligence

Why vLLM Now Leads Open‑Source LLM Inference Benchmarks

vLLM tops the Artificial Analysis ranking by delivering the highest throughput for DeepSeek V3.2, Qwen 3.5 397B, and MiniMax‑M2.5 on identical NVIDIA Blackwell Ultra hardware, thanks to extensive kernel‑fusion optimizations that remain in the main branch.

DeepSeekLLM inferenceQwen
0 likes · 7 min read
Why vLLM Now Leads Open‑Source LLM Inference Benchmarks
Machine Learning Algorithms & Natural Language Processing
Machine Learning Algorithms & Natural Language Processing
May 6, 2026 · Artificial Intelligence

How Qwen’s Mid‑Training with Value‑Document Guides Slashes Error Rates

Researchers at Claude applied the MSM (mid‑training) approach to Qwen models, inserting a value‑document pre‑training phase before alignment fine‑tuning, which reduced misalignment rates from 68%/54% to 5%/7% and cut required fine‑tuning data by 40‑60×, demonstrating superior generalization when combined with standard alignment.

AI AlignmentLarge Language ModelsMSM
0 likes · 6 min read
How Qwen’s Mid‑Training with Value‑Document Guides Slashes Error Rates
SuanNi
SuanNi
Apr 22, 2026 · Artificial Intelligence

How Alibaba’s Open‑Source Qwen 3.6‑27B Outperforms a 15× Larger Predecessor

Alibaba’s newly released open‑source Qwen 3.6‑27B dense model, with 27 billion parameters, beats its 397 billion‑parameter predecessor across a suite of code‑generation and multimodal benchmarks, while offering easier deployment thanks to its pure‑dense architecture and native image‑video‑text capabilities.

Dense ArchitectureMultimodalQwen
0 likes · 5 min read
How Alibaba’s Open‑Source Qwen 3.6‑27B Outperforms a 15× Larger Predecessor
SuanNi
SuanNi
Apr 21, 2026 · Artificial Intelligence

How Qwen3.6‑35B‑A3B Matches Dense Models with Only 30 B Active Parameters

The article analyzes Qwen3.6‑35B‑A3B’s MoE architecture, showing how its 30 B active parameters outperform larger dense models across programming, agent, and multimodal benchmarks, and examines the flagship Qwen3.6‑Max‑Preview’s substantial gains in world knowledge, instruction following, and third‑party rankings.

AI EvaluationMixture of ExpertsQwen
0 likes · 5 min read
How Qwen3.6‑35B‑A3B Matches Dense Models with Only 30 B Active Parameters
Design Hub
Design Hub
Apr 21, 2026 · Artificial Intelligence

Two Simultaneous Battlefronts Define the Past 24 Hours in AI, Not Just New Models

In the last 24 hours the AI landscape shifted not by a handful of new model releases but by two converging fronts—model‑level advances in agentic coding and product‑level moves that turn models into usable work systems—signaling deeper changes in competition and industry impact.

AI modelsAgentic CodingClaude
0 likes · 14 min read
Two Simultaneous Battlefronts Define the Past 24 Hours in AI, Not Just New Models
Machine Learning Algorithms & Natural Language Processing
Machine Learning Algorithms & Natural Language Processing
Apr 16, 2026 · Artificial Intelligence

Efficient Reasoning with Reward Shaping: Compressing Qwen 30B‑Series Chains by 20‑40%

The article analyzes how reward‑shaping techniques can shorten the chain‑of‑thought outputs of Qwen 30‑parameter series models by 20‑40% while preserving or slightly improving performance on AIME‑25 and out‑of‑distribution benchmarks, and it details the experimental design, strategic considerations, and practical insights behind this efficient reasoning approach.

Efficient InferenceQwenReinforcement Learning
0 likes · 16 min read
Efficient Reasoning with Reward Shaping: Compressing Qwen 30B‑Series Chains by 20‑40%
Machine Heart
Machine Heart
Apr 12, 2026 · Artificial Intelligence

LRT: Implicit Reasoning Chains Boost Speed and Accuracy by Removing Redundant Steps

Researchers introduce Latent Reasoning Tuning (LRT), a lightweight inference network that encodes explicit reasoning chains into fixed‑length latent vectors, eliminating thousands of decoding steps; experiments reveal substantial redundancy in traditional chains and demonstrate that LRT achieves faster, more accurate inference and outperforms existing efficient reasoning methods.

DeepSeekEfficient InferenceHybrid Reasoning
0 likes · 10 min read
LRT: Implicit Reasoning Chains Boost Speed and Accuracy by Removing Redundant Steps
Lao Guo's Learning Space
Lao Guo's Learning Space
Apr 8, 2026 · Artificial Intelligence

2026 Qwen Model Comparison: Choose the Right Qwen for Your Mac Studio

An in‑depth 2026 comparative review of Alibaba’s Qwen series (Qwen2.5, Qwen3, Qwen3.5) evaluates architecture, performance, speed and VRAM usage on Mac Studio, ranks each variant, and provides concrete model‑selection guidance for different memory configurations, highlighting the MoE‑based Qwen3.5 as the optimal choice.

AI PerformanceMac StudioMoE
0 likes · 9 min read
2026 Qwen Model Comparison: Choose the Right Qwen for Your Mac Studio
Test Development Learning Exchange
Test Development Learning Exchange
Mar 24, 2026 · Artificial Intelligence

Build a Test‑Specific AI Agent to Auto‑Generate Pytest Cases and Analyze Allure Reports

This guide presents an end‑to‑end solution for creating a test‑focused AI agent that indexes project code and defect data, integrates a large language model via LangChain, generates compliant Pytest cases, parses Allure reports, and offers deployment tips for seamless PyCharm integration.

AI AgentAllureLangChain
0 likes · 13 min read
Build a Test‑Specific AI Agent to Auto‑Generate Pytest Cases and Analyze Allure Reports
AI Explorer
AI Explorer
Mar 4, 2026 · Industry Insights

Qwen’s Lead Architect Steps Down: Who Will Steer China’s Top Open‑Source AI Flagship?

On March 4, 2026, Alibaba’s youngest P10 technical leader Lin Junyang announced his resignation with a nine‑word tweet, just hours after releasing four Qwen 3.5 models that earned Elon Musk’s praise, while two other core researchers also left, leaving the future of China’s leading open‑source AI flagship uncertain.

AI talent turnoverAlibabaChina AI
0 likes · 9 min read
Qwen’s Lead Architect Steps Down: Who Will Steer China’s Top Open‑Source AI Flagship?
Woodpecker Software Testing
Woodpecker Software Testing
Feb 27, 2026 · Artificial Intelligence

Automating WeChat Public Account Publishing with AI (DeepSeek & Qwen)

This article walks through building a Python pipeline that uses DeepSeek and Alibaba Qwen to generate AI‑written articles, creates cover images, and automatically saves them as drafts in a WeChat public account, with detailed environment setup, client implementations, fallback strategies, and deployment tips.

Content GenerationDeepSeekPython
0 likes · 26 min read
Automating WeChat Public Account Publishing with AI (DeepSeek & Qwen)
Baobao Algorithm Notes
Baobao Algorithm Notes
Feb 25, 2026 · Artificial Intelligence

Exploring Qwen 3.5: Small‑Scale MoE Models, Architecture, and Deployment Guides

This article reviews the three open‑source Qwen 3.5 models—including a 35B MoE, a 122B MoE, and a 27B dense version—detailing their parameter layouts, core attention designs, context length, inference performance, hardware requirements, and provides step‑by‑step code examples for loading them with Hugging Face Transformers and vLLM.

MoEModel DeploymentQwen
0 likes · 10 min read
Exploring Qwen 3.5: Small‑Scale MoE Models, Architecture, and Deployment Guides
AI Algorithm Path
AI Algorithm Path
Feb 8, 2026 · Artificial Intelligence

Qwen Multi-Angle: An Open‑Source AI Tool for Full‑Perspective Image Reconstruction

The open‑source Qwen‑Image‑Edit‑2511‑Multiple‑Angles‑LoRA model can reconstruct images from 96 preset camera poses, letting users adjust distance, pitch and yaw to generate realistic multi‑angle views, with step‑by‑step usage instructions, example results, practical applications, and noted limitations.

Qwenaiimage editing
0 likes · 6 min read
Qwen Multi-Angle: An Open‑Source AI Tool for Full‑Perspective Image Reconstruction
DaTaobao Tech
DaTaobao Tech
Jan 30, 2026 · Artificial Intelligence

Human‑like LLM Replies for Live Digital Hosts: ASR‑Based Style Transfer and Reward Modeling

This article proposes an ASR‑driven pipeline that creates high‑quality AI‑reply vs. human‑like reply pairs, trains a rewrite model and a reward model, and uses GRPO reinforcement learning to generate natural, helpful, and less AI‑sounding responses in digital‑human live streaming, achieving 92% accuracy and 97% helpfulness while improving user experience.

ASR dataLLMQwen
0 likes · 20 min read
Human‑like LLM Replies for Live Digital Hosts: ASR‑Based Style Transfer and Reward Modeling
AI Insight Log
AI Insight Log
Jan 19, 2026 · Artificial Intelligence

Run Claude Code for Free? Ollama Adds Anthropic API Compatibility

Ollama v0.14.0 now supports the Anthropic API, letting you run Claude Code locally with open‑source models like Qwen or Llama without an API key, network, or cost, and the article provides a step‑by‑step setup, SDK examples, and an objective assessment of the approach.

Anthropic APIClaude CodeLocal-LLM
0 likes · 7 min read
Run Claude Code for Free? Ollama Adds Anthropic API Compatibility
AI2ML AI to Machine Learning
AI2ML AI to Machine Learning
Apr 17, 2025 · Artificial Intelligence

Inside Qwen: A Deep Dive into the Large Model’s Source Code

The article provides a comprehensive technical walkthrough of Qwen’s large‑model series, covering data preparation, tokenization, model tweaks, training settings, RLHF pipeline, Code‑Qwen specifics, Qwen2 and Qwen3 architectural changes, scaling‑law experiments, and detailed source‑code analysis with illustrative diagrams.

MoEModel architectureQwen
0 likes · 7 min read
Inside Qwen: A Deep Dive into the Large Model’s Source Code
Architects' Tech Alliance
Architects' Tech Alliance
Apr 1, 2025 · Artificial Intelligence

What’s New in Large Language Models? DeepSeek V3, Qwen2.5‑Omni, Gemini 2.5 Pro, and GPT‑4o Unpacked

This article reviews the latest updates from major LLM providers—DeepSeek V3’s parameter boost and longer context, Qwen2.5‑Omni’s open‑source multimodal 7B model, Google Gemini 2.5 Pro’s 1 M‑token window and multimodal prowess, and OpenAI GPT‑4o’s image generation and reduced pricing—highlighting technical specs, capabilities, and availability.

DeepSeekGPT-4oGemini
0 likes · 9 min read
What’s New in Large Language Models? DeepSeek V3, Qwen2.5‑Omni, Gemini 2.5 Pro, and GPT‑4o Unpacked
NewBeeNLP
NewBeeNLP
Mar 18, 2025 · Interview Experience

How to Ace Multimodal Model Interviews at Taobao's Search AI Division

This article recounts a three‑stage interview for a multimodal large‑model position at Taobao's Search AI division, detailing typical questions on CLIP, LoRA, BLIP, Qwen‑VL, Transformer fundamentals, RLHF, and coding challenges, and offers insights on what interviewers focus on.

CLIPLoRAMultimodal
0 likes · 5 min read
How to Ace Multimodal Model Interviews at Taobao's Search AI Division
Baobao Algorithm Notes
Baobao Algorithm Notes
Mar 16, 2025 · Artificial Intelligence

Can a 7B LLM Master Sudoku From Scratch Using Reinforcement Learning?

This article details how a 7B parameter language model, fine‑tuned with DeepSeek's GRPO reinforcement‑learning algorithm and a carefully crafted multi‑component reward system, learned to solve Sudoku puzzles without any cold‑start data, outperforming a comparable 3B model and revealing key insights for structured reasoning tasks.

AI trainingGRPOQwen
0 likes · 15 min read
Can a 7B LLM Master Sudoku From Scratch Using Reinforcement Learning?
Top Architect
Top Architect
Mar 9, 2025 · Artificial Intelligence

Alibaba Unveils Qwen QwQ-32B: A Compact Open‑Source LLM Rivaling DeepSeek

Alibaba has released the open‑source Qwen QwQ‑32B model, a 32‑billion‑parameter LLM that matches DeepSeek‑R1's performance while being deployable on consumer‑grade GPUs, and the announcement is accompanied by extensive promotional offers for AI‑related products and services.

AI BenchmarkAlibabaQwen
0 likes · 7 min read
Alibaba Unveils Qwen QwQ-32B: A Compact Open‑Source LLM Rivaling DeepSeek
AI Product Manager Community
AI Product Manager Community
Mar 6, 2025 · Artificial Intelligence

Why Alibaba’s QwQ‑32B Rivals 670B Models with Just 32B Parameters

Alibaba’s newly released 32‑billion‑parameter QwQ‑32B model matches the performance of 670‑billion‑parameter rivals like DeepSeek‑R1, integrates agent‑based reasoning, runs on consumer hardware, and has sparked strong open‑source community adoption, as shown by benchmark results and download statistics.

AlibabaQwenagent
0 likes · 6 min read
Why Alibaba’s QwQ‑32B Rivals 670B Models with Just 32B Parameters
DataFunTalk
DataFunTalk
Mar 2, 2025 · Artificial Intelligence

Implementing GRPO from Scratch with Distributed Reinforcement Learning on Qwen2.5-1.5B-Instruct

This tutorial explains how to build a distributed reinforcement‑learning pipeline using the GRPO algorithm, covering data preparation, evaluation and reward functions, multi‑GPU DataParallel implementation, and full fine‑tuning of the Qwen2.5‑1.5B‑Instruct model with PyTorch, FlashAttention2 and Weights & Biases.

Distributed TrainingGRPOPyTorch
0 likes · 10 min read
Implementing GRPO from Scratch with Distributed Reinforcement Learning on Qwen2.5-1.5B-Instruct
Java Tech Enthusiast
Java Tech Enthusiast
Feb 14, 2025 · Artificial Intelligence

Apple Partners with Alibaba to Develop AI Features for iPhone Users

Apple’s new Apple Intelligence platform, unveiled at WWDC24, will incorporate Alibaba’s Qwen 2.5 Max model to create China‑specific AI features for iPhone users, with a custom dataset and regulatory submission, marking a shift from overseas ChatGPT reliance to a domestic partnership.

AlibabaAppleArtificial Intelligence
0 likes · 3 min read
Apple Partners with Alibaba to Develop AI Features for iPhone Users
JavaEdge
JavaEdge
Dec 1, 2024 · Artificial Intelligence

Exploring the Limits and Benchmarks of Qwen’s QwQ‑32B‑Preview AI Model

QwQ‑32B‑Preview, an experimental AI model from the Qwen team, showcases strong reasoning in math and programming while facing challenges like language switching, inference loops, safety concerns, and variable capabilities across domains, with benchmark scores ranging from 50% to over 90% on tests such as GPQA, AIME, MATH‑500, and LiveCodeBench.

AI BenchmarkLLMModel Evaluation
0 likes · 7 min read
Exploring the Limits and Benchmarks of Qwen’s QwQ‑32B‑Preview AI Model
Ops Development Stories
Ops Development Stories
Sep 19, 2024 · Artificial Intelligence

How to Connect Qwen LLMs with Higress AI Gateway: A Hands‑On Guide

This tutorial walks through setting up a local k3d cluster, installing Higress, and using its AI plugins—including AI Proxy, AI JSON formatter, AI Agent, and AI Statistics—to integrate and observe Alibaba Cloud's Qwen large language models across various use cases such as weather and flight queries.

AI gatewayAI pluginsHigress
0 likes · 30 min read
How to Connect Qwen LLMs with Higress AI Gateway: A Hands‑On Guide
Alibaba Cloud Native
Alibaba Cloud Native
May 30, 2024 · Cloud Native

Translate CS Textbooks Instantly with AI: A Hands‑On Higress Cloud‑Native Guide

This guide shows how to use free AI translation tools—Immersive Translate and OpenAI Translator—together with the Higress cloud‑native AI‑proxy plugin, configuring Docker, model mappings, and custom dictionaries to efficiently translate computer‑science textbooks like Rust and Crafting Interpreters, while comparing machine and human translations.

AI translationDockerHigress
0 likes · 11 min read
Translate CS Textbooks Instantly with AI: A Hands‑On Higress Cloud‑Native Guide
Alibaba Cloud Native
Alibaba Cloud Native
May 15, 2024 · Cloud Native

Build a Cloud‑Native Playground to Compare GPT‑4o and Qwen‑2.5 with NextChat and Higress

This article walks through setting up a cloud‑native test environment using the open‑source NextChat UI and Higress API gateway to let Qwen‑2.5 masquerade as GPT‑4o, enabling a side‑by‑side comparison of their responses while showcasing Higress’s streaming, hot‑update, and security features for AI workloads.

AI gatewayDockerGPT-4o
0 likes · 8 min read
Build a Cloud‑Native Playground to Compare GPT‑4o and Qwen‑2.5 with NextChat and Higress
Baobao Algorithm Notes
Baobao Algorithm Notes
Mar 28, 2024 · Artificial Intelligence

How Qwen1.5‑MoE‑A2.7B Matches 70B LLM Performance with Only 2.7B Activated Parameters

Qwen1.5‑MoE‑A2.7B is a 2.7 billion‑parameter Mixture‑of‑Experts model that delivers performance comparable to leading 7 billion‑parameter LLMs while cutting training cost by 75% and boosting inference speed by 1.74×, and the article details its architecture, benchmarks, efficiency analysis, and deployment steps.

MoEModel BenchmarkQwen
0 likes · 13 min read
How Qwen1.5‑MoE‑A2.7B Matches 70B LLM Performance with Only 2.7B Activated Parameters