Tagged articles

Qwen3

41 articles · Page 1 of 1

May 18, 2026 · Artificial Intelligence

ICML 2026: Teaching Large Models to Think and Speak – Turning “When to Speak” into a Learnable Strategy

The paper “When to Think, When to Speak” introduces Side‑by‑Side Interleaved Reasoning, a learnable disclosure policy that lets LLMs alternate between internal thinking and user‑visible answer fragments, reducing content latency while preserving or improving accuracy on math and scientific QA benchmarks.

CoT · LLM · Qwen3

0 likes · 10 min read

PaperAgent

May 3, 2026 · Artificial Intelligence

Skill Graphs Reveal Why Training Diversity Beats Quantity for Terminal Agents

The paper shows that, instead of increasing the number of training tasks, controlling the diversity of scene‑skill combinations via a large‑scale Skill Graph dramatically improves terminal‑agent performance, with Qwen3‑32B surpassing a 480B model on the Terminal‑Bench 2.0 benchmark.

LLM · Qwen3 · Skill Graphs

0 likes · 9 min read

SuanNi

Apr 13, 2026 · Artificial Intelligence

Deploy Qwen3 8B Model with vLLM: Step‑by‑Step Guide for Remote Inference

This guide walks you through deploying Alibaba’s open‑source Qwen3 8B model on the SumW platform using vLLM, covering environment activation, server launch with OpenAI‑compatible parameters, SSH tunneling for remote access, and Python client calls, while highlighting key configuration tips and common pitfalls.

Model Deployment · OpenAI API · Python SDK

0 likes · 6 min read

Tech Musings

Mar 6, 2026 · Artificial Intelligence

How to Build a Qwen3 Chat UI with Chainlit: Hooks, Auth, and Persistence

Learn how to use the Chainlit Python framework to create a web‑based Qwen3 chat interface, covering its core features, hook mechanisms for data layers, authentication, chat start, message handling, streaming generation, performance monitoring, and session restoration, with full code examples and SQLite persistence.

AI · Chainlit · Hooks

0 likes · 13 min read

Cognitive Technology Team

Mar 2, 2026 · Artificial Intelligence

Stream Real-Time Chat with Ollama’s qwen3 Model via Async Python & LangChain

This guide walks you through installing Ollama, downloading the qwen3:4b model, and using Python’s async client to perform streaming chat requests, then shows how to integrate the same model with LangChain, including setup, initialization, and both regular and streaming output examples.

Async Python · Chatbot · LangChain

0 likes · 5 min read

Tech Musings

Jan 29, 2026 · Artificial Intelligence

Running Qwen3‑Embedding on CPU‑Only Machines and Storing Vectors in Redis 8

This guide explains how to run the Qwen3‑Embedding‑0.6B model on a CPU‑only server, configure key parameters, optionally use Intel Extension for PyTorch, and efficiently store the resulting vectors in Redis 8 with proper serialization and indexing.

CPU · Embedding · Python

0 likes · 8 min read

Baidu Intelligent Cloud Tech Hub

Jan 27, 2026 · Artificial Intelligence

Deploying Qwen3 on Kunlun P800: Full‑Parameter DPO Training and Inference Guide

This guide walks through setting up a Kunlun P800 XPU host, preparing Docker containers, deploying Qwen3‑8B/‑32B/‑VL models with vLLM‑Kunlun, benchmarking performance, and running full‑parameter DPO training using LLaMA‑Factory, providing scripts, configuration files, and troubleshooting tips for AI engineers.

DPO · Kunlun P800 · LLaMA-Factory

0 likes · 32 min read

AI Engineering

Jan 19, 2026 · Artificial Intelligence

How We Built a Self‑Evolving AI System Without Reward Functions

The Oxford study demonstrates that large language models can self‑evolve through a four‑step deploy‑validate‑filter‑inherit loop, eliminating handcrafted reward functions, and achieves dramatic performance gains on Blocksworld, Rovers, and Sokoban while providing theoretical proof of equivalence to REINFORCE.

AI safety · LLM planning · Qwen3

0 likes · 8 min read

Fun with Large Models

Jan 14, 2026 · Artificial Intelligence

Understanding Large Language Model Files: Structure, Tokens, and Inference with Qwen3

This article walks through the complete workflow of loading and running the open‑source Qwen3‑8B model, explaining each core file (weights, config, generation config, tokenizer), how the model tokenizes input, applies chat templates, generates responses, and decodes output, all illustrated with code and diagrams.

ModelScope · Python · Qwen3

0 likes · 16 min read

ShiZhen AI

Oct 24, 2025 · Artificial Intelligence

Why GPT‑5 Lost 72% While Chinese AI Models Gained 32% in the NOF1.AI Alpha Arena

The NOF1.AI Alpha Arena benchmark shows Chinese models like Qwen3 Max and DeepSeek out‑performing GPT‑5, delivering +32.42% and +22.46% returns respectively, while GPT‑5 suffers a -72.49% loss, highlighting the impact of trade frequency, risk control, and profit‑to‑loss ratios in AI‑driven crypto trading.

AI trading · Alpha Arena · DeepSeek

0 likes · 14 min read

21CTO

Sep 8, 2025 · Artificial Intelligence

Alibaba Unveils Qwen3‑Max‑Preview: First Trillion‑Parameter LLM and What It Means

Alibaba introduced the Qwen3‑Max‑Preview model, a trillion‑parameter LLM that boosts multilingual understanding, complex instruction handling, and tool use while cutting hallucinations, offers competitive benchmark scores, supports 262K context, and comes with tiered token‑based pricing that may limit broader adoption.

AI · Alibaba · LLM

0 likes · 5 min read

Wuming AI

Sep 6, 2025 · Artificial Intelligence

Can Qwen3-Max-Preview Outperform Claude? A Deep Dive into China’s New 1‑T LLM

The article reviews Alibaba's 1‑trillion‑parameter Qwen3‑Max‑Preview model, comparing its benchmark scores, hallucination rate, math and coding accuracy, and SVG generation quality against Claude, Kimi K2, and DeepSeek, while providing usage links and real‑world user impressions.

AI benchmark · Large Language Model · Qwen3

0 likes · 4 min read

Alibaba Cloud Native

Aug 29, 2025 · Cloud Native

Auto‑Generate Microservice Architecture Diagrams with Qwen3‑Coder and PlantUML

This guide walks through using the Lingma VS Code plugin with the Qwen3‑Coder model to analyze a complex microservice project, generate PlantUML architecture diagrams, refine them, and produce detailed API documentation, illustrating each step with commands and prompts.

AI code analysis · Cloud Native · Microservices

0 likes · 8 min read

Alibaba Cloud Native

Aug 27, 2025 · Artificial Intelligence

Build a Chat & Poetry Creation Agent with Qwen3 and Alipay MCP Using Tongyi Lingma

This guide walks through installing Tongyi Lingma, generating a Qwen3‑based conversational and poetry‑creation agent with Chainlit, integrating Alipay MCP for payment requests, troubleshooting common issues, and providing useful resource links for a complete AI‑agent development workflow.

AI · Chainlit · MCP

0 likes · 7 min read

Baobao Algorithm Notes

Aug 1, 2025 · Artificial Intelligence

Unlocking Qwen3-Coder-30B: Features, Fast Start, and Agentic Coding Guide

The article introduces Qwen3‑Coder‑30B‑A3B‑Instruct (aka Qwen3‑Coder‑Flash), detailing its architecture, 256K‑to‑1M token context, agentic coding capabilities, installation steps with Transformers, sample code for tool use, optimal sampling parameters, and deployment tips across various runtimes.

AI coding assistant · Large Language Model · Qwen3

0 likes · 6 min read

Baobao Algorithm Notes

Jul 29, 2025 · Artificial Intelligence

Qwen3‑30B‑A3B‑Instruct‑2507: New Instruction Model with Boosted General and Multilingual Skills

The Qwen3‑30B‑A3B‑Instruct‑2507 model, an updated non‑thinking version of Qwen3‑30B‑A3B, delivers significant gains in instruction following, reasoning, multilingual knowledge coverage, and 256K context length, and its performance is benchmarked against leading LLMs across a wide range of tasks.

Instruction Tuning · Mixture‑of‑Experts · Qwen3

0 likes · 6 min read

Alibaba Cloud Big Data AI Platform

Jun 27, 2025 · Artificial Intelligence

Build a Powerful AI Search RAG Application with PAI‑LangStudio, Qwen3 & Elasticsearch

This guide walks you through using the PAI‑LangStudio platform together with the Qwen3 large language model and Elasticsearch to create a full‑stack AI Search RAG solution, covering prerequisites, step‑by‑step configuration of model services, database connections, runtimes, knowledge bases, workflow creation, testing, and deployment for production use.

AI Search · Elasticsearch · Large Language Model

0 likes · 11 min read

Instant Consumer Technology Team

Jun 12, 2025 · Artificial Intelligence

How to Build a Production-Ready RAG System with Qwen3 Embedding and Reranker Models

This guide walks through using Alibaba's new Qwen3-Embedding and Qwen3-Reranker models to build a two‑stage Retrieval‑Augmented Generation pipeline with Milvus, covering environment setup, data ingestion, vector indexing, reranking, and LLM‑driven answer generation, demonstrating production‑grade performance across multilingual queries.

Embedding · LLM · Milvus

0 likes · 19 min read

Java Architecture Diary

Jun 9, 2025 · Artificial Intelligence

How Qwen3 Embedding Redefines Multilingual Vector Search Performance

This article examines the Qwen3 Embedding series released by Alibaba's Qwen team, detailing its architecture, multilingual capabilities, benchmark superiority across MTEB and C‑MTEB tests, and provides practical deployment guidance via Ollama and API integration.

AI · Embedding · Ollama

0 likes · 8 min read

JavaEdge

Jun 6, 2025 · Artificial Intelligence

Why Qwen3 Embedding Models Are Setting New Benchmarks in Text Representation

The article introduces the Qwen3 Embedding series, detailing its model variants, architecture, training methodology, multilingual support, performance metrics across several benchmarks, and future development plans, highlighting its superior generalization and flexibility for diverse AI applications.

AI · Embedding · Qwen3

0 likes · 9 min read

Java Architecture Diary

Jun 5, 2025 · Artificial Intelligence

Unlock AI Reasoning: How Ollama’s New ‘Thinking’ Feature Works

Ollama 0.9.0 introduces a ‘thinking’ control that lets users view and manage a model’s reasoning process; the article covers CLI commands, REST API usage, the list of supported models, scripting options, and advanced Modelfile configurations for models such as DeepSeek R1 and Qwen3.

AI reasoning · CLI · DeepSeek

0 likes · 6 min read

Alibaba Cloud Developer

May 20, 2025 · Artificial Intelligence

Unlock AI Data Integration with Qwen3, MCP & ComfyUI for Automated Content Creation

This article explores how to integrate the open‑source Qwen3‑235B‑A22B large model with Model Context Protocol (MCP) servers and ComfyUI, detailing architecture, Python implementation, deployment steps, third‑party media integration, practical use cases, limitations, and future prospects.

AI · ComfyUI · MCP

0 likes · 16 min read

Architects' Tech Alliance

May 16, 2025 · Industry Insights

Can DeepSeek Survive the AI Arms Race? A Deep Dive into Its Challenges and Competition

The article provides a comprehensive analysis of DeepSeek’s rise in the large‑model market, examining its technical merits, security and customization hurdles, slowing innovation, fierce competition from OpenAI, Google and Alibaba’s Qwen3, as well as the fragility of its open‑source ecosystem and data preparation, ultimately questioning its long‑term viability.

AI models · DeepSeek · Industry Analysis

0 likes · 13 min read

Alibaba Cloud Big Data AI Platform

May 15, 2025 · Artificial Intelligence

How to Build a Qwen3‑Powered ChatBI Agent with PAI‑LangStudio and Hologres

This guide walks you through creating a ChatBI intelligent agent by integrating Alibaba's Qwen3 large language model with PAI‑LangStudio, configuring the Model Context Protocol (MCP) server, and connecting to Hologres real‑time data warehouse, covering setup, deployment, and verification steps for enterprise data analysis.

ChatBI · Hologres · LLM

0 likes · 11 min read

Architect

May 14, 2025 · Artificial Intelligence

How Qwen3 Controls Hybrid Reasoning with the enable_thinking Parameter

This article explains how Qwen3 implements hybrid (fast/slow) reasoning by using the enable_thinking flag in the tokenizer's apply_chat_template method, detailing the underlying Jinja2 chat template, example prompts, the effect of toggling the flag, and design considerations for future autonomous thinking control.

AI model · ChatML · Hybrid Reasoning

0 likes · 13 min read

Alibaba Cloud Developer

May 14, 2025 · Artificial Intelligence

Deploy Alibaba’s Qwen3 LLM in 10 Minutes with the Bailian Platform

Learn how to quickly set up Alibaba Cloud’s Bailian platform to call the open-source Qwen3 large language model, explore its cost‑effective performance, dual‑mode reasoning, multilingual support, and enhanced agent capabilities, and follow step‑by‑step instructions for API key configuration, Cherry Studio integration, and tool‑calling setup.

AI Deployment · Alibaba Cloud · MLOps

0 likes · 6 min read

Baobao Algorithm Notes

May 13, 2025 · Artificial Intelligence

How Qwen3 Achieves Multi-Stage Pretraining, Long-Context, and Thought-Controlled RL

The article details Qwen3's three‑phase pretraining pipeline, long‑context extensions, a cold‑start long‑chain‑of‑thought dataset, reinforcement‑learning fine‑tuning with custom rewards, and a two‑stage distillation process that yields versatile, thought‑controlled language models.

Distillation · Qwen3 · long-context

0 likes · 15 min read

Fun with Large Models

May 13, 2025 · Artificial Intelligence

Build a MiniManus AI Agent in 10 Minutes with Qwen3, Qwen‑Agent, and MCP

This tutorial walks through registering API keys, setting up a conda environment, integrating the Firecrawl MCP server, writing Qwen‑Agent code, and extending the agent with Amap MCP to create a multi‑functional MiniManus AI application in roughly ten minutes.

Amap · Firecrawl · MCP

0 likes · 9 min read

Baidu Geek Talk

May 12, 2025 · Artificial Intelligence

One‑Click Deployment of Qwen3 Large Models on Baidu Baige AI Platform

This guide explains how to use Baidu Baige's AI heterogeneous computing platform to deploy the eight‑model Qwen3 family—including dense and MoE variants—via a one‑click process, covering resource configuration, inference acceleration options, and post‑deployment service access.

AI · Baidu Baige · Inference Optimization

0 likes · 4 min read

Fun with Large Models

May 8, 2025 · Artificial Intelligence

Building AI Agents with Qwen3 and Qwen‑Agent: A Hands‑On Guide to MCP Integration

This tutorial walks through registering a Qwen3 API key, setting up Qwen‑Agent, creating a multi‑turn chatbot, and integrating the MCP SQLite tool to enable natural‑language driven database operations, complete with step‑by‑step code examples and screenshots.

Anaconda · MCP · Python

0 likes · 11 min read

Alibaba Cloud Big Data AI Platform

May 6, 2025 · Artificial Intelligence

Build a Powerful RAG‑Enabled AI Q&A App with PAI‑LangStudio and Qwen3

This guide walks you through using Alibaba Cloud's PAI‑LangStudio together with the Qwen3 large language model to create an AI‑powered question‑answering system that combines Retrieval‑Augmented Generation, web search, secure deployment, and flexible customization for production use.

LangStudio · Milvus · Qwen3

0 likes · 10 min read

Eric Tech Circle

May 6, 2025 · Artificial Intelligence

How to Deploy Qwen3-30B-A3B Locally and Unlock Its Full AI Potential

This article walks through the complete process of installing the Qwen3-30B-A3B large language model on a personal computer using LM Studio, evaluates its reasoning, creative, multilingual, and coding abilities with detailed prompts, and shares practical tips for optimizing local deployment and prompt design.

AI evaluation · LM Studio · Prompt Engineering

0 likes · 12 min read

JavaEdge

May 2, 2025 · Artificial Intelligence

Exploring Qwen3: Open‑Source LLM Features, Benchmarks, and Deployment Guides

This article introduces the Qwen3 family of open‑source large language models, details their architecture, parameter counts, multilingual support, and benchmark performance, and provides step‑by‑step instructions for deploying them with frameworks like SGLang, vLLM, and local runtimes such as Ollama and LMStudio.

AI · Agent · Large Language Model

0 likes · 22 min read

AI Algorithm Path

May 2, 2025 · Artificial Intelligence

Qwen3 Launch: Open-Source Models Redefine General AI

The Qwen3 series introduces eight open‑source large language models ranging from 0.6B to 235B parameters, combines dense and Mixture‑of‑Experts architectures, supports multimodal input, offers mixed inference modes, and demonstrates benchmark superiority over leading models such as OpenAI o1 and Gemini 2.5 Pro.

AI Agents · Large Language Model · Mixture of Experts

0 likes · 10 min read

Alibaba Cloud Infrastructure

Apr 30, 2025 · Cloud Native

Deploying Qwen3-8B Large Language Model on Alibaba Cloud ACK with ACS GPU Acceleration

This guide explains how to prepare, deploy, and verify the Qwen3‑8B large language model on an Alibaba Cloud Container Service for Kubernetes (ACK) cluster using ACS GPU resources, covering prerequisites, model download, storage setup, Kubernetes manifests, and testing the inference service.

ACK · ACS · Cloud Native

0 likes · 8 min read

Alibaba Cloud Native

Apr 29, 2025 · Artificial Intelligence

Qwen3 Unveiled: 8 Open‑Source Hybrid Inference Models Redefine AI Capabilities

Qwen3 introduces eight fully open‑source hybrid inference models—including two MoE and six dense variants—offering massive parameter scales, dual reasoning modes, 119‑language support, and record‑breaking agent performance that rival top‑tier LLMs.

AI inference · Qwen3 · multilingual

0 likes · 4 min read

Alibaba Cloud Big Data AI Platform

Apr 29, 2025 · Artificial Intelligence

Unlock Qwen3: Powerful LLM Features and Zero‑Code Deployment on Alibaba Cloud

This article introduces Qwen3, the latest family of dense and MoE large language models, with dual‑mode reasoning, enhanced inference, multilingual support, and strong agent capabilities, and explains how Alibaba Cloud's PAI‑Model Gallery enables zero‑code, one‑click deployment and enterprise‑grade usage.

Alibaba Cloud · Large Language Model · Qwen3

0 likes · 6 min read

Programmer DD

Apr 29, 2025 · Artificial Intelligence

Why Qwen3 Is Redefining Open‑Source LLMs: Mixed‑Inference Power and Unmatched Performance

Qwen3, Alibaba’s latest open‑source large language model, introduces a pioneering mixed‑inference architecture that blends top‑tier reasoning and non‑reasoning capabilities, delivering record‑breaking benchmark scores, multilingual support for 119 languages, cost‑effective deployment, and a 128K context window, now accessible via Ollama and OpenRouter.

AI benchmark · Large Language Model · Qwen3

0 likes · 5 min read

DataFunTalk

Apr 29, 2025 · Artificial Intelligence

ChatGPT Adds Shopping Feature and Alibaba Unveils Qwen3 Model Series

OpenAI announced new shopping capabilities for ChatGPT, improving product recommendation, visual presentation, and direct purchase links, while Alibaba released the Qwen3 series of large and MoE language models with detailed parameter counts and benchmark performance, highlighting rapid advancements in consumer‑focused AI applications.

AI · ChatGPT · Large Language Model

0 likes · 4 min read

Java Architecture Diary

Apr 29, 2025 · Artificial Intelligence

Why Qwen3 Is the New Powerhouse in Open‑Source AI Models

Qwen3 introduces a suite of open‑source models—from a 235B expert model to compact 0.6B versions—offering competitive performance against top proprietary models, multilingual support, flexible thinking modes, and low deployment requirements, with detailed usage instructions via Ollama and OpenRouter.

Large Language Model · Ollama · Open-source AI

0 likes · 8 min read

Baobao Algorithm Notes

Apr 28, 2025 · Artificial Intelligence

What Makes Qwen3 the Next Leap in Large Language Models?

The article announces Qwen3, detailing its flagship 235B and smaller MoE models, superior benchmark performance, extensive multilingual support, expanded pretraining data, four-stage post‑training, flexible thinking modes, deployment guides for SGLang, vLLM, and Ollama, and future plans toward AGI‑level capabilities.

AI research · Qwen3 · deployment

0 likes · 15 min read
