Tagged articles
41 articles
Page 1 of 1
Machine Heart
Machine Heart
May 18, 2026 · Artificial Intelligence

ICML 2026: Teaching Large Models to Think and Speak – Turning “When to Speak” into a Learnable Strategy

The paper “When to Think, When to Speak” introduces Side‑by‑Side Interleaved Reasoning, a learnable disclosure policy that lets LLMs alternate between internal thinking and user‑visible answer fragments, reducing content latency while preserving or improving accuracy on math and scientific QA benchmarks.

CoTLLMQwen3
0 likes · 10 min read
ICML 2026: Teaching Large Models to Think and Speak – Turning “When to Speak” into a Learnable Strategy
PaperAgent
PaperAgent
May 3, 2026 · Artificial Intelligence

Skill Graphs Reveal Why Training Diversity Beats Quantity for Terminal Agents

The paper shows that, instead of increasing the number of training tasks, controlling the diversity of scene‑skill combinations via a large‑scale Skill Graph dramatically improves terminal‑agent performance, with Qwen3‑32B surpassing a 480B model on the Terminal‑Bench 2.0 benchmark.

LLMQwen3Skill Graphs
0 likes · 9 min read
Skill Graphs Reveal Why Training Diversity Beats Quantity for Terminal Agents
SuanNi
SuanNi
Apr 13, 2026 · Artificial Intelligence

Deploy Qwen3 8B Model with vLLM: Step‑by‑Step Guide for Remote Inference

This guide walks you through deploying Alibaba’s open‑source Qwen‑3 8B model on the SumW platform using vLLM, covering environment activation, server launch with OpenAI‑compatible parameters, SSH tunneling for remote access, and Python client calls, while highlighting key configuration tips and common pitfalls.

Model DeploymentOpenAI APIPython SDK
0 likes · 6 min read
Deploy Qwen3 8B Model with vLLM: Step‑by‑Step Guide for Remote Inference
Tech Musings
Tech Musings
Mar 6, 2026 · Artificial Intelligence

How to Build a Qwen3 Chat UI with Chainlit: Hooks, Auth, and Persistence

Learn how to use the Chainlit Python framework to create a web‑based Qwen3 chat interface, covering its core features, hook mechanisms for data layers, authentication, chat start, message handling, streaming generation, performance monitoring, and session restoration, with full code examples and SQLite persistence.

AIChainlitPython
0 likes · 13 min read
How to Build a Qwen3 Chat UI with Chainlit: Hooks, Auth, and Persistence
Baidu Intelligent Cloud Tech Hub
Baidu Intelligent Cloud Tech Hub
Jan 27, 2026 · Artificial Intelligence

Deploying Qwen3 on Kunlun P800: Full‑Parameter DPO Training and Inference Guide

This guide walks through setting up a Kunlun P800 XPU host, preparing Docker containers, deploying Qwen3‑8B/‑32B/‑VL models with vLLM‑Kunlun, benchmarking performance, and running full‑parameter DPO training using LLaMA‑Factory, providing scripts, configuration files, and troubleshooting tips for AI engineers.

DPOInferenceKunlun P800
0 likes · 32 min read
Deploying Qwen3 on Kunlun P800: Full‑Parameter DPO Training and Inference Guide
AI Engineering
AI Engineering
Jan 19, 2026 · Artificial Intelligence

How We Built a Self‑Evolving AI System Without Reward Functions

The Oxford study demonstrates that large language models can self‑evolve through a four‑step deploy‑validate‑filter‑inherit loop, eliminating handcrafted reward functions, and achieves dramatic performance gains on Blocksworld, Rovers, and Sokoban while providing theoretical proof of equivalence to REINFORCE.

AI SafetyLLM planningQwen3
0 likes · 8 min read
How We Built a Self‑Evolving AI System Without Reward Functions
Fun with Large Models
Fun with Large Models
Jan 14, 2026 · Artificial Intelligence

Understanding Large Language Model Files: Structure, Tokens, and Inference with Qwen3

This article walks through the complete workflow of loading and running the open‑source Qwen3‑8B model, explaining each core file (weights, config, generation config, tokenizer), how the model tokenizes input, applies chat templates, generates responses, and decodes output, all illustrated with code and diagrams.

InferenceModelScopePython
0 likes · 16 min read
Understanding Large Language Model Files: Structure, Tokens, and Inference with Qwen3
ShiZhen AI
ShiZhen AI
Oct 24, 2025 · Artificial Intelligence

Why GPT‑5 Lost 72% While Chinese AI Models Gained 32% in the NOF1.AI Alpha Arena

The NOF1.AI Alpha Arena benchmark shows Chinese models like Qwen3 Max and DeepSeek out‑performing GPT‑5, delivering +32.42% and +22.46% returns respectively, while GPT‑5 suffers a -72.49% loss, highlighting the impact of trade frequency, risk control, and profit‑to‑loss ratios in AI‑driven crypto trading.

AI tradingAlpha ArenaDeepSeek
0 likes · 14 min read
Why GPT‑5 Lost 72% While Chinese AI Models Gained 32% in the NOF1.AI Alpha Arena
21CTO
21CTO
Sep 8, 2025 · Artificial Intelligence

Alibaba Unveils Qwen3‑Max‑Preview: First Trillion‑Parameter LLM and What It Means

Alibaba introduced the Qwen3‑Max‑Preview model, a trillion‑parameter LLM that boosts multilingual understanding, complex instruction handling, and tool use while cutting hallucinations, offers competitive benchmark scores, supports 262K context, and comes with tiered token‑based pricing that may limit broader adoption.

AIAlibabaLLM
0 likes · 5 min read
Alibaba Unveils Qwen3‑Max‑Preview: First Trillion‑Parameter LLM and What It Means
Wuming AI
Wuming AI
Sep 6, 2025 · Artificial Intelligence

Can Qwen3-Max-Preview Outperform Claude? A Deep Dive into China’s New 1‑T LLM

The article reviews Alibaba's 1‑trillion‑parameter Qwen3‑Max‑Preview model, comparing its benchmark scores, hallucination rate, math and coding accuracy, and SVG generation quality against Claude, Kimi K2, and DeepSeek, while providing usage links and real‑world user impressions.

AI BenchmarkQwen3SVG generation
0 likes · 4 min read
Can Qwen3-Max-Preview Outperform Claude? A Deep Dive into China’s New 1‑T LLM
Baobao Algorithm Notes
Baobao Algorithm Notes
Aug 1, 2025 · Artificial Intelligence

Unlocking Qwen3-Coder-30B: Features, Fast Start, and Agentic Coding Guide

The article introduces Qwen3‑Coder‑30B‑A3B‑Instruct (aka Qwen3‑Coder‑Flash), detailing its architecture, 256K‑to‑1M token context, agentic coding capabilities, installation steps with Transformers, sample code for tool use, optimal sampling parameters, and deployment tips across various runtimes.

AI coding assistantAgentic CodingDeep Learning
0 likes · 6 min read
Unlocking Qwen3-Coder-30B: Features, Fast Start, and Agentic Coding Guide
Baobao Algorithm Notes
Baobao Algorithm Notes
Jul 29, 2025 · Artificial Intelligence

Qwen3‑30B‑A3B‑Instruct‑2507: New Instruction Model with Boosted General and Multilingual Skills

The Qwen3‑30B‑A3B‑Instruct‑2507 model, an updated non‑thinking version of Qwen3‑30B‑A3B, delivers significant gains in instruction following, reasoning, multilingual knowledge coverage, and 256K context length, and its performance is benchmarked against leading LLMs across a wide range of tasks.

Instruction TuningMixture‑of‑ExpertsQwen3
0 likes · 6 min read
Qwen3‑30B‑A3B‑Instruct‑2507: New Instruction Model with Boosted General and Multilingual Skills
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Jun 27, 2025 · Artificial Intelligence

Build a Powerful AI Search RAG Application with PAI‑LangStudio, Qwen3 & Elasticsearch

This guide walks you through using the PAI‑LangStudio platform together with the Qwen3 large language model and Elasticsearch to create a full‑stack AI Search RAG solution, covering prerequisites, step‑by‑step configuration of model services, database connections, runtimes, knowledge bases, workflow creation, testing, and deployment for production use.

AI searchElasticsearchPAI‑LangStudio
0 likes · 11 min read
Build a Powerful AI Search RAG Application with PAI‑LangStudio, Qwen3 & Elasticsearch
Instant Consumer Technology Team
Instant Consumer Technology Team
Jun 12, 2025 · Artificial Intelligence

How to Build a Production-Ready RAG System with Qwen3 Embedding and Reranker Models

This guide walks through using Alibaba's new Qwen3-Embedding and Qwen3-Reranker models to build a two‑stage Retrieval‑Augmented Generation pipeline with Milvus, covering environment setup, data ingestion, vector indexing, reranking, and LLM‑driven answer generation, demonstrating production‑grade performance across multilingual queries.

EmbeddingLLMMilvus
0 likes · 19 min read
How to Build a Production-Ready RAG System with Qwen3 Embedding and Reranker Models
JavaEdge
JavaEdge
Jun 6, 2025 · Artificial Intelligence

Why Qwen3 Embedding Models Are Setting New Benchmarks in Text Representation

The article introduces the Qwen3 Embedding series, detailing its model variants, architecture, training methodology, multilingual support, performance metrics across several benchmarks, and future development plans, highlighting its superior generalization and flexibility for diverse AI applications.

AIEmbeddingModel Evaluation
0 likes · 9 min read
Why Qwen3 Embedding Models Are Setting New Benchmarks in Text Representation
Java Architecture Diary
Java Architecture Diary
Jun 5, 2025 · Artificial Intelligence

Unlock AI Reasoning: How Ollama’s New ‘Thinking’ Feature Works

Version 0.9.0 of Ollama introduces a ‘thinking’ control that lets users view and manage the AI model’s reasoning process, with detailed CLI commands, REST API usage, model support list, scripting options, and advanced Modelfile configurations for models like DeepSeek R1 and Qwen 3.

AI reasoningCLIDeepSeek
0 likes · 6 min read
Unlock AI Reasoning: How Ollama’s New ‘Thinking’ Feature Works
Architects' Tech Alliance
Architects' Tech Alliance
May 16, 2025 · Industry Insights

Can DeepSeek Survive the AI Arms Race? A Deep Dive into Its Challenges and Competition

The article provides a comprehensive analysis of DeepSeek’s rise in the large‑model market, examining its technical merits, security and customization hurdles, slowing innovation, fierce competition from OpenAI, Google and Alibaba’s Qwen3, as well as the fragility of its open‑source ecosystem and data preparation, ultimately questioning its long‑term viability.

AI modelsDeepSeekIndustry analysis
0 likes · 13 min read
Can DeepSeek Survive the AI Arms Race? A Deep Dive into Its Challenges and Competition
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
May 15, 2025 · Artificial Intelligence

How to Build a Qwen3‑Powered ChatBI Agent with PAI‑LangStudio and Hologres

This guide walks you through creating a ChatBI intelligent agent by integrating Alibaba's Qwen3 large language model with PAI‑LangStudio, configuring the Model Context Protocol (MCP) server, and connecting to Hologres real‑time data warehouse, covering setup, deployment, and verification steps for enterprise data analysis.

ChatBIHologresLLM
0 likes · 11 min read
How to Build a Qwen3‑Powered ChatBI Agent with PAI‑LangStudio and Hologres
Architect
Architect
May 14, 2025 · Artificial Intelligence

How Qwen3 Controls Hybrid Reasoning with the enable_thinking Parameter

This article explains how Qwen3 implements hybrid (fast/slow) reasoning by using the enable_thinking flag in the tokenizer's apply_chat_template method, detailing the underlying Jinja2 chat template, example prompts, the effect of toggling the flag, and design considerations for future autonomous thinking control.

AI modelChatMLHybrid Reasoning
0 likes · 13 min read
How Qwen3 Controls Hybrid Reasoning with the enable_thinking Parameter
Alibaba Cloud Developer
Alibaba Cloud Developer
May 14, 2025 · Artificial Intelligence

Deploy Alibaba’s Qwen3 LLM in 10 Minutes with Bailei Platform

Learn how to quickly set up Alibaba Cloud’s Bailei platform to call the open-source Qwen3 large language model, explore its cost‑effective performance, dual‑mode reasoning, multilingual support, and enhanced agent capabilities, and follow step‑by‑step instructions for API key configuration, Cherry Studio integration, and tool‑calling setup.

AI deploymentAlibaba CloudMLOps
0 likes · 6 min read
Deploy Alibaba’s Qwen3 LLM in 10 Minutes with Bailei Platform
Baidu Geek Talk
Baidu Geek Talk
May 12, 2025 · Artificial Intelligence

One‑Click Deployment of Baidu Qwen3 Large Models on Baidu Baige AI Platform

This guide explains how to use Baidu Baige's AI heterogeneous computing platform to deploy the eight‑model Qwen3 family—including dense and MoE variants—via a one‑click process, covering resource configuration, inference acceleration options, and post‑deployment service access.

AIBaidu BaigeCloud AI
0 likes · 4 min read
One‑Click Deployment of Baidu Qwen3 Large Models on Baidu Baige AI Platform
Eric Tech Circle
Eric Tech Circle
May 6, 2025 · Artificial Intelligence

How to Deploy Qwen3-30B-A3B Locally and Unlock Its Full AI Potential

This article walks through the complete process of installing the Qwen3-30B-A3B large language model on a personal computer using LM Studio, evaluates its reasoning, creative, multilingual, and coding abilities with detailed prompts, and shares practical tips for optimizing local deployment and prompt design.

AI EvaluationLM StudioPrompt engineering
0 likes · 12 min read
How to Deploy Qwen3-30B-A3B Locally and Unlock Its Full AI Potential
JavaEdge
JavaEdge
May 2, 2025 · Artificial Intelligence

Exploring Qwen3: Open‑Source LLM Features, Benchmarks, and Deployment Guides

This article introduces the Qwen3 family of open‑source large language models, details their architecture, parameter counts, multilingual support, and benchmark performance, and provides step‑by‑step instructions for deploying them with frameworks like SGLang, vLLM, and local runtimes such as Ollama and LMStudio.

AIAgentQwen3
0 likes · 22 min read
Exploring Qwen3: Open‑Source LLM Features, Benchmarks, and Deployment Guides
AI Algorithm Path
AI Algorithm Path
May 2, 2025 · Artificial Intelligence

Qwen3 Launch: Open-Source Models Redefine General AI

The Qwen3 series introduces eight open‑source large language models ranging from 0.6B to 235B parameters, combines dense and Mixture‑of‑Experts architectures, supports multimodal input, offers mixed inference modes, and demonstrates benchmark superiority over leading models such as OpenAI o1 and Gemini 2.5 Pro.

AI agentsBenchmarkMixture of Experts
0 likes · 10 min read
Qwen3 Launch: Open-Source Models Redefine General AI
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Apr 29, 2025 · Artificial Intelligence

Unlock Qwen3: Powerful LLM Features and Zero‑Code Deployment on Alibaba Cloud

This article introduces Qwen3, the latest dense and MOE large language model with dual‑mode reasoning, enhanced inference, multilingual support, and strong agent capabilities, and explains how Alibaba Cloud's PAI‑Model Gallery enables zero‑code, one‑click deployment and enterprise‑grade usage.

Alibaba CloudQwen3Zero‑Code Deployment
0 likes · 6 min read
Unlock Qwen3: Powerful LLM Features and Zero‑Code Deployment on Alibaba Cloud
Programmer DD
Programmer DD
Apr 29, 2025 · Artificial Intelligence

Why Qwen3 Is Redefining Open‑Source LLMs: Mixed‑Inference Power and Unmatched Performance

Qwen3, Alibaba’s latest open‑source large language model, introduces a pioneering mixed‑inference architecture that blends top‑tier reasoning and non‑reasoning capabilities, delivering record‑breaking benchmark scores, multilingual support for 119 languages, cost‑effective deployment, and a 128K context window, now accessible via Ollama and OpenRouter.

AI BenchmarkQwen3large language model
0 likes · 5 min read
Why Qwen3 Is Redefining Open‑Source LLMs: Mixed‑Inference Power and Unmatched Performance
DataFunTalk
DataFunTalk
Apr 29, 2025 · Artificial Intelligence

ChatGPT Adds Shopping Feature and Alibaba Unveils Qwen3 Model Series

OpenAI announced new shopping capabilities for ChatGPT, improving product recommendation, visual presentation, and direct purchase links, while Alibaba released the Qwen3 series of large and MoE language models with detailed parameter counts and benchmark performance, highlighting rapid advancements in consumer‑focused AI applications.

AIChatGPTQwen3
0 likes · 4 min read
ChatGPT Adds Shopping Feature and Alibaba Unveils Qwen3 Model Series
Java Architecture Diary
Java Architecture Diary
Apr 29, 2025 · Artificial Intelligence

Why Qwen3 Is the New Powerhouse in Open‑Source AI Models

Qwen3 introduces a suite of open‑source models—from a 235B expert model to compact 0.6B versions—offering competitive performance against top proprietary models, multilingual support, flexible thinking modes, and low deployment requirements, with detailed usage instructions via Ollama and OpenRouter.

OllamaQwen3large language model
0 likes · 8 min read
Why Qwen3 Is the New Powerhouse in Open‑Source AI Models
Baobao Algorithm Notes
Baobao Algorithm Notes
Apr 28, 2025 · Artificial Intelligence

What Makes Qwen3 the Next Leap in Large Language Models?

The article announces Qwen3, detailing its flagship 235B and smaller MoE models, superior benchmark performance, extensive multilingual support, expanded pretraining data, four-stage post‑training, flexible thinking modes, deployment guides for SGLang, vLLM, Ollama, and future plans toward AGI‑level capabilities.

AI researchDeploymentQwen3
0 likes · 15 min read
What Makes Qwen3 the Next Leap in Large Language Models?