Tagged articles

Qwen

52 articles · Page 1 of 1

Jun 29, 2026 · Artificial Intelligence

DeepSeek’s DSpark Boosts AI Inference Speed Up to 400% with Speculative Decoding

DeepSeek’s open‑source DSpark applies speculative decoding to its V4 Flash and Pro models, delivering 51%‑400% inference throughput gains that vary by task, while also supporting other models such as Gemma and Qwen, positioning it as a versatile, cross‑model acceleration solution.

AI Inference AccelerationDeepSeekGemma

0 likes · 6 min read

DeepSeek’s DSpark Boosts AI Inference Speed Up to 400% with Speculative Decoding

IT Services Circle

Jun 26, 2026 · Artificial Intelligence

Where to Find Reliable Free Large‑Model APIs for Everyday Developers?

The author built a zero‑cost internal coding assistant using iFlytek's free Qwen3.6‑35B‑A3B and Qwen3.5‑35B‑A3B models, explains why these models were chosen over alternatives, provides a nine‑step guide to claim the free MaaS token quota, shares ready‑to‑run Python code, and reports real‑world performance across code generation, long‑document parsing, and multi‑turn conversations, while also outlining suitable user groups and an optional enterprise Token Plan.

APILarge Language ModelMaaS

0 likes · 12 min read

Where to Find Reliable Free Large‑Model APIs for Everyday Developers?

Machine Heart

Jun 24, 2026 · Artificial Intelligence

From Pixels to Words: A Native Vision-Language Model Unifies Images and Video

The paper introduces NEO‑ov, a native vision‑language model that discards external visual encoders, feeding raw pixels directly into a unified transformer, and demonstrates competitive performance on image, multi‑image, and video tasks—including fine‑grained perception and spatial reasoning—while outlining its three‑stage training pipeline and current limitations.

BenchmarkMultimodalQwen

0 likes · 13 min read

From Pixels to Words: A Native Vision-Language Model Unifies Images and Video

Machine Heart

Jun 16, 2026 · Industry Insights

Tencent Invests $20 M in Ex‑Alibaba AI Leader Lin Junyang’s New Lab, Valuing It at $2 B as He Seeks Next Round

Tencent has contributed $20 million to Lin Junyang’s newly founded AI Lab, bringing the post‑money valuation to roughly $2 billion, while the founder—formerly Alibaba’s youngest P10 technical executive and key figure behind the Qwen model series—already begins looking for a follow‑up funding round.

AI fundingAlibabaArtificial Intelligence

0 likes · 4 min read

Tencent Invests $20 M in Ex‑Alibaba AI Leader Lin Junyang’s New Lab, Valuing It at $2 B as He Seeks Next Round

Old Zhang's AI Learning

Jun 11, 2026 · Artificial Intelligence

Distilling Claude Opus: Qwen 9B Coding Model Runs on Consumer GPUs – Real‑World Benchmarks

The Qwopus3.5‑9B‑Coder model, fine‑tuned for agentic coding, tool calling and logical reasoning, offers three formats (Safetensors, GGUF, GGUF+MTP), runs on a 16 GB Mac mini via LM‑Studio, achieves up to 35% throughput gain with MTP, scores 85 on HermesAgent‑20, 100 on ToolCall‑15, and 53.89% on SWE‑bench, matching Claude Opus 4.6 in a 31‑tool adversarial test while highlighting its training tricks and current limitations.

LLM BenchmarkQwenQwopus

0 likes · 11 min read

Distilling Claude Opus: Qwen 9B Coding Model Runs on Consumer GPUs – Real‑World Benchmarks

Su San Talks Tech

Jun 9, 2026 · Artificial Intelligence

Zero‑Cost Unlimited‑Token Access to Qwen 3.6: A Step‑by‑Step Guide

This article explains how developers can bypass token‑cost barriers by using iFlytek’s MaaS platform to obtain free, unlimited‑token access to the Qwen 3.6‑35B‑A3B model, details the model’s MoE architecture and benchmark performance, and provides a complete Java integration tutorial with code samples and practical use‑case suggestions.

AIAPIJava

0 likes · 16 min read

Zero‑Cost Unlimited‑Token Access to Qwen 3.6: A Step‑by‑Step Guide

Java Tech Enthusiast

Jun 4, 2026 · Artificial Intelligence

How to Connect Codex to DeepSeek, Qwen and Other Third‑Party Models in Minutes

This step‑by‑step guide shows how to install CC Switch v3.16.0, add DeepSeek or Qwen as a provider, enable local routing, and switch Codex to these third‑party large language models, preserving the original Codex experience while reducing API costs.

AI modelsCC SwitchCodex

0 likes · 6 min read

How to Connect Codex to DeepSeek, Qwen and Other Third‑Party Models in Minutes

Old Zhang's AI Learning

May 23, 2026 · Artificial Intelligence

The Underrated Lifesaving Template for Qwen Local Deployment

This article analyzes the hidden pitfalls of Qwen's official Jinja chat template, explains how the community‑maintained Qwen‑Fixed‑Chat‑Templates v19 fixes rendering errors, KV‑Cache loss, token waste and agent dead‑locks, and provides step‑by‑step installation instructions for LM Studio, llama.cpp, vLLM and MLX.

Agent LoopChat TemplateKV cache

0 likes · 10 min read

The Underrated Lifesaving Template for Qwen Local Deployment

Old Zhang's AI Learning

May 23, 2026 · Artificial Intelligence

Qwopus 3.6‑27B‑v2: Trace‑Inversion Distillation Cuts Token Use by 36% and Boosts Accuracy

The Qwopus 3.6‑27B‑v2 model reconstructs full step‑by‑step reasoning from compressed Claude outputs using a Trace‑Inverter, creates two high‑quality SFT datasets, and achieves 35.9% token savings, a 2.57‑point accuracy gain on MMLU‑Pro, 75.25% success on SWE‑bench, while running on a single consumer‑grade RTX 5090.

GGUFMMLUQwen

0 likes · 11 min read

Qwopus 3.6‑27B‑v2: Trace‑Inversion Distillation Cuts Token Use by 36% and Boosts Accuracy

SuanNi

May 19, 2026 · Artificial Intelligence

Qwen 3.7 Debuts: Ranks 13th Globally and Tops China’s Model Leaderboard

Qwen 3.7‑Max‑Preview secures the 13th spot worldwide and the top position among Chinese models, while Qwen 3.7‑Plus‑Preview ranks 16th in vision, highlighting an accelerated release cadence, deeper technical depth across sub‑tasks, and a shift in China’s large‑model competition toward ecosystem control.

AI competitionChina AILarge Language Model

0 likes · 9 min read

Qwen 3.7 Debuts: Ranks 13th Globally and Tops China’s Model Leaderboard

DataFunTalk

May 19, 2026 · Artificial Intelligence

Qwen 3.7 Max Preview Lands: Rapid Dual‑Model Iteration Keeps China’s Lead in Text and Vision

The Qwen 3.7‑Max and Qwen 3.7‑Plus preview models debut with top‑15 global rankings in Arena, the only Chinese models in text and vision leaderboards, while a timeline analysis shows the Qwen series accelerating from 4‑6‑month releases to a 2‑3‑month cadence and introducing dense and MoE variants up to 235 B parameters.

AI benchmarkChinese AILarge Language Model

0 likes · 6 min read

Qwen 3.7 Max Preview Lands: Rapid Dual‑Model Iteration Keeps China’s Lead in Text and Vision

Old Zhang's AI Learning

May 13, 2026 · Artificial Intelligence

Why vLLM Now Leads Open‑Source LLM Inference Benchmarks

vLLM tops the Artificial Analysis ranking by delivering the highest throughput for DeepSeek V3.2, Qwen 3.5 397B, and MiniMax‑M2.5 on identical NVIDIA Blackwell Ultra hardware, thanks to extensive kernel‑fusion optimizations that remain in the main branch.

DeepSeekLLM InferenceQwen

0 likes · 7 min read

Why vLLM Now Leads Open‑Source LLM Inference Benchmarks

Machine Learning Algorithms & Natural Language Processing

May 6, 2026 · Artificial Intelligence

How Qwen’s Mid‑Training with Value‑Document Guides Slashes Error Rates

Researchers at Claude applied the MSM (mid‑training) approach to Qwen models, inserting a value‑document pre‑training phase before alignment fine‑tuning, which reduced misalignment rates from 68%/54% to 5%/7% and cut required fine‑tuning data by 40‑60×, demonstrating superior generalization when combined with standard alignment.

AI alignmentLarge Language ModelsMSM

0 likes · 6 min read

How Qwen’s Mid‑Training with Value‑Document Guides Slashes Error Rates

Old Zhang's AI Learning

May 3, 2026 · Artificial Intelligence

Alibaba’s Qwen‑Scope: A Brain‑Computer Interface for Qwen‑3.5‑27B

Qwen‑Scope adds a sparse autoencoder (SAE) to the Qwen‑3.5‑27B model, exposing a top‑K 50‑feature, residual‑stream hook across all 64 layers for interpretability, controllable generation, data analysis, and training diagnostics, while detailing installation, usage, and practical trade‑offs.

Large Language ModelQwenSAE

0 likes · 11 min read

Alibaba’s Qwen‑Scope: A Brain‑Computer Interface for Qwen‑3.5‑27B

SuanNi

Apr 22, 2026 · Artificial Intelligence

How Alibaba’s Open‑Source Qwen 3.6‑27B Outperforms a 15× Larger Predecessor

Alibaba’s newly released open‑source Qwen 3.6‑27B dense model, with 27 billion parameters, beats its 397 billion‑parameter predecessor across a suite of code‑generation and multimodal benchmarks, while offering easier deployment thanks to its pure‑dense architecture and native image‑video‑text capabilities.

BenchmarkDense ArchitectureLarge Language Model

0 likes · 5 min read

How Alibaba’s Open‑Source Qwen 3.6‑27B Outperforms a 15× Larger Predecessor

Old Zhang's AI Learning

Apr 22, 2026 · Artificial Intelligence

Qwen3.6-27B Open‑Source: How a 27B Dense Model Outperforms the 397B Giant

The newly released Qwen3.6-27B dense multimodal model, at just 27 B parameters, surpasses the 397 B flagship on most encoding benchmarks, offers up to 1 M token context, supports FP8 quantization, and can be deployed locally via vLLM, SGLang or Transformers with modest hardware.

27BBenchmarkDense Model

0 likes · 12 min read

Qwen3.6-27B Open‑Source: How a 27B Dense Model Outperforms the 397B Giant

SuanNi

Apr 21, 2026 · Artificial Intelligence

How Qwen3.6‑35B‑A3B Matches Dense Models with Only 30 B Active Parameters

The article analyzes Qwen3.6‑35B‑A3B’s MoE architecture, showing how its 30 B active parameters outperform larger dense models across programming, agent, and multimodal benchmarks, and examines the flagship Qwen3.6‑Max‑Preview’s substantial gains in world knowledge, instruction following, and third‑party rankings.

AI evaluationBenchmarkLarge Language Model

0 likes · 5 min read

How Qwen3.6‑35B‑A3B Matches Dense Models with Only 30 B Active Parameters

Design Hub

Apr 21, 2026 · Artificial Intelligence

Two Simultaneous Battlefronts Define the Past 24 Hours in AI, Not Just New Models

In the last 24 hours the AI landscape shifted not by a handful of new model releases but by two converging fronts—model‑level advances in agentic coding and product‑level moves that turn models into usable work systems—signaling deeper changes in competition and industry impact.

AI modelsClaudeKimi

0 likes · 14 min read

Two Simultaneous Battlefronts Define the Past 24 Hours in AI, Not Just New Models

Machine Learning Algorithms & Natural Language Processing

Apr 16, 2026 · Artificial Intelligence

Efficient Reasoning with Reward Shaping: Compressing Qwen 30B‑Series Chains by 20‑40%

The article analyzes how reward‑shaping techniques can shorten the chain‑of‑thought outputs of Qwen 30‑parameter series models by 20‑40% while preserving or slightly improving performance on AIME‑25 and out‑of‑distribution benchmarks, and it details the experimental design, strategic considerations, and practical insights behind this efficient reasoning approach.

Efficient InferenceQwenreinforcement learning

0 likes · 16 min read

Efficient Reasoning with Reward Shaping: Compressing Qwen 30B‑Series Chains by 20‑40%

Machine Heart

Apr 12, 2026 · Artificial Intelligence

LRT: Implicit Reasoning Chains Boost Speed and Accuracy by Removing Redundant Steps

Researchers introduce Latent Reasoning Tuning (LRT), a lightweight inference network that encodes explicit reasoning chains into fixed‑length latent vectors, eliminating thousands of decoding steps; experiments reveal substantial redundancy in traditional chains and demonstrate that LRT achieves faster, more accurate inference and outperforms existing efficient reasoning methods.

DeepSeekEfficient InferenceHybrid Reasoning

0 likes · 10 min read

LRT: Implicit Reasoning Chains Boost Speed and Accuracy by Removing Redundant Steps

Lao Guo's Learning Space

Apr 8, 2026 · Artificial Intelligence

2026 Qwen Model Comparison: Choose the Right Qwen for Your Mac Studio

An in‑depth 2026 comparative review of Alibaba’s Qwen series (Qwen2.5, Qwen3, Qwen3.5) evaluates architecture, performance, speed and VRAM usage on Mac Studio, ranks each variant, and provides concrete model‑selection guidance for different memory configurations, highlighting the MoE‑based Qwen3.5 as the optimal choice.

AI performanceLarge Language ModelMac Studio

0 likes · 9 min read

2026 Qwen Model Comparison: Choose the Right Qwen for Your Mac Studio

Test Development Learning Exchange

Mar 24, 2026 · Artificial Intelligence

Build a Test‑Specific AI Agent to Auto‑Generate Pytest Cases and Analyze Allure Reports

This guide presents an end‑to‑end solution for creating a test‑focused AI agent that indexes project code and defect data, integrates a large language model via LangChain, generates compliant Pytest cases, parses Allure reports, and offers deployment tips for seamless PyCharm integration.

AI AgentAllureLangChain

0 likes · 13 min read

Build a Test‑Specific AI Agent to Auto‑Generate Pytest Cases and Analyze Allure Reports

AI Engineer Programming

Mar 19, 2026 · Industry Insights

Chinese LLMs Surge Ahead: Token Usage Overtakes U.S. Models in 2026

In March 2026, OpenRouter recorded 9.55 trillion tokens consumed weekly, with Chinese models occupying six of the top‑10 slots, Qwen surpassing 1 billion downloads, and cost advantages that let domestic LLMs outpace U.S. counterparts in both performance and price.

AI costChinese LLMsMiniMax

0 likes · 9 min read

Chinese LLMs Surge Ahead: Token Usage Overtakes U.S. Models in 2026

AI Explorer

Mar 4, 2026 · Industry Insights

Qwen’s Lead Architect Steps Down: Who Will Steer China’s Top Open‑Source AI Flagship?

On March 4, 2026, Alibaba’s youngest P10 technical leader Lin Junyang announced his resignation with a nine‑word tweet, just hours after releasing four Qwen 3.5 models that earned Elon Musk’s praise, while two other core researchers also left, leaving the future of China’s leading open‑source AI flagship uncertain.

AI talent turnoverAlibabaChina AI

0 likes · 9 min read

Qwen’s Lead Architect Steps Down: Who Will Steer China’s Top Open‑Source AI Flagship?

Woodpecker Software Testing

Feb 27, 2026 · Artificial Intelligence

Automating WeChat Public Account Publishing with AI (DeepSeek & Qwen)

This article walks through building a Python pipeline that uses DeepSeek and Alibaba Qwen to generate AI‑written articles, creates cover images, and automatically saves them as drafts in a WeChat public account, with detailed environment setup, client implementations, fallback strategies, and deployment tips.

AIAutomationContent Generation

0 likes · 26 min read

Automating WeChat Public Account Publishing with AI (DeepSeek & Qwen)

Baobao Algorithm Notes

Feb 25, 2026 · Artificial Intelligence

Exploring Qwen 3.5: Small‑Scale MoE Models, Architecture, and Deployment Guides

This article reviews the three open‑source Qwen 3.5 models—including a 35B MoE, a 122B MoE, and a 27B dense version—detailing their parameter layouts, core attention designs, context length, inference performance, hardware requirements, and provides step‑by‑step code examples for loading them with Hugging Face Transformers and vLLM.

AILarge Language ModelMoE

0 likes · 10 min read

Exploring Qwen 3.5: Small‑Scale MoE Models, Architecture, and Deployment Guides

Alibaba Cloud Developer

Feb 13, 2026 · Operations

How AI‑Powered Chaterm Agent Skills Reduce a 20‑Minute Ops Task to 3 Minutes

This article explains how Chaterm's Agent Skills, powered by the Qwen large model, let you package operational expertise into reusable, executable skills that automatically diagnose and fix issues, turning a manual 20‑minute troubleshooting process into a three‑minute AI‑driven workflow.

AI OpsAgent SkillsChaterm

0 likes · 14 min read

How AI‑Powered Chaterm Agent Skills Reduce a 20‑Minute Ops Task to 3 Minutes

AI Algorithm Path

Feb 8, 2026 · Artificial Intelligence

Qwen Multi-Angle: An Open‑Source AI Tool for Full‑Perspective Image Reconstruction

The open‑source Qwen‑Image‑Edit‑2511‑Multiple‑Angles‑LoRA model can reconstruct images from 96 preset camera poses, letting users adjust distance, pitch and yaw to generate realistic multi‑angle views, with step‑by‑step usage instructions, example results, practical applications, and noted limitations.

AIOpen-sourceQwen

0 likes · 6 min read

Qwen Multi-Angle: An Open‑Source AI Tool for Full‑Perspective Image Reconstruction

DaTaobao Tech

Jan 30, 2026 · Artificial Intelligence

Human‑like LLM Replies for Live Digital Hosts: ASR‑Based Style Transfer and Reward Modeling

This article proposes an ASR‑driven pipeline that creates high‑quality AI‑reply vs. human‑like reply pairs, trains a rewrite model and a reward model, and uses GRPO reinforcement learning to generate natural, helpful, and less AI‑sounding responses in digital‑human live streaming, achieving 92% accuracy and 97% helpfulness while improving user experience.

ASR dataLLMQwen

0 likes · 20 min read

Human‑like LLM Replies for Live Digital Hosts: ASR‑Based Style Transfer and Reward Modeling

AI Insight Log

Jan 19, 2026 · Artificial Intelligence

Run Claude Code for Free? Ollama Adds Anthropic API Compatibility

Ollama v0.14.0 now supports the Anthropic API, letting you run Claude Code locally with open‑source models like Qwen or Llama without an API key, network, or cost, and the article provides a step‑by‑step setup, SDK examples, and an objective assessment of the approach.

Anthropic APIClaude CodeOllama

0 likes · 7 min read

Run Claude Code for Free? Ollama Adds Anthropic API Compatibility

PMTalk Product Manager Community

Dec 4, 2025 · Industry Insights

Three Chinese AI Giants, Three Strategies: Doubao, DeepSeek, and Qwen in 2025

In 2025 China's AI large‑model arena is sharply fragmenting, with ByteDance's Doubao leading user activity, DeepSeek dominating technical and international influence, and Alibaba's Qwen carving a unique full‑stack strategic edge, each pursuing distinct paths in technology, product and ecosystem competition.

AIChinaDeepSeek

0 likes · 11 min read

Three Chinese AI Giants, Three Strategies: Doubao, DeepSeek, and Qwen in 2025

Test Development Learning Exchange

Nov 18, 2025 · Artificial Intelligence

Auto‑Generate API Test Cases with Qwen AI and Python

This guide shows how to use the Qwen large language model to automatically generate pytest‑style API test scripts from an OpenAPI specification, covering setup, prompt design, model invocation, code extraction, and execution steps.

AI code generationAPI testingOpenAPI

0 likes · 9 min read

Auto‑Generate API Test Cases with Qwen AI and Python

HyperAI Super Neural

Nov 4, 2025 · Artificial Intelligence

On‑Device TTS Breakthrough: NeuTTS‑Air Achieves 3‑Second Audio Cloning with a 0.5B Model

NeuTTS‑Air, an open‑source on‑device text‑to‑speech model built on a 0.5B Qwen LLM and NeuCodec, reaches SOTA among open models, runs entirely on CPU, supports 3‑second voice cloning, and comes with a step‑by‑step tutorial for deployment on edge devices.

NeuCodecNeuTTS-AirQwen

0 likes · 5 min read

On‑Device TTS Breakthrough: NeuTTS‑Air Achieves 3‑Second Audio Cloning with a 0.5B Model

Alibaba Cloud Developer

Oct 13, 2025 · Mobile Development

Build an AI-Powered Flutter App with Qoder, Supabase, and Qwen Image Edit

Learn how to rapidly create a full-featured AI-driven Flutter mobile application that generates and edits 3D figurine images using Qoder's code generation, Alibaba Cloud ADB Supabase for backend-as-a-service, and the Qwen Image Edit model, all without building a traditional backend.

AIEdge FunctionFlutter

0 likes · 10 min read

Build an AI-Powered Flutter App with Qoder, Supabase, and Qwen Image Edit

Full-Stack Internet Architecture

Jun 18, 2025 · Artificial Intelligence

Master Spring AI Prompt Templates: Dynamic Travel Queries with DeepSeek & QWEN

Learn how to leverage Spring AI's prompt template feature to create flexible, variable-driven queries, and implement backend services using DeepSeek and QWEN models for dynamic travel recommendations, complete with code examples for interfaces, service implementations, and controller routing.

DeepSeekJavaPrompt templates

0 likes · 7 min read

Master Spring AI Prompt Templates: Dynamic Travel Queries with DeepSeek & QWEN

AI2ML AI to Machine Learning

Apr 17, 2025 · Artificial Intelligence

Inside Qwen: A Deep Dive into the Large Model’s Source Code

The article provides a comprehensive technical walkthrough of Qwen’s large‑model series, covering data preparation, tokenization, model tweaks, training settings, RLHF pipeline, Code‑Qwen specifics, Qwen2 and Qwen3 architectural changes, scaling‑law experiments, and detailed source‑code analysis with illustrative diagrams.

Large Language ModelMoEQwen

0 likes · 7 min read

Inside Qwen: A Deep Dive into the Large Model’s Source Code

Architects' Tech Alliance

Apr 1, 2025 · Artificial Intelligence

What’s New in Large Language Models? DeepSeek V3, Qwen2.5‑Omni, Gemini 2.5 Pro, and GPT‑4o Unpacked

This article reviews the latest updates from major LLM providers—DeepSeek V3’s parameter boost and longer context, Qwen2.5‑Omni’s open‑source multimodal 7B model, Google Gemini 2.5 Pro’s 1 M‑token window and multimodal prowess, and OpenAI GPT‑4o’s image generation and reduced pricing—highlighting technical specs, capabilities, and availability.

DeepSeekGPT-4oGemini

0 likes · 9 min read

What’s New in Large Language Models? DeepSeek V3, Qwen2.5‑Omni, Gemini 2.5 Pro, and GPT‑4o Unpacked

Code Mala Tang

Mar 31, 2025 · Artificial Intelligence

Unlocking LLM Power: A Hands‑On Guide to Function Calling with Mistral, Llama, and Qwen

This tutorial explains how large language models can use function calling to access real‑time data, walks through setting up a Flask endpoint, demonstrates integration with Mistral Small, Llama 3.2‑1B, and Qwen models, and provides complete Python code examples for end‑to‑end execution.

APIFunction CallingLLM

0 likes · 10 min read

Unlocking LLM Power: A Hands‑On Guide to Function Calling with Mistral, Llama, and Qwen

NewBeeNLP

Mar 18, 2025 · Interview Experience

How to Ace Multimodal Model Interviews at Taobao's Search AI Division

This article recounts a three‑stage interview for a multimodal large‑model position at Taobao's Search AI division, detailing typical questions on CLIP, LoRA, BLIP, Qwen‑VL, Transformer fundamentals, RLHF, and coding challenges, and offers insights on what interviewers focus on.

AICLIPLoRA

0 likes · 5 min read

How to Ace Multimodal Model Interviews at Taobao's Search AI Division

Baobao Algorithm Notes

Mar 16, 2025 · Artificial Intelligence

Can a 7B LLM Master Sudoku From Scratch Using Reinforcement Learning?

This article details how a 7B parameter language model, fine‑tuned with DeepSeek's GRPO reinforcement‑learning algorithm and a carefully crafted multi‑component reward system, learned to solve Sudoku puzzles without any cold‑start data, outperforming a comparable 3B model and revealing key insights for structured reasoning tasks.

AI trainingGRPOQwen

0 likes · 15 min read

Can a 7B LLM Master Sudoku From Scratch Using Reinforcement Learning?

Fun with Large Models

Mar 14, 2025 · Artificial Intelligence

Fine‑Tune Your Own Large Model in 5 Minutes Without Writing Code (Using LLaMA‑Factory on Qwen)

This guide walks you through fine‑tuning a large language model without any coding by using LLaMA‑Factory, covering LoRA fundamentals, environment setup, dataset creation, parameter configuration, training, loss monitoring, model export, and a quick evaluation on the Qwen2.5‑0.5B model.

AnacondaLLaMA-FactoryLoRA

0 likes · 15 min read

Fine‑Tune Your Own Large Model in 5 Minutes Without Writing Code (Using LLaMA‑Factory on Qwen)

Top Architect

Mar 9, 2025 · Artificial Intelligence

Alibaba Unveils Qwen QwQ-32B: A Compact Open‑Source LLM Rivaling DeepSeek

Alibaba has released the open‑source Qwen QwQ‑32B model, a 32‑billion‑parameter LLM that matches DeepSeek‑R1's performance while being deployable on consumer‑grade GPUs, and the announcement is accompanied by extensive promotional offers for AI‑related products and services.

AI benchmarkAlibabaLarge Language Model

0 likes · 7 min read

Alibaba Unveils Qwen QwQ-32B: A Compact Open‑Source LLM Rivaling DeepSeek

AI Product Manager Community

Mar 6, 2025 · Artificial Intelligence

Why Alibaba’s QwQ‑32B Rivals 670B Models with Just 32B Parameters

Alibaba’s newly released 32‑billion‑parameter QwQ‑32B model matches the performance of 670‑billion‑parameter rivals like DeepSeek‑R1, integrates agent‑based reasoning, runs on consumer hardware, and has sparked strong open‑source community adoption, as shown by benchmark results and download statistics.

AgentAlibabaLarge Language Model

0 likes · 6 min read

Why Alibaba’s QwQ‑32B Rivals 670B Models with Just 32B Parameters

DataFunTalk

Mar 2, 2025 · Artificial Intelligence

Implementing GRPO from Scratch with Distributed Reinforcement Learning on Qwen2.5-1.5B-Instruct

This tutorial explains how to build a distributed reinforcement‑learning pipeline using the GRPO algorithm, covering data preparation, evaluation and reward functions, multi‑GPU DataParallel implementation, and full fine‑tuning of the Qwen2.5‑1.5B‑Instruct model with PyTorch, FlashAttention2 and Weights & Biases.

AIGRPOPyTorch

0 likes · 10 min read

Implementing GRPO from Scratch with Distributed Reinforcement Learning on Qwen2.5-1.5B-Instruct

Ops Development & AI Practice

Feb 16, 2025 · Artificial Intelligence

Why FlashAttention Supercharges Qwen Models: A Technical Deep Dive

This article explains the FlashAttention algorithm, its memory‑efficient tiling and recomputation techniques, and how enabling the flash_attn flag dramatically speeds up Qwen‑series large models while outlining hardware, software requirements and potential trade‑offs.

FlashAttentionGPU OptimizationLarge Language Model

0 likes · 8 min read

Why FlashAttention Supercharges Qwen Models: A Technical Deep Dive

Java Tech Enthusiast

Feb 14, 2025 · Artificial Intelligence

Apple Partners with Alibaba to Develop AI Features for iPhone Users

Apple’s new Apple Intelligence platform, unveiled at WWDC24, will incorporate Alibaba’s Qwen 2.5 Max model to create China‑specific AI features for iPhone users, with a custom dataset and regulatory submission, marking a shift from overseas ChatGPT reliance to a domestic partnership.

AIAlibabaApple

0 likes · 3 min read

Apple Partners with Alibaba to Develop AI Features for iPhone Users

JavaEdge

Dec 1, 2024 · Artificial Intelligence

Exploring the Limits and Benchmarks of Qwen’s QwQ‑32B‑Preview AI Model

QwQ‑32B‑Preview, an experimental AI model from the Qwen team, showcases strong reasoning in math and programming while facing challenges like language switching, inference loops, safety concerns, and variable capabilities across domains, with benchmark scores ranging from 50% to over 90% on tests such as GPQA, AIME, MATH‑500, and LiveCodeBench.

AI benchmarkLLMQwen

0 likes · 7 min read

Exploring the Limits and Benchmarks of Qwen’s QwQ‑32B‑Preview AI Model

System Architect Go

Oct 17, 2024 · Artificial Intelligence

Running and Fine‑Tuning Large Language Models Locally with Ollama, Docker, and Cloud Resources

The author chronicles the challenges and solutions of running large language models locally using Ollama, experimenting with cloud GPUs on Google Colab, managing Python dependencies through Docker, and ultimately fine‑tuning a small Qwen model, providing a practical guide for AI enthusiasts.

DockerGoogle ColabLLM

0 likes · 6 min read

Running and Fine‑Tuning Large Language Models Locally with Ollama, Docker, and Cloud Resources

Ops Development Stories

Sep 19, 2024 · Artificial Intelligence

How to Connect Qwen LLMs with Higress AI Gateway: A Hands‑On Guide

This tutorial walks through setting up a local k3d cluster, installing Higress, and using its AI plugins—including AI Proxy, AI JSON formatter, AI Agent, and AI Statistics—to integrate and observe Alibaba Cloud's Qwen large language models across various use cases such as weather and flight queries.

AI PluginsAI gatewayHigress

0 likes · 30 min read

How to Connect Qwen LLMs with Higress AI Gateway: A Hands‑On Guide

Alibaba Cloud Native

May 30, 2024 · Cloud Native

Translate CS Textbooks Instantly with AI: A Hands‑On Higress Cloud‑Native Guide

This guide shows how to use free AI translation tools—Immersive Translate and OpenAI Translator—together with the Higress cloud‑native AI‑proxy plugin, configuring Docker, model mappings, and custom dictionaries to efficiently translate computer‑science textbooks like Rust and Crafting Interpreters, while comparing machine and human translations.

AI translationDockerHigress

0 likes · 11 min read

Translate CS Textbooks Instantly with AI: A Hands‑On Higress Cloud‑Native Guide

Alibaba Cloud Native

May 15, 2024 · Cloud Native

Build a Cloud‑Native Playground to Compare GPT‑4o and Qwen‑2.5 with NextChat and Higress

This article walks through setting up a cloud‑native test environment using the open‑source NextChat UI and Higress API gateway to let Qwen‑2.5 masquerade as GPT‑4o, enabling a side‑by‑side comparison of their responses while showcasing Higress’s streaming, hot‑update, and security features for AI workloads.

AI gatewayDockerGPT-4o

0 likes · 8 min read

Build a Cloud‑Native Playground to Compare GPT‑4o and Qwen‑2.5 with NextChat and Higress

Baobao Algorithm Notes

Mar 28, 2024 · Artificial Intelligence

How Qwen1.5‑MoE‑A2.7B Matches 70B LLM Performance with Only 2.7B Activated Parameters

Qwen1.5‑MoE‑A2.7B is a 2.7 billion‑parameter Mixture‑of‑Experts model that delivers performance comparable to leading 7 billion‑parameter LLMs while cutting training cost by 75% and boosting inference speed by 1.74×, and the article details its architecture, benchmarks, efficiency analysis, and deployment steps.

Large Language ModelMoEModel Benchmark

0 likes · 13 min read

How Qwen1.5‑MoE‑A2.7B Matches 70B LLM Performance with Only 2.7B Activated Parameters