Tagged articles
2 articles
Page 1 of 1
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Mar 7, 2025 · Artificial Intelligence

How Pai‑Megatron‑Patch Boosts Qwen2‑VL Multimodal Training Efficiency

This article explains how the Pai‑Megatron‑Patch toolkit enhances the usability and training performance of the Qwen2‑VL multimodal large model by introducing model‑parallel weight conversion, user‑friendly data loading, visual feature processing optimizations, optimizer offloading, and pipeline parallelism techniques, supported by extensive experimental analysis.

MegatronPipeline ParallelismQwen2-VL
0 likes · 25 min read
How Pai‑Megatron‑Patch Boosts Qwen2‑VL Multimodal Training Efficiency
Alibaba Cloud Developer
Alibaba Cloud Developer
Nov 1, 2024 · Artificial Intelligence

Fine‑Tune Qwen2‑VL with LLaMA Factory on Alibaba Cloud to Build a Tourism QA Bot

This guide walks you through using Alibaba Cloud's PAI‑DSW service together with the open‑source LLaMA Factory to fine‑tune the multimodal Qwen2‑VL model, set up a tourism‑focused knowledge‑question answering bot, and run inference via the Web UI, while covering environment setup, dataset handling, training parameters, and post‑experiment cleanup.

AIAlibaba CloudFine-tuning
0 likes · 9 min read
Fine‑Tune Qwen2‑VL with LLaMA Factory on Alibaba Cloud to Build a Tourism QA Bot