Unlock Qwen2: Fast‑Track LLM Fine‑Tuning and Deployment with Alibaba Cloud PAI‑QuickStart
Qwen2, Alibaba Cloud's newly open-sourced LLM series, offers five model sizes with GQA acceleration. Through PAI‑QuickStart, developers can fine‑tune, evaluate, and deploy these models with zero code, using cloud resources, the PAI Python SDK, and OpenAI‑compatible APIs.
Qwen2 (Tongyi Qianwen 2) is Alibaba Cloud's newly released open‑source large language model series, offering five model sizes (0.5B, 1.5B, 7B, 57B‑A14B MoE, 72B) with GQA for faster inference and lower memory usage.
Alibaba Cloud's AI platform PAI provides full technical support for the Qwen2 series. Through the PAI‑QuickStart component, developers and enterprise users can fine‑tune, evaluate, and deploy Qwen2 models with zero‑code or SDK workflows.
Runtime Requirements
Supported regions: Beijing, Shanghai, Shenzhen, Hangzhou, Ulanqab, etc.
Resource requirements: 0.5B/1.5B/7B models need V100/P100/T4 (≥16 GB); 72B model needs A100 (≥80 GB).
Model Deployment and Invocation
PAI provides a pre‑configured deployment for Qwen2‑7B‑Instruct. Users need only specify a service name and resource configuration, then deploy to the PAI‑EAS inference platform. The deployed service can be accessed via the ChatLLM WebUI or an OpenAI‑compatible API.
Model Fine‑Tuning
PAI offers a built‑in fine‑tuning algorithm for Qwen2‑7B‑Instruct. Training data should be a JSON file with “instruction” and “output” fields, e.g.:
[
  {
    "instruction": "You are a cardiologist, give advice for hypertension...",
    "output": "Recommend lifestyle changes, low-salt diet, etc."
  },
  {
    "instruction": "You are a respiratory doctor, how to treat a cold with white phlegm?",
    "output": "Use appropriate medication and avoid spicy food."
  }
]

Training requires V100/P100/T4 GPUs (≥16 GB). Hyper‑parameters can be adjusted or left at their defaults.
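As a quick sanity check before uploading, a dataset in this format can be written out and validated with a few lines of standard-library Python. This is a minimal sketch, not part of PAI's tooling; the file name train.json is illustrative.

```python
import json

# The two instruction/output pairs from the example above.
records = [
    {"instruction": "You are a cardiologist, give advice for hypertension...",
     "output": "Recommend lifestyle changes, low-salt diet, etc."},
    {"instruction": "You are a respiratory doctor, how to treat a cold with white phlegm?",
     "output": "Use appropriate medication and avoid spicy food."},
]

# Write the training file as a JSON array, as PAI's fine-tuning algorithm expects.
with open("train.json", "w", encoding="utf-8") as f:
    json.dump(records, f, ensure_ascii=False, indent=2)

# Read it back and check every record carries both required fields.
with open("train.json", encoding="utf-8") as f:
    data = json.load(f)

assert all({"instruction", "output"} <= r.keys() for r in data)
print(f"{len(data)} valid records")
```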
Model Evaluation
PAI supplies an evaluation algorithm for both the base and fine‑tuned models. Users can evaluate on custom datasets (JSONL with “question” and “answer” columns) or on public benchmarks such as MMLU, TriviaQA, HellaSwag, GSM8K, C‑Eval, and TruthfulQA.
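A custom evaluation dataset in the required JSONL shape (one JSON object per line with "question" and "answer" columns) can be assembled as follows. This is a minimal sketch with made-up sample content; the file name eval.jsonl is illustrative.

```python
import json

# Hypothetical question/answer pairs for a custom evaluation set.
samples = [
    {"question": "What attention mechanism does Qwen2 use for faster inference?",
     "answer": "Grouped-query attention (GQA)."},
    {"question": "How many model sizes does the Qwen2 series offer?",
     "answer": "Five."},
]

# JSONL: one serialized object per line.
with open("eval.jsonl", "w", encoding="utf-8") as f:
    for s in samples:
        f.write(json.dumps(s, ensure_ascii=False) + "\n")

# Verify each line parses and carries both required columns.
with open("eval.jsonl", encoding="utf-8") as f:
    rows = [json.loads(line) for line in f]

assert all({"question", "answer"} <= r.keys() for r in rows)
print(f"{len(rows)} evaluation rows")
```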
Python SDK Usage
The PAI Python SDK enables programmatic deployment, inference, and fine‑tuning. Example code for deploying and calling the model:
from pai.model import RegisteredModel

# Retrieve the PAI-registered Qwen2-7B-Instruct model
model = RegisteredModel(model_name="qwen2-7b-instruct", model_provider="pai")

# Deploy the model as an inference service on PAI-EAS
predictor = model.deploy(service_name="qwen2_7b_instruct_example")

# Obtain an OpenAI-compatible client bound to the deployed service
openai_client = predictor.openai()
resp = openai_client.chat.completions.create(
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "What is the meaning of life?"},
    ],
    model="default",
)
print(resp.choices[0].message.content)

# Delete the service when finished to release resources
predictor.delete_service()

Fine‑tuning via SDK:
# Get the built-in fine-tuning estimator and its default training inputs
est = model.get_estimator()
training_inputs = model.get_estimator_inputs()

# Optionally replace with custom data paths
# training_inputs.update({"train": "<OSS or local path>", "validation": "<path>"})

# Launch the training job, then print the output model location
est.fit(inputs=training_inputs)
print(est.model_data())

Conclusion
Qwen2 marks Alibaba Cloud’s latest progress in open‑source LLMs, offering multiple scales for diverse downstream tasks. With PAI‑QuickStart, developers can easily customize, evaluate, and deploy Qwen2 models, and access a rich collection of pre‑integrated models across various domains.
This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contact us and we will review it promptly.
Alibaba Cloud Big Data AI Platform
The Alibaba Cloud Big Data AI Platform builds on Alibaba’s leading cloud infrastructure, big‑data and AI engineering capabilities, scenario algorithms, and extensive industry experience to offer enterprises and developers a one‑stop, cloud‑native big‑data and AI capability suite. It boosts AI development efficiency, enables large‑scale AI deployment across industries, and drives business value.