Tagged articles

LLaMA-Factory

12 articles · Page 1 of 1

Apr 28, 2026 · Artificial Intelligence

Zero‑Code Fine‑Tuning Hundreds of Large Models with the LLaMA‑Factory MLU Image

This article provides a step‑by‑step guide to deploying the LLaMA‑Factory MLU image on Cambricon MLU hardware, covering environment checks, downloading the modified source package, configuring Python dependencies, and running both the Web UI and command‑line fine‑tuning for models such as Qwen2.5‑0.5B.

CLICambriconLLM

0 likes · 7 min read

Zero‑Code Fine‑Tuning Hundreds of Large Models with the LLaMA‑Factory MLU Image

Machine Heart

Apr 15, 2026 · Artificial Intelligence

DataFlex: An Industrial‑Grade Dynamic Data Training System for Large Models

DataFlex, built on LLaMA‑Factory, offers a unified, reproducible infrastructure that dynamically selects, mixes, and re‑weights training data, turning data into a controllable optimization object and delivering measurable gains in training efficiency and model performance for large‑scale AI models.

Data-centric AIDataFlexDynamic Data Training

0 likes · 14 min read

DataFlex: An Industrial‑Grade Dynamic Data Training System for Large Models

Baidu Intelligent Cloud Tech Hub

Jan 27, 2026 · Artificial Intelligence

Deploying Qwen3 on Kunlun P800: Full‑Parameter DPO Training and Inference Guide

This guide walks through setting up a Kunlun P800 XPU host, preparing Docker containers, deploying Qwen3‑8B/‑32B/‑VL models with vLLM‑Kunlun, benchmarking performance, and running full‑parameter DPO training using LLaMA‑Factory, providing scripts, configuration files, and troubleshooting tips for AI engineers.

DPOKunlun P800LLaMA-Factory

0 likes · 32 min read

Deploying Qwen3 on Kunlun P800: Full‑Parameter DPO Training and Inference Guide

AI Algorithm Path

Dec 23, 2025 · Artificial Intelligence

Fine‑Tuning Qwen‑Video‑8B with LLaMA‑Factory for Domain‑Specific Video Understanding

This article details how the Qwen‑Video‑8B model, built on Qwen3‑VL‑8B‑Instruct, is fine‑tuned with the LLaMA‑Factory framework using a curated city‑scenery dataset, addresses challenges of domain knowledge, temporal modeling and multimodal fusion, and demonstrates improved video captioning across baseline, English‑fine‑tuned and Chinese‑fine‑tuned versions.

AI fine-tuningLLaMA-FactoryLoRA

0 likes · 10 min read

Fine‑Tuning Qwen‑Video‑8B with LLaMA‑Factory for Domain‑Specific Video Understanding

Tencent Technical Engineering

Jul 1, 2025 · Information Security

How Wukong AI Agent Uncovered a Critical RCE Vulnerability in LLaMA‑Factory (CVE‑2025‑53002)

This article details how the Wukong AI Agent automatically audited the popular LLaMA‑Factory project, discovered a high‑severity remote code execution vulnerability (CVE‑2025‑53002) caused by unsafe torch.load usage, reported it to the maintainers, and demonstrated the official fix that adds a secure weights_only flag.

AI securityCVE-2025-53002LLaMA-Factory

0 likes · 8 min read

How Wukong AI Agent Uncovered a Critical RCE Vulnerability in LLaMA‑Factory (CVE‑2025‑53002)

Network Intelligence Research Center (NIRC)

May 27, 2025 · Artificial Intelligence

Simplify Large‑Model Fine‑Tuning with LLaMA‑Factory

This article walks through using LLaMA‑Factory—a unified framework that supports over 100 LLMs—to install dependencies, prepare Alpaca‑style datasets, perform LoRA fine‑tuning, run inference, and export the tuned model, all with concrete command‑line examples.

GitHubLLaMA-FactoryLoRA

0 likes · 6 min read

Simplify Large‑Model Fine‑Tuning with LLaMA‑Factory

Fun with Large Models

Mar 14, 2025 · Artificial Intelligence

Fine‑Tune Your Own Large Model in 5 Minutes Without Writing Code (Using LLaMA‑Factory on Qwen)

This guide walks you through fine‑tuning a large language model without any coding by using LLaMA‑Factory, covering LoRA fundamentals, environment setup, dataset creation, parameter configuration, training, loss monitoring, model export, and a quick evaluation on the Qwen2.5‑0.5B model.

AnacondaLLaMA-FactoryLoRA

0 likes · 15 min read

Fine‑Tune Your Own Large Model in 5 Minutes Without Writing Code (Using LLaMA‑Factory on Qwen)

Tencent Cloud Developer

Mar 11, 2025 · Artificial Intelligence

Fine‑Tuning Local LLaMA‑Factory Models and Building Networked AI Applications

The article walks through preparing a GPU‑enabled environment, downloading and LoRA‑fine‑tuning a DeepSeek model with LLaMA‑Factory, merging the adapter, then wrapping the model in a web UI that queries a ChromaDB vector store via crawled web data, illustrating security‑focused use cases and forecasting domain‑specific LLM adoption.

AILLMLLaMA-Factory

0 likes · 17 min read

Fine‑Tuning Local LLaMA‑Factory Models and Building Networked AI Applications

Alibaba Cloud Big Data AI Platform

Feb 24, 2025 · Artificial Intelligence

Unlock Data+AI Fusion: Fine‑Tune Multimodal Models on DataWorks with GPU‑Ready Notebooks

This tutorial shows how to use Alibaba Cloud DataWorks' serverless GPU resource groups together with the open‑source LLaMA‑Factory framework to fine‑tune the Qwen2‑VL‑2B multimodal model for tourism‑domain Q&A, covering environment setup, dataset preparation, parameter configuration, training, and interactive inference.

DataWorksGPULLaMA-Factory

0 likes · 10 min read

Unlock Data+AI Fusion: Fine‑Tune Multimodal Models on DataWorks with GPU‑Ready Notebooks

DataFunSummit

Jan 6, 2025 · Artificial Intelligence

Efficient Large‑Model Training with LLaMA‑Factory: Overview, Techniques, and Applications

This article explains how to train large language models efficiently using LLaMA‑Factory, covering low‑resource training challenges, memory‑saving optimizations for parameters, gradients and activations, framework features, quick‑start guidance, performance tuning, real‑world case studies, and a detailed Q&A.

AIDeepSpeedLLaMA-Factory

0 likes · 10 min read

Efficient Large‑Model Training with LLaMA‑Factory: Overview, Techniques, and Applications

Alibaba Cloud Developer

Nov 1, 2024 · Artificial Intelligence

Fine‑Tune Qwen2‑VL with LLaMA Factory on Alibaba Cloud to Build a Tourism QA Bot

This guide walks you through using Alibaba Cloud's PAI‑DSW service together with the open‑source LLaMA Factory to fine‑tune the multimodal Qwen2‑VL model, set up a tourism‑focused knowledge‑question answering bot, and run inference via the Web UI, while covering environment setup, dataset handling, training parameters, and post‑experiment cleanup.

AIAlibaba CloudLLaMA-Factory

0 likes · 9 min read

Fine‑Tune Qwen2‑VL with LLaMA Factory on Alibaba Cloud to Build a Tourism QA Bot

Alibaba Cloud Big Data AI Platform

Aug 15, 2024 · Artificial Intelligence

Build Your Own AI “Zhuge Liang” Chatbot with LLaMA Factory on Alibaba Cloud

This guide walks you through using Alibaba Cloud PAI and the open‑source LLaMA Factory framework to fine‑tune a Llama‑3 8B model for Chinese dialogue and role‑playing, create a “Zhuge Liang” chatbot, evaluate its performance, and clean up resources.

Alibaba Cloud PAIChatbotLLaMA-Factory

0 likes · 12 min read

Build Your Own AI “Zhuge Liang” Chatbot with LLaMA Factory on Alibaba Cloud