Alibaba Cloud Developer
Alibaba Cloud Developer
May 28, 2025 · Artificial Intelligence

Unlocking LLM Fine‑Tuning: From Architecture to LoRA, DPO and Deployment

This article provides a comprehensive guide to large language model fine‑tuning, covering model architecture, parameter and memory calculations, prompt engineering, data construction, LoRA and PEFT techniques, reinforcement learning methods such as DPO, and practical deployment workflows on internal platforms.

Fine‑TuningLLMLoRA
0 likes · 21 min read
Unlocking LLM Fine‑Tuning: From Architecture to LoRA, DPO and Deployment
Baobao Algorithm Notes
Baobao Algorithm Notes
Jun 27, 2024 · Artificial Intelligence

Engineering Data for R&D Large Language Models: From Pre‑training to Prompt Design

This article presents a comprehensive guide to data engineering for research‑focused large language models, covering domain‑adaptive pre‑training, supervised fine‑tuning, retrieval‑augmented generation, dataset construction, data cleaning pipelines, token‑izer adaptation, and prompt engineering best practices to boost model performance in specialized tasks.

Domain AdaptationFine‑TuningLLM
0 likes · 20 min read
Engineering Data for R&D Large Language Models: From Pre‑training to Prompt Design