Tag: LoRA

19 articles collected around this technical thread.

DataFunSummit
Jun 10, 2025 · Artificial Intelligence

How Quwan’s Kaitian Model Tackles Emotional AI for Social Apps – Architecture, Training Tricks, and Safety

Quwan Technology presents its Kaitian social large model, designed for personalized, emotionally rich, multimodal AI interactions, detailing its scene‑specific goals, CPT+SFT+RLHF training pipeline, data desensitization, LoRA fine‑tuning, evaluation methods, pruning, latency trade‑offs, safety mechanisms, and future feedback loops.

AI safety · LoRA · RLHF
13 min read
DaTaobao Tech
Jun 4, 2025 · Artificial Intelligence

Understanding Large Language Model Architecture, Parameters, Memory, Storage, and Fine‑Tuning Techniques

This article provides a comprehensive overview of large language models (LLMs), covering their transformer architecture, parameter counts, GPU memory and storage requirements, and detailed fine‑tuning methods such as prompt engineering, data construction, LoRA, PEFT, RLHF, and DPO, along with practical deployment and inference acceleration strategies.

DPO · Fine-tuning · LLM
17 min read
Architect
Apr 1, 2025 · Artificial Intelligence

When to Fine‑Tune Large Language Models vs. Relying on Prompting and RAG

The article explains why most projects should start with prompt engineering or simple agent workflows, outlines the scenarios where model fine‑tuning adds real value, compares fine‑tuning with Retrieval‑Augmented Generation, and offers practical criteria for deciding which approach to adopt.

AI deployment · LoRA · Prompt Engineering
9 min read
Alibaba Cloud Infrastructure
Mar 18, 2025 · Cloud Native

Gray Release of LoRA and Base Models Using ACK Gateway with AI Extension on Kubernetes

This guide explains how to deploy large language model inference services on a GPU-enabled Kubernetes cluster, configure ACK Gateway with AI Extension for intelligent routing and load balancing, and perform gray releases for both LoRA fine‑tuned models and base models such as QwQ‑32B and DeepSeek‑R1, including step‑by‑step commands and validation procedures.

ACK Gateway · AI inference · Cloud Native
25 min read
Cognitive Technology Team
Feb 24, 2025 · Artificial Intelligence

Fine-Tuning Large Language Models with LoRA: A Step-by-Step Guide and Code Example

This article demonstrates the before-and-after effects of fine‑tuning a large language model, explains the concept with analogies, details hardware setup, dataset preparation, LoRA configuration, training arguments, and provides complete Python code for a pure‑framework fine‑tuning workflow.

HuggingFace · LLM fine-tuning · LoRA
24 min read
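As a taste of what the article walks through, here is a minimal, dependency-free sketch of the LoRA idea itself: the full weight matrix stays frozen while two small low-rank factors are trained, and their scaled product is added back at merge time. All dimensions and values below are toy numbers, not the article's settings.

```python
def matmul(X, Y):
    # Plain-Python matrix multiply, enough for these toy sizes.
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]

d, r, alpha = 64, 2, 4              # hidden size, LoRA rank, scaling alpha
W = [[0.0] * d for _ in range(d)]   # frozen base weight (toy zeros)
B = [[1.0] * r for _ in range(d)]   # trainable down-projection, d x r
A = [[1.0] * d for _ in range(r)]   # trainable up-projection, r x d

scale = alpha / r                   # LoRA scales the update by alpha / r
delta = [[scale * v for v in row] for row in matmul(B, A)]
W_merged = [[w + dw for w, dw in zip(wr, dr)]
            for wr, dr in zip(W, delta)]   # merged weight for inference

full_params = d * d                 # params if W were trained directly
lora_params = d * r + r * d         # params LoRA actually trains
print(full_params, lora_params)     # prints: 4096 256
```

With d = 64 and r = 2 the trainable count drops from 4,096 to 256 parameters, which is the whole point of the method; real models just repeat this per attention projection.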
vivo Internet Technology
Feb 12, 2025 · Artificial Intelligence

Bidirectional Optimization of NLLB-200 and ChatGPT for Low-Resource Language Translation

The paper proposes a bidirectional optimization framework that fine‑tunes the low‑resource NLLB‑200 translation model with LoRA using data generated by ChatGPT, while also translating low‑resource prompts with NLLB before feeding them to LLMs, thereby improving multilingual translation quality yet requiring careful validation of noisy synthetic data.

Fine-tuning · LLM · LoRA
28 min read
360 Zhihui Cloud Developer
Jan 9, 2025 · Artificial Intelligence

Unlocking Efficient Large Model Fine‑Tuning: LoRA, LoRA+, rsLoRA, DoRA & PiSSA Explained

This article introduces the fundamentals of large‑model fine‑tuning, compares popular parameter‑efficient methods such as LoRA and its variants, presents experimental results on the Qwen2.5‑7B model, and discusses current challenges and future research directions.

AI research · LoRA · large model fine-tuning
17 min read
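One concrete difference among the variants the article compares fits in a few lines: plain LoRA scales its update by alpha / r, while rsLoRA rescales by alpha / sqrt(r) so the update does not shrink away as the rank grows. The numbers below are illustrative only, not the article's Qwen2.5-7B results.

```python
import math

alpha = 16
for r in (4, 16, 64):
    lora_scale = alpha / r                # vanishes quickly with rank
    rslora_scale = alpha / math.sqrt(r)   # decays much more slowly
    print(r, lora_scale, rslora_scale)
```

At rank 64 the plain LoRA scale has fallen to 0.25 while the rsLoRA scale is still 2.0, which is why rsLoRA tends to behave better at high ranks.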
DataFunSummit
Jan 6, 2025 · Artificial Intelligence

Efficient Large‑Model Training with LLaMA‑Factory: Overview, Techniques, and Applications

This article explains how to train large language models efficiently using LLaMA‑Factory, covering low‑resource training challenges, memory‑saving optimizations for parameters, gradients and activations, framework features, quick‑start guidance, performance tuning, real‑world case studies, and a detailed Q&A.

AI · DeepSpeed · LLaMA-Factory
10 min read
ZhongAn Tech Team
Nov 16, 2024 · Artificial Intelligence

Weekly AI Digest Issue 2: Video Generation, Large Models, AGI, and LoRA Fine‑Tuning

This weekly AI roundup discusses emerging video generation tools like PixelDance and Vidu 1.5, debates on the scaling limits of large models, AGI geopolitical considerations, and an MIT study comparing LoRA with full fine-tuning for domain adaptation.

AGI · AI · Fine-tuning
8 min read
Architecture and Beyond
Nov 2, 2024 · Artificial Intelligence

Step-by-Step Guide to Training a LoRA Model with Flux1_dev on ComfyUI

This tutorial walks programmers through preparing a GPU cloud environment, installing ComfyUI, downloading Flux1_dev models, integrating a custom LoRA, labeling generated images, and finally training the LoRA using ai‑toolkit, providing detailed commands, configuration tips, and practical cost estimates.

AI image generation · ComfyUI · Flux
12 min read
DataFunSummit
Sep 23, 2024 · Artificial Intelligence

TransLLM: A Framework for Cross‑Language Transfer of Conversational Large Language Models

This article presents TransLLM, a cross‑language migration framework that enables high‑quality conversational LLMs to be transferred to low‑resource languages by preserving advanced capabilities through Recovery KD, LoRA‑based continual pre‑training, and a translation‑thinking‑chain, with extensive experiments showing superior performance and safety over ChatGPT and GPT‑4.

LoRA · Safety · conversation LLM
22 min read
JD Tech Talk
Jul 9, 2024 · Artificial Intelligence

Getting Started with AI Image Generation Using Stable Diffusion for Promotional Posters

This guide introduces the fundamentals of AI image generation with Stable Diffusion, covering three main usage methods, the Draw Things desktop app, model types, samplers, prompts, and post‑processing techniques to create high‑quality promotional graphics for events like the 618 sale.

AI art · DrawThings · LoRA
11 min read
Practical DevOps Architecture
Jun 28, 2024 · Artificial Intelligence

Large Model (LLM) Training Curriculum – Weekly Topics and Resources

This article outlines a five‑week large‑model training curriculum, detailing weekly topics such as transformer fundamentals, encoder‑decoder architectures, self‑attention, LoRA fine‑tuning, and quantization, along with associated video lectures and PDF slide decks for developers.

AI · LLM · LoRA
3 min read
58 Tech
Jun 3, 2024 · Artificial Intelligence

Parameter-Efficient Fine-Tuning (PEFT) Methods for Large Language Models: LoRA, QLoRA, AdaLoRA, SoRA, and Training Acceleration with Unsloth

This article systematically analyzes popular parameter‑efficient fine‑tuning (PEFT) techniques for large language models—including Adapter Tuning, Prefix Tuning, LoRA, QLoRA, AdaLoRA, and SoRA—detailing their principles, implementation code, experimental results on NLU tasks, and practical acceleration using the Unsloth library.

AdaLoRA · LoRA · PEFT
39 min read
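The memory argument behind QLoRA, one of the methods the article surveys, is simple arithmetic: the frozen base weights sit in 4-bit NF4 while only the small LoRA adapters train in higher precision. The 7B parameter count below is a hypothetical example, not a figure from the article.

```python
# Rough weight-storage comparison for a hypothetical 7B-parameter model:
# fp16 uses 2 bytes per parameter, 4-bit NF4 uses half a byte.
base_params = 7_000_000_000
fp16_gb = base_params * 2 / 1024**3    # 16-bit base weights
nf4_gb = base_params * 0.5 / 1024**3   # 4-bit quantized base weights
print(round(fp16_gb, 1), round(nf4_gb, 1))   # prints: 13.0 3.3
```

The roughly 4x reduction in weight memory is what lets QLoRA fine-tune models on a single consumer GPU; optimizer state is similarly small because only the adapters have gradients.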
Rare Earth Juejin Tech Community
May 31, 2024 · Artificial Intelligence

Generating Custom QA Datasets with Large Language Models and Fine‑Tuning via LoRA

This article explains how to use a large language model to automatically convert long‑form texts into Alpaca‑style question‑answer pairs, build a LangChain processing chain, and then fine‑tune a model such as Phi‑3‑mini‑4k‑instruct with LoRA, providing full Python code examples.

Dataset Generation · Fine-tuning · LLM
11 min read
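The Alpaca-style records the article generates are plain instruction/input/output JSON objects; a minimal sketch with made-up example content:

```python
import json

# One Alpaca-format training record; the text content here is invented
# for illustration, not taken from the article's dataset.
record = {
    "instruction": "Answer the question using the given passage.",
    "input": "LoRA freezes the base weights and trains low-rank adapters.",
    "output": "LoRA trains small low-rank matrices while the base model stays frozen.",
}
line = json.dumps(record, ensure_ascii=False)  # one JSONL line per pair
print(line)
```

A dataset is then just one such line per generated QA pair, which fine-tuning frameworks can load directly.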
Sohu Tech Products
May 21, 2024 · Artificial Intelligence

OPPO Multimodal Pretrained Model Deployment in Cloud-Edge Scenarios: Practices and Optimizations

OPPO details how it deploys multimodal pretrained models on resource‑constrained edge devices by compressing CLIP‑based image‑text retrieval, adapting Chinese text‑to‑image generation with LoRA and adapters, and lightweighting diffusion models through layer pruning and progressive distillation, achieving sub‑3‑second generation while preserving cloud‑level quality.

Clip · LoRA · OPPO
18 min read
OPPO Kernel Craftsman
Mar 29, 2024 · Artificial Intelligence

InternLM Model Research and XTuner Practical Guide (Part 1): DataLoader, Model Conversion, Merging, and Inference

The guide walks through fine‑tuning InternLM‑Chat‑7B with XTuner, showing how to build a DataLoader from a HuggingFace Dataset, convert a LoRA .pth checkpoint to HuggingFace format, merge the adapter into the base model, run inference, and adapt the process for custom datasets and 4‑bit quantization experiments.

DataLoader · FineTuning · InternLM
27 min read
DaTaobao Tech
Mar 6, 2024 · Artificial Intelligence

AI Clothing Graffiti Project: Implementation and Optimization of AIGC Technology in Taobao Life 2

The AI Clothing Graffiti Project in Taobao Life 2 leverages Stable Diffusion, ControlNet, and LoRA to let users generate and stylize clothing designs via text‑image prompts, employing parallel processing, face repair, and content filtering, and has launched successfully, inviting algorithm engineers to join the team.

AI · AIGC · Computer Vision
14 min read
Ximalaya Technology Team
Feb 1, 2024 · Artificial Intelligence

Understanding AI Image Generation: Diffusion Models, CLIP, and Control Techniques

This guide explains how AI image generators such as Stable Diffusion and DALL·E 3 turn text prompts into pictures by using diffusion models, CLIP‑aligned embeddings, and optional controls like negative prompts, fine‑tuned LoRA checkpoints and ControlNet conditioning, highlighting their differences, workflow, and practical customization.

AI image generation · Clip · ControlNet
18 min read