Author

Fun with Large Models

Holds a master's degree from Beijing Institute of Technology, has published four papers in top journals, and previously worked as a developer at ByteDance and Alibaba. Currently researches large models at a major state‑owned enterprise. Committed to sharing concise, practical experience in large‑model development, in the belief that large AI models will become as essential as PCs. Let's start experimenting now!

115

Articles

Likes

218

Views

Comments

Latest from Fun with Large Models

100 recent articles max

Fun with Large Models

Sep 17, 2025 · Artificial Intelligence

Evaluating Fine-Tuned Large Model Performance: Methods and Interview Tips

The article explains how to assess fine‑tuned large models using both human judgment and dataset‑driven metrics, outlines common pitfalls, introduces benchmark datasets and evaluation frameworks, and provides concise answers to related interview questions.

EvalScope · Evaluation · benchmark datasets

0 likes · 7 min read

Fun with Large Models

Sep 16, 2025 · Artificial Intelligence

LangGraph Data Analysis Assistant Agent: Step‑by‑Step Project Guide (Part 5)

This tutorial walks you through building a LangGraph-powered data analysis assistant that converts natural language into SQL, executes queries via NL2SQL and NL2Python tools, visualizes results with a Python interpreter, and deploys the agent using LangGraph CLI and Agent Chat UI for end‑to‑end interaction.

AI Agent · Agent Chat UI · LangGraph

0 likes · 21 min read

Fun with Large Models

Sep 12, 2025 · Artificial Intelligence

When to Choose Model Fine‑Tuning vs RAG for Large‑Model Engineering Interviews

The article explains the technical background and suitable scenarios for Retrieval‑Augmented Generation (RAG) and model fine‑tuning, compares their strengths, discusses how they can be combined, and provides interview‑style Q&A on their capabilities, risks, and differences from model distillation.

AI Interview · Fine‑Tuning · RAG

0 likes · 7 min read

Fun with Large Models

Sep 6, 2025 · Artificial Intelligence

How to Build a High-Quality Domain-Specific Fine-Tuning Dataset for Large Models

This article outlines a systematic engineering workflow for creating professional domain fine‑tuning datasets for large models, covering data processing, validation, optimal sample size, industrial‑environment practices, and special considerations for reinforcement‑learning‑based fine‑tuning.

Data Validation · Dataset Construction · data processing

0 likes · 7 min read

Fun with Large Models

Sep 3, 2025 · Artificial Intelligence

Mastering Multimodal Fine-Tuning of Large Models: Interview‑Ready Techniques

The article explains how to fine‑tune large multimodal models by focusing on the projection layer, optionally using LoRA for language‑model adaptation, and highlights data alignment, common applications, and the added difficulty of modality alignment for interview preparation.

Large Models · fine-tuning · multimodal

0 likes · 6 min read

Fun with Large Models

Sep 2, 2025 · Artificial Intelligence

How to Improve Agent Performance with Fine‑Tuning: Key Strategies for AI Interviews

This article explains, as interview preparation, how to boost large‑model agent performance in two ways: efficient fine‑tuning on multi‑tool parallel and chain‑call datasets, and reinforcement‑learning fine‑tuning with reward functions that target tool accuracy, task completion, and call efficiency. It illustrates both with concrete JSON examples and open‑source references.

Agent · Dataset · fine-tuning

0 likes · 9 min read

Fun with Large Models

Sep 1, 2025 · Artificial Intelligence

Build a LangGraph AI Agent in Two Lines Using the Prebuilt Graph API

This tutorial shows how to set up a Python environment, install LangGraph, and use its high‑level prebuilt graph API (specifically create_react_agent) to create a weather‑assistant AI agent with just two lines of code, illustrating the full tool‑calling workflow and ReAct loop.

AI Agents · LangGraph · Python

0 likes · 11 min read

Fun with Large Models

Aug 30, 2025 · Artificial Intelligence

How to Fine‑Tune Large Models on Multiple Nodes and GPUs – A Must‑Know Interview Answer

This article explains how to fine‑tune large models across multiple machines and GPUs, covering data, model, tensor, and pipeline parallelism; hybrid 3D parallel strategies; engineering details such as NCCL, PyTorch Distributed, DeepSpeed, fault tolerance, and checkpointing; and the ZeRO optimizer stages that dramatically reduce memory usage.

Data Parallel · DeepSpeed · Distributed Training

0 likes · 8 min read

Fun with Large Models

Aug 29, 2025 · Artificial Intelligence

How to Estimate Hardware Costs for Large-Model Fine-Tuning and Training (Interview Classic #1)

The article explains how to estimate GPU memory and overall hardware requirements for fine-tuning and training large dense and MoE models, detailing calculations for full-parameter and LoRA approaches, scaling rules, and hidden costs relevant to interview assessments.

GPU memory · LoRA · Mixture of Experts

0 likes · 8 min read

Fun with Large Models

Aug 28, 2025 · Artificial Intelligence

A Deep Dive into LangGraph: Understanding the New Graph‑Based AI Agent Framework

The article compares LangGraph with LangChain, explains why a graph‑based architecture offers greater flexibility than linear chains, outlines LangGraph’s three‑layer core architecture and its ecosystem tools (LangSmith, LangGraph Studio, CLI, and Agent Chat UI), and notes its reliance on LangChain and the need for a VPN when using the CLI.

AI Agents · Graph Workflow · LLM

0 likes · 11 min read
