Fun with Large Models
Author

Fun with Large Models

Master's graduate from Beijing Institute of Technology, published four top‑journal papers, previously worked as a developer at ByteDance and Alibaba. Currently researching large models at a major state‑owned enterprise. Committed to sharing concise, practical AI large‑model development experience, believing that AI large models will become as essential as PCs in the future. Let's start experimenting now!

113
Articles
0
Likes
0
Views
0
Comments
Recent Articles

Latest from Fun with Large Models

100 recent articles max
Fun with Large Models
Fun with Large Models
Sep 6, 2025 · Artificial Intelligence

How to Build a High-Quality Domain-Specific Fine-Tuning Dataset for Large Models

This article outlines a systematic engineering workflow for creating professional domain fine‑tuning datasets for large models, covering data processing, validation, optimal sample size, industrial‑environment practices, and special considerations for reinforcement‑learning based fine‑tuning.

Fine-tuningdata processingdata validation
0 likes · 7 min read
How to Build a High-Quality Domain-Specific Fine-Tuning Dataset for Large Models
Fun with Large Models
Fun with Large Models
Sep 2, 2025 · Artificial Intelligence

How to Improve Agent Performance with Fine‑Tuning: Key Strategies for AI Interviews

This article explains how to boost large‑model agent performance for interview questions by using efficient fine‑tuning—building multi‑tool parallel and chain‑call datasets—and reinforcement‑learning fine‑tuning with reward functions that target tool accuracy, task completion, and call efficiency, illustrated with concrete JSON examples and open‑source references.

AgentFine-tuningFunction Calling
0 likes · 9 min read
How to Improve Agent Performance with Fine‑Tuning: Key Strategies for AI Interviews
Fun with Large Models
Fun with Large Models
Sep 1, 2025 · Artificial Intelligence

Build a LangGraph AI Agent in Two Lines Using the Prebuilt Graph API

This tutorial shows how to set up a Python environment, install LangGraph, and use its high‑level prebuilt graph API—specifically create_react_agent—to quickly create a weather‑assistant AI agent with just two lines of code, illustrating the full tool‑calling workflow and ReACT loop.

AI agentsLangGraphPython
0 likes · 11 min read
Build a LangGraph AI Agent in Two Lines Using the Prebuilt Graph API
Fun with Large Models
Fun with Large Models
Aug 30, 2025 · Artificial Intelligence

How to Fine‑Tune Large Models on Multiple Nodes and GPUs – A Must‑Know Interview Answer

This article explains how to fine‑tune large models across multiple machines and GPUs by covering data, model, tensor, and pipeline parallelism, hybrid 3D parallel strategies, engineering details such as NCCL, PyTorch Distributed, DeepSpeed, fault‑tolerance, checkpointing, and the ZeRO optimizer stages that dramatically reduce memory usage.

Data ParallelDeepSpeedDistributed Training
0 likes · 8 min read
How to Fine‑Tune Large Models on Multiple Nodes and GPUs – A Must‑Know Interview Answer
Fun with Large Models
Fun with Large Models
Aug 28, 2025 · Artificial Intelligence

A Deep Dive into LangGraph: Understanding the New Graph‑Based AI Agent Framework

The article compares LangGraph with LangChain, explains why a graph‑based architecture offers greater flexibility than linear chains, outlines LangGraph’s three‑layer core architecture and its ecosystem tools—including LangSmith, LangGraph Studio, CLI, and Agent Chat UI—while noting its reliance on LangChain and the need for VPN for CLI usage.

AI agentsGraph WorkflowLLM
0 likes · 11 min read
A Deep Dive into LangGraph: Understanding the New Graph‑Based AI Agent Framework
Fun with Large Models
Fun with Large Models
Aug 22, 2025 · Artificial Intelligence

Step‑by‑Step Guide: Building a PDF‑Based RAG Knowledge Base with LangChain, Streamlit, DashScope & DeepSeek

This tutorial shows how to create a lightweight Retrieval‑Augmented Generation (RAG) system that indexes multiple PDF files, stores their embeddings in a FAISS vector database, and answers user queries through a LangChain agent powered by DashScope embeddings and the DeepSeek‑Chat model, all wrapped in a Streamlit UI.

DashScopeDeepSeekFAISS
0 likes · 13 min read
Step‑by‑Step Guide: Building a PDF‑Based RAG Knowledge Base with LangChain, Streamlit, DashScope & DeepSeek
Fun with Large Models
Fun with Large Models
Aug 20, 2025 · Artificial Intelligence

DeepSeek V3.1 Review: 128K Context, Knowledge, Programming & Agent Skills Near Claude 4

DeepSeek V3.1, released on August 19, expands context length to 128 K tokens and updates its knowledge base to July 2024, and the author’s benchmarks show its programming and agent capabilities now rival Claude 4, with detailed prompt examples, code generation demos, and performance comparisons.

Agent EvaluationClaude 4Context Length
0 likes · 9 min read
DeepSeek V3.1 Review: 128K Context, Knowledge, Programming & Agent Skills Near Claude 4