Author

Fun with Large Models

Master's graduate from Beijing Institute of Technology, published four top‑journal papers, previously worked as a developer at ByteDance and Alibaba. Currently researching large models at a major state‑owned enterprise. Committed to sharing concise, practical AI large‑model development experience, believing that AI large models will become as essential as PCs in the future. Let's start experimenting now!

115

Articles

Likes

218

Views

Comments

Latest from Fun with Large Models

100 recent articles max

Fun with Large Models

May 30, 2025 · Artificial Intelligence

DeepSeek‑R1 Upgrade: Does Its Coding Ability Match Claude 4? – In‑Depth Model Evaluation

The DeepSeek‑R1‑0528 model released on May 28 2025 shows major gains in coding, function‑calling and long‑text generation, with benchmark scores that surpass Qwen3‑235B, approach Claude 4 in programming, and include detailed hands‑on prompts and results.

AI AgentsDeepSeekModel Evaluation

0 likes · 9 min read

DeepSeek‑R1 Upgrade: Does Its Coding Ability Match Claude 4? – In‑Depth Model Evaluation

Fun with Large Models

May 28, 2025 · Artificial Intelligence

Step‑by‑Step Deployment of Suna: The Open‑Source AI Agent That Beats Manus

This article walks through the complete installation and configuration of Suna, an open‑source, fully offline AI agent built on Claude 3.7, LangChain, and Daytona, comparing its richer UI and privacy advantages to Manus, and demonstrates its capabilities by automating a Kaggle competition analysis.

AI AgentClaude 3.7Docker

0 likes · 12 min read

Step‑by‑Step Deployment of Suna: The Open‑Source AI Agent That Beats Manus

Fun with Large Models

May 25, 2025 · Artificial Intelligence

A Complete Breakdown of Claude 4’s Core Features – How Close Are We to Programmer Unemployment?

Claude 4, released in May 2025 with Opus and Sonnet variants, combines hybrid inference, a 200 K context window, advanced code interpreter, RAG retrieval and MCP integration, delivering industry‑leading programming and AI‑agent performance at relatively low cost, as confirmed by multiple company and user evaluations.

AI AgentsAnthropicClaude 4

0 likes · 10 min read

A Complete Breakdown of Claude 4’s Core Features – How Close Are We to Programmer Unemployment?

Fun with Large Models

May 23, 2025 · Backend Development

Rapidly Build a Streamable HTTP MCP Server with the Official MCP SDK – Full End‑to‑End Guide

This article walks through the complete process of creating, testing, and publishing a streamable HTTP MCP server using the official MCP SDK, covering environment setup with Anaconda and uv, project structuring, code implementation, tool integration, Inspector testing, PyPI deployment, and client verification with CherryStudio.

ASGICherryStudioMCP

0 likes · 16 min read

Rapidly Build a Streamable HTTP MCP Server with the Official MCP SDK – Full End‑to‑End Guide

Fun with Large Models

May 19, 2025 · Backend Development

Build a Streamable HTTP MCP Server from Scratch: Theory, Protocol Deep‑Dive and Full Python Implementation

This article explains the limitations of the original Stdio and HTTP SSE communication modes for MCP, introduces the Streamable HTTP protocol that resolves those issues, and provides a step‑by‑step Python implementation of both a Streamable HTTP MCP server and a matching client, complete with environment setup, FastAPI code, JSON‑RPC handling, and tool‑calling examples.

FastAPIJSON-RPCMCP protocol

0 likes · 28 min read

Build a Streamable HTTP MCP Server from Scratch: Theory, Protocol Deep‑Dive and Full Python Implementation

Fun with Large Models

May 14, 2025 · Artificial Intelligence

Discover the mcp-server-chart MCP Server—Your One‑Click AI Chart Generator

This article introduces the AntV‑developed mcp-server-chart MCP Server, explains how to set up the VSCode + Cline + Node environment, configure the server via JSON, and demonstrates its ability to generate network and bar charts through large‑model function calls, while also discussing current limitations and future improvements.

AIAntVChart Generation

0 likes · 7 min read

Discover the mcp-server-chart MCP Server—Your One‑Click AI Chart Generator

Fun with Large Models

May 13, 2025 · Artificial Intelligence

Build a MiniManus AI Agent in 10 Minutes with Qwen3, Qwen‑Agent, and MCP

This tutorial walks through registering API keys, setting up a conda environment, integrating the Firecrawl MCP server, writing Qwen‑Agent code, and extending the agent with Amap MCP to create a multi‑functional MiniManus AI application in roughly ten minutes.

AmapFirecrawlMCP

0 likes · 9 min read

Build a MiniManus AI Agent in 10 Minutes with Qwen3, Qwen‑Agent, and MCP

Fun with Large Models

May 8, 2025 · Artificial Intelligence

Building AI Agents with Qwen3 and Qwen‑Agent: A Hands‑On Guide to MCP Integration

This tutorial walks through registering a Qwen3 API key, setting up Qwen‑Agent, creating a multi‑turn chatbot, and integrating the MCP SQLite tool to enable natural‑language driven database operations, complete with step‑by‑step code examples and screenshots.

AnacondaMCPPython

0 likes · 11 min read

Building AI Agents with Qwen3 and Qwen‑Agent: A Hands‑On Guide to MCP Integration

Fun with Large Models

Apr 29, 2025 · Artificial Intelligence

Beginner’s Guide to Large Model Fine‑Tuning with Unsloth: Tips and Parameter Ranges

This article walks beginners through the entire fine‑tuning workflow for large language models using Unsloth, covering model and method selection, key hyper‑parameters, dataset formats, training scripts, evaluation strategies, and model‑saving options with concrete code examples.

Fine‑tuningLoRAQLoRA

0 likes · 16 min read

Beginner’s Guide to Large Model Fine‑Tuning with Unsloth: Tips and Parameter Ranges

Fun with Large Models

Apr 25, 2025 · Artificial Intelligence

Why Your RAG System Underperforms and How to Boost Its Effectiveness by 20%

This article analyzes common shortcomings of RAG pipelines—data preparation, retrieval, and LLM generation—and provides concrete optimization techniques such as advanced chunking, embedding model selection, retrieval parameter tuning, rerank models, and prompt engineering, promising up to a 20% performance gain.

EmbeddingPrompt EngineeringRAG

0 likes · 17 min read

Why Your RAG System Underperforms and How to Boost Its Effectiveness by 20%