Fun with Large Models
Author

Master's graduate of Beijing Institute of Technology with four papers published in top journals. Previously a developer at ByteDance and Alibaba, now researching large models at a major state-owned enterprise. Committed to sharing concise, practical experience in large-model development, in the belief that large AI models will become as essential as PCs. Let's start experimenting now!

113 articles
Recent Articles

Jul 10, 2025 · Artificial Intelligence

Grok 4: The ‘Problem‑Solving Champion’ That Falters in Real‑World Use – Detailed Evaluation

The article reviews Grok 4's flashy launch and its claimed first-principles advantage, then presents benchmark results: strong reasoning, multimodal, and agent scores, but coding performance that disappoints next to DeepSeek-R1. It concludes that the model's real-world capabilities fall short of the hype.

Agent · Grok4 · LLM
0 likes · 11 min read
Jun 23, 2025 · Artificial Intelligence

Boost RAG Answer Accuracy: Detailed Step‑by‑Step GraphRAG Knowledge‑Graph Construction

This article walks through the complete GraphRAG knowledge-graph construction pipeline (text splitting, entity extraction, relation mining, community clustering, and report generation) using a concrete example from the book “The Age of Big Data,” and explains why each step improves retrieval and answer quality.

GraphRAG · Knowledge Graph · RAG
0 likes · 20 min read
Jun 12, 2025 · Artificial Intelligence

Implement GRPO to Give LLMs Reasoning Ability with Qwen2.5‑0.5B

This article explains the GRPO reinforcement-learning algorithm and its core idea: responses compete within a group, so no separate evaluator model is needed. It then provides a complete, step-by-step code walkthrough covering environment setup, dataset preparation, reward-function design, training configuration, and evaluation, using the Qwen2.5-0.5B-Instruct model on the GSM8K math dataset.

GRPO · GSM8K · Qwen2.5
0 likes · 23 min read
Jun 5, 2025 · Artificial Intelligence

EvalScope: The Ultimate Large‑Model Evaluation Framework You Control

This article introduces EvalScope, an open-source framework for evaluating large language models. It covers the framework's architecture, built-in benchmarks, and installation steps, then gives step-by-step guides for both performance stress testing and dataset-based capability assessment, so that users can verify model quality independently rather than relying on official documentation.

EvalScope · Visualization · Benchmark Datasets
0 likes · 12 min read