Author

Data Party THU

Official platform of Tsinghua Big Data Research Center, sharing the team's latest research, teaching updates, and big data news.

316 Articles


Latest from Data Party THU

Up to 100 most recent articles

Data Party THU

Apr 21, 2026 · Artificial Intelligence

Can LLM Attack Detection Work Without Storing Any Conversation Text?

This article experimentally evaluates a privacy‑preserving LLM security pipeline that discards raw dialogue after extracting 28 telemetry features, showing that using only 11 text‑independent signals retains about 98.5% of detection performance while reducing false‑positive rates.

LLM security · feature engineering · jailbreak detection

0 likes · 10 min read


Data Party THU

Apr 20, 2026 · Artificial Intelligence

Can AI Rewrite Its Own Evolution Engine? Inside HyperAgents' Self‑Modification Breakthrough

The article analyzes the HyperAgents framework (DGM‑H), showing how merging task and meta agents enables metacognitive self‑modification, improves performance across coding and non‑coding benchmarks, automatically builds supporting infrastructure, and raises new safety and industry‑impact considerations.

AI safety · Hyperagents · LLM post-training

0 likes · 11 min read


Data Party THU

Apr 20, 2026 · Artificial Intelligence

How MemPO Uses Reinforcement Learning to Turn Agent Memory into a Trainable Policy

MemPO introduces a self‑memory policy optimization framework that lets long‑horizon LLM agents autonomously manage and refine their memory via reinforcement learning. Using global‑trajectory and informative‑memory advantage estimates, it achieves up to a 25.98% F1 gain and 73% token reduction on benchmark tasks.

LLM · Long-Horizon Agents · MemPO

0 likes · 8 min read


Data Party THU

Apr 19, 2026 · Artificial Intelligence

Mapping Large-Scale AI Agent Networks: A 3‑Dimensional Classification Framework

The article reviews recent growth in AI agent marketplaces and systems, introduces a three‑dimensional framework—topology, memory scope, and update behavior—to categorize large‑scale multi‑agent networks, and highlights world‑model inconsistency as the core scalability bottleneck.

AI agents · classification framework · memory scope

0 likes · 8 min read


Data Party THU

Apr 17, 2026 · Artificial Intelligence

Mastering Text Chunking: 21 Strategies to Supercharge Your RAG Pipelines

This comprehensive guide presents 21 practical text‑chunking techniques, from simple line‑based splits to advanced embedding‑ and LLM‑driven methods. It explains how each is implemented, with code examples and ideal use cases, to help you build efficient Retrieval‑Augmented Generation systems while avoiding common pitfalls.

AI · Chunking · LLM

0 likes · 57 min read


Data Party THU

Apr 16, 2026 · Artificial Intelligence

Can Multimodal LLMs Truly Understand Emotions? Inside the MME-Emotion Benchmark

The MME-Emotion benchmark, introduced by researchers from CUHK and Alibaba Tongyi and accepted at ICLR 2026, provides a large‑scale, multimodal evaluation of emotional intelligence in large language models, revealing current models’ limited emotion recognition and reasoning abilities across diverse real‑world scenarios.

AI · MME-Emotion · benchmark

0 likes · 10 min read


Data Party THU

Apr 15, 2026 · Backend Development

Which Backend Language Wins in 2026? A Deep Dive into 11 Options

This article evaluates the top eleven backend programming languages for 2026, outlining each language's strengths, drawbacks, and typical adopters, and offers a four‑question framework to help teams select the most suitable technology for their projects.

2026 trends · backend · ecosystem

0 likes · 13 min read


Data Party THU

Apr 15, 2026 · Artificial Intelligence

Why “Raising Lobsters” (OpenClaw) Is the Hottest AI Agent Trend in 2026

The article examines the rapid rise of OpenClaw, the open‑source AI agent dubbed “raising lobsters,” outlining its deployment steps, five major benefits, three key risks, and the six user profiles best suited for this execution‑type AI, while urging cautious adoption.

AI agents · OpenClaw · Use Cases

0 likes · 11 min read


Data Party THU

Apr 14, 2026 · Artificial Intelligence

Heterogeneous Hyperbolic Manifolds for Better Vision-Language Tree Alignment

This paper introduces a novel framework that constructs and aligns dual visual‑textual trees on heterogeneous hyperbolic manifolds, addressing asymmetric modality alignment in hierarchical classification tasks and achieving state‑of‑the‑art performance on benchmarks such as CIFAR‑100, ImageNet and Rare Species datasets.

Cross-Attention · Hierarchical Classification · hyperbolic manifolds

0 likes · 8 min read


Data Party THU

Apr 14, 2026 · Backend Development

10 Advanced Pydantic V2 Tricks to Harden Your FastAPI Production

Discover ten essential Pydantic V2 techniques—including strict mode, field constraints, separate create/update/response models, cross‑field validators, custom error handling, reusable types, forbidden extra fields, nested models, computed fields, and discriminated unions—to prevent subtle bugs and ensure robust, secure FastAPI APIs in production.

FastAPI · Pydantic · Python

0 likes · 17 min read
