Data Party THU
Author

Data Party THU

Official platform of Tsinghua Big Data Research Center, sharing the team's latest research, teaching updates, and big data news.

385
Articles
0
Likes
581
Views
0
Comments
Recent Articles

Latest from Data Party THU

100 recent articles max
Data Party THU
Data Party THU
Jun 11, 2026 · Artificial Intelligence

Boost 18 LLM Agents Without Retraining Using LIFE‑HARNESS

The article introduces LIFE‑HARNESS, a runtime‑interface adaptation framework that keeps model weights unchanged, extracts reusable failure patterns from a single model's training trace, and achieves an average 88.5% relative performance gain across 18 LLM agents and 7 deterministic environments, with successful transfer to 17 other models.

LLM Agentsbenchmark evaluationcross‑model transfer
0 likes · 8 min read
Boost 18 LLM Agents Without Retraining Using LIFE‑HARNESS
Data Party THU
Data Party THU
Jun 11, 2026 · Artificial Intelligence

GBrain’s 14K‑Star Open‑Source System Solves AI Agent Forgetting

GBrain, the open‑source AI agent memory platform with over 14,000 GitHub stars, uses a three‑layer architecture—Markdown‑based truth source, hybrid retrieval with PGLite, and 34 skill workflows—to eliminate agent forgetting, achieve a 31.4% retrieval boost, and provide Python integration via the MCP protocol, while outlining practical deployment pitfalls.

AI memoryGBrainHybrid retrieval
0 likes · 17 min read
GBrain’s 14K‑Star Open‑Source System Solves AI Agent Forgetting
Data Party THU
Data Party THU
Jun 10, 2026 · Industry Insights

Synthetic Cells from Scratch: 100+ Asian Teams Unveil 10‑Year Roadmap

The article outlines the SynCell Asia Initiative’s ten‑year plan to create fully synthetic cells, identifying four major technical barriers—metabolism, ribosome assembly, biophysical coupling, and cell‑cycle control—and proposes a two‑stage strategy (ProtoCell then AutoCell) driven by AI‑enabled automation and broad international collaboration.

AI automationcell engineeringinterdisciplinary collaboration
0 likes · 7 min read
Synthetic Cells from Scratch: 100+ Asian Teams Unveil 10‑Year Roadmap
Data Party THU
Data Party THU
Jun 10, 2026 · Artificial Intelligence

How Visual Para-Thinker Tackles Visual Hallucination with a Clever Parallel Reasoning Design

The article introduces Visual Para-Thinker, a parallel reasoning framework for large vision‑language models that mitigates attention drift and visual hallucination by employing path‑aware attention, learnable parallel rotary position embeddings, and hybrid block‑and‑scan visual token partitions, and validates the approach with extensive multimodal benchmarks.

Hallucination MitigationLPRoPEMultimodal Benchmarks
0 likes · 10 min read
How Visual Para-Thinker Tackles Visual Hallucination with a Clever Parallel Reasoning Design
Data Party THU
Data Party THU
Jun 8, 2026 · Artificial Intelligence

Can Large Language Models Design Chemical Synthesis? ChemReason‑Bench Exposes AI’s Logic Gaps

The ChemReason‑Bench benchmark, introduced by Shanghai Jiao Tong University, evaluates large language models on six program‑reasoning tasks for chemical synthesis, revealing that while top general models show modest reasoning ability, step‑completion remains difficult and domain‑specific models lag behind, prompting new training datasets for improvement.

AI chemistryChemReason-Benchbenchmark
0 likes · 8 min read
Can Large Language Models Design Chemical Synthesis? ChemReason‑Bench Exposes AI’s Logic Gaps
Data Party THU
Data Party THU
Jun 7, 2026 · Frontend Development

Build Web Tools with Python Only: Introducing NiceGUI

This article introduces NiceGUI, a pure‑Python web UI framework that lets developers create fully functional, visually appealing web applications without writing any HTML, CSS, or JavaScript, covering its core concepts, quick‑start example, advanced features, component library, layout system, data‑visualisation integration, multi‑page support, suitable scenarios, and a comparison with traditional web development.

NiceGUINo-code FrontendPython
0 likes · 14 min read
Build Web Tools with Python Only: Introducing NiceGUI
Data Party THU
Data Party THU
Jun 7, 2026 · Artificial Intelligence

When Long Prompts Cause Forgetting: Understanding Generalization in In‑Context Continual Learning

The paper introduces a theoretical framework for In‑Context Continual Learning, showing how shared attention in large language models creates bias, variance, and a novel interference term that explains why longer prompts can lead to forgetting, and provides concrete guidelines for prompt design based on task similarity, context length, and order.

Prompt Engineeringattention mechanismbias-variance-interference
0 likes · 25 min read
When Long Prompts Cause Forgetting: Understanding Generalization in In‑Context Continual Learning
Data Party THU
Data Party THU
Jun 6, 2026 · Artificial Intelligence

How a 400B MoE Model Runs on iPhone 17 Pro with Flash‑MoE

The article details how the open‑source Flash‑MoE engine enables the 400B‑parameter Qwen3.5‑397B‑A17B mixture‑of‑experts model to run on an iPhone 17 Pro, achieving about 0.6 tokens per second through a custom Metal pipeline, GCD‑driven SSD streaming, and aggressive caching strategies.

400BFlash-MoELLM inference
0 likes · 6 min read
How a 400B MoE Model Runs on iPhone 17 Pro with Flash‑MoE