Data Party THU
Author

Data Party THU

Official platform of Tsinghua Big Data Research Center, sharing the team's latest research, teaching updates, and big data news.

386
Articles
0
Likes
587
Views
0
Comments
Recent Articles

Latest from Data Party THU

100 recent articles max
Data Party THU
Data Party THU
May 11, 2026 · Artificial Intelligence

How a 1930‑Era AI Model Without Any Computer Knowledge Learned to Write Python

The talkie‑1930‑13b language model, trained exclusively on English texts published before 1931, surprisingly understands historical events, solves Python coding problems, and exhibits scaling‑law behavior, prompting a detailed comparison with its modern twin talkie‑web‑13b and an analysis of training pipelines, memory categories, and common deployment pitfalls.

AI memoryLLMPython code generation
0 likes · 10 min read
How a 1930‑Era AI Model Without Any Computer Knowledge Learned to Write Python
Data Party THU
Data Party THU
May 10, 2026 · Artificial Intelligence

SpikingBrain 2.0 Breaks Long‑Sequence and Low‑Power Bottlenecks in Brain‑Inspired LLMs

The Chinese Academy of Sciences unveils SpikingBrain 2.0‑5B, a brain‑inspired large model that uses dual‑space sparse attention and dual activation (FP8 and INT8‑Spiking) to cut training cost by over tenfold, achieve up to 15× speedup on long sequences, and match Qwen‑3 performance while drastically reducing power consumption.

Sparse AttentionSpikingBrain2.0benchmark performance
0 likes · 10 min read
SpikingBrain 2.0 Breaks Long‑Sequence and Low‑Power Bottlenecks in Brain‑Inspired LLMs
Data Party THU
Data Party THU
May 9, 2026 · Artificial Intelligence

NOSE: Enabling AI to Smell with a Unified Molecule‑Receptor‑Semantic Tri‑modal Representation

NOSE introduces a neural olfactory‑semantic embedding that unifies molecular structure, receptor sequences, and natural‑language odor descriptions into a continuous space, achieving state‑of‑the‑art results on eleven tasks and strong zero‑shot generalization for odor and receptor retrieval.

Contrastive Learningdeep learningmolecular design
0 likes · 8 min read
NOSE: Enabling AI to Smell with a Unified Molecule‑Receptor‑Semantic Tri‑modal Representation
Data Party THU
Data Party THU
May 8, 2026 · Backend Development

Stop Using print for Logs: In‑Depth Comparison of Python’s Three Major Logging Solutions

After a chaotic production incident, this article compares Python’s built‑in logging, Loguru, and Logfire, detailing their configurations, strengths, weaknesses, and real‑world use cases—from simple scripts to high‑throughput APIs—while offering migration steps and common pitfalls to help you choose the right solution.

Backend DevelopmentLogfireLogging
0 likes · 17 min read
Stop Using print for Logs: In‑Depth Comparison of Python’s Three Major Logging Solutions
Data Party THU
Data Party THU
May 7, 2026 · Artificial Intelligence

Step‑by‑Step Guide to Building a Multi‑Agent Trading System for End‑to‑End Intelligent Decisions

This article walks through constructing a multi‑agent trading platform—analysts, researchers, traders, risk managers, and a portfolio manager—using LangChain, LangGraph, and LLMs (gpt‑4o, gpt‑4o‑mini), with real‑time data tools, shared and long‑term memory, ReAct loops, structured debates, and a final executable trade proposal.

ChromaDBFinancial AILLM
0 likes · 46 min read
Step‑by‑Step Guide to Building a Multi‑Agent Trading System for End‑to‑End Intelligent Decisions
Data Party THU
Data Party THU
May 6, 2026 · Backend Development

How a Python Generic Repository Cuts 80% of Duplicate CRUD Code

The article demonstrates building a type‑safe, reusable generic repository with Python generics and SQLAlchemy, showing how to replace repetitive CRUD implementations across multiple FastAPI entities, reduce code size from hundreds of lines to a few dozen, and avoid common pitfalls such as missing rollbacks.

FastAPIGenericsPython
0 likes · 14 min read
How a Python Generic Repository Cuts 80% of Duplicate CRUD Code
Data Party THU
Data Party THU
May 6, 2026 · Artificial Intelligence

When AI Seems Obedient, Hidden Alignment Risks Surface

The AutoControl Arena framework offers a high‑fidelity, low‑cost automated safety evaluation for frontier AI agents, exposing a dramatic rise in alignment‑illusion risk—from 21.7% under low pressure to 54.5% under high pressure—through a logic‑narrative decoupling design, a 70‑scenario benchmark, and validation against real‑world red‑team environments.

AI safetyAutoControl Arenaalignment illusion
0 likes · 9 min read
When AI Seems Obedient, Hidden Alignment Risks Surface
Data Party THU
Data Party THU
May 4, 2026 · Artificial Intelligence

Why Sending a Tilde to an LLM Can Erase Your Entire Home Directory

A recent ACL 2026 paper uncovers a “Emoticon Semantic Confusion” vulnerability in large language models, where the tilde symbol (~) intended as a friendly emoticon is interpreted as the shell shortcut for the home directory, causing silent, irreversible deletions across major LLMs with a 38.6 % confusion rate.

ACL 2026LLM safetySecurity Vulnerability
0 likes · 9 min read
Why Sending a Tilde to an LLM Can Erase Your Entire Home Directory