Tagged articles
23 articles
Machine Heart
May 7, 2026 · Artificial Intelligence

Genesis AI Shows Embodied Model That Cooks, Experiments and Plays Piano

Genesis AI’s new GENE‑26.5 embodied foundation model demonstrates long‑horizon robot capabilities—from cooking a multi‑step meal and solving a Rubik’s cube to playing a high‑speed piano piece—using a full‑stack system that combines human‑like hands, a data‑glove, extensive simulation, and ultra‑low‑latency control.

Embodied AI · data glove · foundation model
11 min read
DataFunSummit
Apr 10, 2026 · Artificial Intelligence

How Can AI Agents Truly Remember? A Deep Dive into Long‑Term Memory Engineering

This article examines the shortcomings of current AI assistants, outlines the ideal of long‑term memory engineering, reviews mainstream industry solutions such as hard‑context models and Retrieval‑Augmented Generation, proposes a four‑layer memory loop architecture, and looks ahead to online learning and collective intelligence for future agents.

AI · Hybrid Architecture · Memory
15 min read
AI Explorer
Apr 1, 2026 · Artificial Intelligence

Google Open‑Sources TimesFM: A Foundation Model for Plug‑and‑Play Time‑Series Forecasting

Google’s open‑source TimesFM is a decoder‑only Transformer foundation model that delivers plug‑and‑play time‑series forecasting with zero‑shot accuracy, larger context windows, quantile predictions, and a simple Hugging Face API, making it suitable for retail, energy, finance, monitoring, and IoT use cases.

Hugging Face · PyTorch · TimesFM
7 min read
Amap Tech
Mar 30, 2026 · Artificial Intelligence

ABot-M0: A Unified VLA Framework Solving the One‑Brain Many‑Forms Robotics Challenge

ABot-M0 is an open‑source Vision‑Language‑Action foundation model that unifies fragmented robot data, introduces Action Manifold Learning for smoother action prediction, and offers a plug‑and‑play dual‑stream perception architecture, achieving state‑of‑the‑art results on major manipulation benchmarks.

Embodied AI · Robotics · action manifold learning
4 min read
Machine Learning Algorithms & Natural Language Processing
Mar 28, 2026 · Artificial Intelligence

Do All Physical Signals Reduce to a Single Discrete Token? LongCat‑Next Explained

LongCat‑Next, Meituan’s new 3‑billion‑parameter foundation model, adopts a pure‑discrete DiNA architecture with next‑token prediction, converting vision, audio and text into unified tokens; it surpasses same‑size multimodal models on OmniDocBench‑EN, CharXivRQ and SWE‑Bench, avoids catastrophic forgetting, and introduces dNaViT, RVQ compression and a dual‑path detokenizer for high‑fidelity generation.

DiNA · LongCat-Next · Multimodal
10 min read
Data Party THU
Mar 22, 2026 · Artificial Intelligence

How scLong’s Billion‑Parameter Model Reads the Whole Single‑Cell Transcriptome

The scLong foundation model, trained on 48 million cells and 28 k genes, integrates full‑gene expression with Gene Ontology knowledge to outperform existing methods on genetic perturbation, chemical response, cancer drug prediction, gene‑regulatory network inference, and batch integration tasks.

bioinformatics · foundation model · gene ontology
13 min read
PaperAgent
Feb 25, 2026 · Artificial Intelligence

How RynnBrain Unifies Perception, Reasoning, and Planning for Embodied AI

RynnBrain, an open‑source unified spatiotemporal foundation model from Alibaba DAMO Academy, integrates perception, localization, physics‑based reasoning and planning across 2 B, 8 B and 30 B MoE scales, handles multimodal visual inputs, and outperforms existing models on over 20 embodied benchmarks.

Alibaba · Embodied AI · Multimodal
3 min read
AI Engineering
Feb 15, 2026 · Industry Insights

OpenClaw Joins OpenAI: Sam Altman Moves Faster Than Zuckerberg

Peter Steinberger announced his move to OpenAI and the conversion of OpenClaw into an independent foundation, sparking community debate over OpenAI's open‑source strategy, the future of AI agents, and the strategic implications of this partnership.

AI agents · OpenAI · OpenClaw
4 min read
HyperAI Super Neural
Feb 3, 2026 · Artificial Intelligence

Walrus: 1.3B Transformer Model Beats Prior Foundations Across 19 Physics Domains

Walrus, a 1.3 billion‑parameter Transformer built by Polymathic AI, is pretrained on 19 diverse physics scenarios—including astrophysics, geoscience, rheology, plasma physics and acoustics—using techniques like patch jittering, adaptive compute tokenization and space‑time factorized attention, and consistently outperforms earlier foundation models on both short‑ and long‑term continuum dynamics predictions.

Transformer · Walrus · continuum dynamics
13 min read
HyperAI Super Neural
Jan 29, 2026 · Artificial Intelligence

Skild AI Secures $1.4B Funding to Build a General‑Purpose Robot Brain

Skild AI raised about $1.4 billion in a Series C round led by SoftBank, with participation from Nvidia, Sequoia, Bezos Expeditions and others, to develop a universal foundation model, Skild Brain, that can be deployed across diverse robot platforms, leveraging large‑scale visual data and a hierarchical control architecture.

General AI · Robotics · Skild AI
11 min read
Alimama Tech
Jan 7, 2026 · Artificial Intelligence

How Bid2X Revolutionizes Online Ad Bidding with a Universal Foundation Model

Bid2X introduces a bidding‑environment foundation model that unifies heterogeneous ad‑bidding data, leverages variable and time attention mechanisms, handles zero‑inflated distributions, and demonstrates superior offline performance across eight large‑scale datasets and significant online gains in GMV and ROI when deployed on a major e‑commerce platform.

Advertising · bidding · foundation model
20 min read
Kuaishou Tech
Dec 4, 2025 · Artificial Intelligence

Can a Tree‑Reasoned Model Master Video Emotion Understanding?

The paper introduces VidEmo, a multimodal video foundation model that uses a two‑stage emotion‑clue‑guided reasoning framework and a large emotion‑centric dataset (Emo‑CFG) to achieve state‑of‑the‑art performance on facial attribute, expression, and fine‑grained emotion tasks, surpassing Gemini 2.0.

AI · Computer Vision · Dataset
15 min read
HyperAI Super Neural
Nov 24, 2025 · Artificial Intelligence

Introducing AION-1: The First Astronomical Multimodal Foundation Model Trained on 200M Targets

AION-1, developed by a consortium including UC Berkeley, Cambridge and Oxford, is the first large‑scale multimodal foundation model for astronomy that unifies images, spectra and catalog data via an early‑fusion backbone, achieving zero‑shot and linear‑probe performance that rivals or surpasses task‑specific models across diverse scientific tasks.

Multimodal AI · astronomy · cross‑modal generation
18 min read
Bighead's Algorithm Notes
Oct 23, 2025 · Artificial Intelligence

FinCast: A Foundation Model for Financial Time‑Series Forecasting

FinCast introduces a decoder‑only Transformer foundation model for financial time‑series forecasting that tackles non‑stationarity, multi‑domain diversity, and multi‑resolution challenges through input chunking with frequency embeddings, a sparse MoE decoder, and a PQ‑loss, achieving zero‑shot and supervised gains over state‑of‑the‑art baselines while running five times faster on consumer GPUs.

PQ loss · Transformer · financial time series
12 min read
Network Intelligence Research Center (NIRC)
Oct 17, 2025 · Artificial Intelligence

LucaOne: Unified Nucleic Acid & Protein Language Model Surpasses Other Models

Researchers present LucaOne, a Transformer‑based foundation model that unifies DNA/RNA and protein sequences using a 39‑token vocabulary, rotary positional encoding, and molecule‑type embeddings, and demonstrate through extensive multi‑task benchmarks that it outperforms domain‑specific models across seven biological tasks.

DNA · Multimodal · Transformer
5 min read
Bighead's Algorithm Notes
Sep 20, 2025 · Artificial Intelligence

Recent Time-Series Paper Summaries (Sep 13‑19, 2025)

This article summarizes four recent time‑series forecasting papers, covering a universal delay‑embedding foundation model, a dual causal network that leverages exogenous variables, a distribution‑aware alignment plug‑in called TimeAlign, and a shapelet‑based framework for interpretable directional forecasting in noisy financial markets.

Time Series · causal network · financial markets
9 min read
Bighead's Algorithm Notes
Sep 7, 2025 · Artificial Intelligence

Paper Review: Kronos – A Temporal Foundation Model for Financial Market Language

This article reviews Kronos, a unified and scalable pre‑training framework designed for financial K‑line data, detailing its tokenization approach, autoregressive architecture, large‑scale pre‑training on 12 billion records, and experimental results that show substantial gains in price prediction, volatility forecasting, synthetic data generation, and investment simulation.

Kronos · autoregressive pretraining · financial time series
9 min read
Tencent Advertising Technology
Sep 3, 2025 · Artificial Intelligence

Boosting Ads Revenue: LFM4Ads’ Full‑Representation Multi‑Granular Transfer Raises GMV 2.45%

Tencent's LFM4Ads introduces a full‑representation, multi‑granular knowledge transfer framework that moves user, item, and cross representations from a large foundation model to downstream tasks, achieving up to 2.45% platform GMV uplift across more than ten advertising scenarios.

Knowledge Transfer · ads recommendation · foundation model
12 min read
Data Party THU
Aug 24, 2025 · Artificial Intelligence

Can a ‘Centaur’ AI Model Truly Predict Human Decisions? A Deep Dive

This article reviews the Centaur foundation model, fine‑tuned from Llama 3‑70B on the Psych‑101 dataset, and assesses its ability to predict human choices, brain activity, and decision rationales across diverse psychological experiments, while discussing generalization, over‑fitting, and open limitations for future research.

Centaur · Psychology · cognitive modeling
17 min read
Ma Wei Says
Mar 4, 2025 · Artificial Intelligence

Microsoft’s Open‑Source Multimodal AI Agent Model Magma: Capabilities and Innovations

On February 25, 2025, Microsoft open‑sourced its first multimodal AI agent foundation model, Magma, which extends multimodal processing to images, video, and text, introduces Set‑of‑Mark and Trace‑of‑Mark techniques for spatial‑temporal reasoning, optimizes modular inference for edge devices, and integrates reinforcement learning for adaptive task execution.

Edge Computing · Magma · Multimodal AI
6 min read
AntTech
Mar 1, 2024 · Artificial Intelligence

Ant Group Unveils SkySense: 2.06‑Billion‑Parameter Multimodal Remote‑Sensing Foundation Model Accepted at CVPR 2024

Ant Group introduced SkySense, a 2.06‑billion‑parameter multimodal remote‑sensing foundation model that outperformed 18 international rivals across 17 benchmark tasks, was accepted to CVPR 2024, and aims to support applications such as agriculture, urban planning, and disaster response.

Ant Group · CVPR 2024 · Multimodal AI
6 min read