Tagged articles

foundation model

27 articles · Page 1 of 1

Jun 27, 2026 · Artificial Intelligence

FTP-1: First Generalist Tactile Foundation Model Unifying 21 Sensors for Diverse Robots

FTP-1, a new generalist tactile foundation policy trained on the 3,000‑hour FTP‑1‑Dataset covering 21 heterogeneous sensors from 26 sources, introduces a morphology‑aware token space and an independent tactile transformer expert, achieving up to 31.6‑percentage‑point gains on unseen sensors and consistently outperforming prior VLA baselines across 14 real‑world manipulation tasks.

Multimodaldatasetfoundation model

0 likes · 12 min read

FTP-1: First Generalist Tactile Foundation Model Unifying 21 Sensors for Diverse Robots

HyperAI Super Neural

Jun 10, 2026 · Artificial Intelligence

Pixel‑Level Foundation Model for Earth Observation Sets New SOTA Across Tasks, Excelling with Sparse Labels

A joint team from Cambridge, Aalto and Bristol introduces TESSERA, a pixel‑level remote‑sensing foundation model that leverages a Barlow‑Twins self‑supervised scheme and a novel d‑pixel data organization to achieve state‑of‑the‑art accuracy on classification, segmentation and regression tasks, especially when annotations are scarce.

Sentinel-1Sentinel-2d-pixel

0 likes · 12 min read

Pixel‑Level Foundation Model for Earth Observation Sets New SOTA Across Tasks, Excelling with Sparse Labels

Machine Heart

May 27, 2026 · Artificial Intelligence

Samsung’s Bold Move from Foundation Models to Physical AI: Leveraging ROM‑Based Architecture and New Benchmarks

Samsung is rapidly building its own foundation model ecosystem, introducing the memory‑based Meki architecture to leverage ROM for edge AI, the multi‑domain M2RL reinforcement‑learning paradigm, and the LiveClawBench 3‑dimensional benchmark to evaluate physical‑world AI, signaling a strategic shift from cloud‑centric to physical AI deployment.

LiveClawBenchMemory-Based ArchitectureMulti-Domain RL

0 likes · 8 min read

Samsung’s Bold Move from Foundation Models to Physical AI: Leveraging ROM‑Based Architecture and New Benchmarks

Machine Heart

May 7, 2026 · Artificial Intelligence

Genesis AI Shows Embodied Model That Cooks, Experiments and Plays Piano

Genesis AI’s new GENE‑26.5 embodied foundation model demonstrates long‑horizon robot capabilities—from cooking a multi‑step meal and solving a Rubik’s cube to playing a high‑speed piano piece—using a full‑stack system that combines human‑like hands, a data‑glove, extensive simulation, and ultra‑low‑latency control.

Embodied AISimulationdata glove

0 likes · 11 min read

Genesis AI Shows Embodied Model That Cooks, Experiments and Plays Piano

DataFunSummit

Apr 10, 2026 · Artificial Intelligence

How Can AI Agents Truly Remember? A Deep Dive into Long‑Term Memory Engineering

This article examines the shortcomings of current AI assistants, outlines the ideal of long‑term memory engineering, reviews mainstream industry solutions such as hard‑context models and Retrieval‑Augmented Generation, proposes a four‑layer memory loop architecture, and looks ahead to online learning and collective intelligence for future agents.

AIAgentEvaluation

0 likes · 15 min read

How Can AI Agents Truly Remember? A Deep Dive into Long‑Term Memory Engineering

AI Explorer

Apr 4, 2026 · Artificial Intelligence

Google TimesFM: A GPT‑style Foundation Model Redefining Time‑Series Forecasting

Google's open‑source TimesFM model brings pre‑trained, GPT‑like capabilities to time‑series forecasting, offering few‑shot and zero‑shot predictions, extended context length, continuous quantile outputs, and easy integration via a simple PyTorch API for developers across domains.

GooglePyTorchTime Series Forecasting

0 likes · 7 min read

Google TimesFM: A GPT‑style Foundation Model Redefining Time‑Series Forecasting

AI Explorer

Apr 1, 2026 · Artificial Intelligence

Google Open‑Sources TimesFM: A Foundation Model for Plug‑and‑Play Time‑Series Forecasting

Google’s open‑source TimesFM is a decoder‑only Transformer foundation model that delivers plug‑and‑play time‑series forecasting with zero‑shot accuracy, larger context windows, quantile predictions, and a simple Hugging Face API, making it suitable for retail, energy, finance, monitoring, and IoT use cases.

Hugging FacePyTorchTime Series Forecasting

0 likes · 7 min read

Google Open‑Sources TimesFM: A Foundation Model for Plug‑and‑Play Time‑Series Forecasting

Amap Tech

Mar 30, 2026 · Artificial Intelligence

ABot-M0: A Unified VLA Framework Solving the One‑Brain Many‑Forms Robotics Challenge

ABot-M0 is an open‑source Vision‑Language‑Action foundation model that unifies fragmented robot data, introduces Action Manifold Learning for smoother action prediction, and offers a plug‑and‑play dual‑stream perception architecture, achieving state‑of‑the‑art results on major manipulation benchmarks.

Embodied AIaction manifold learningfoundation model

0 likes · 4 min read

ABot-M0: A Unified VLA Framework Solving the One‑Brain Many‑Forms Robotics Challenge

Machine Learning Algorithms & Natural Language Processing

Mar 28, 2026 · Artificial Intelligence

Do All Physical Signals Reduce to a Single Discrete Token? LongCat‑Next Explained

LongCat‑Next, Meituan’s new 3‑billion‑parameter foundation model, adopts a pure‑discrete DiNA architecture with next‑token prediction, converting vision, audio and text into unified tokens; it surpasses same‑size multimodal models on OmniDocBench‑EN, CharXivRQ and SWE‑Bench, avoids catastrophic forgetting, and introduces dNaViT, RVQ compression and a dual‑path detokenizer for high‑fidelity generation.

DiNALongCat-NextMultimodal

0 likes · 10 min read

Do All Physical Signals Reduce to a Single Discrete Token? LongCat‑Next Explained

Data Party THU

Mar 22, 2026 · Artificial Intelligence

How scLong’s Billion‑Parameter Model Reads the Whole Single‑Cell Transcriptome

The scLong foundation model, trained on 48 million cells and 28 k genes, integrates full‑gene expression with Gene Ontology knowledge to outperform existing methods on genetic perturbation, chemical response, cancer drug prediction, gene‑regulatory network inference, and batch integration tasks.

bioinformaticsfoundation modelgene ontology

0 likes · 13 min read

How scLong’s Billion‑Parameter Model Reads the Whole Single‑Cell Transcriptome

PaperAgent

Feb 25, 2026 · Artificial Intelligence

How RynnBrain Unifies Perception, Reasoning, and Planning for Embodied AI

RynnBrain, an open‑source unified spatiotemporal foundation model from Alibaba DAMO Academy, integrates perception, localization, physics‑based reasoning and planning across 2 B, 8 B and 30 B MoE scales, handles multimodal visual inputs, and outperforms existing models on over 20 embodied benchmarks.

AlibabaBenchmarkEmbodied AI

0 likes · 3 min read

How RynnBrain Unifies Perception, Reasoning, and Planning for Embodied AI

AI Engineering

Feb 15, 2026 · Industry Insights

OpenClaw Joins OpenAI: Sam Altman Moves Faster Than Zuckerberg

Peter Steinberger announced his move to OpenAI and the conversion of OpenClaw into an independent foundation, sparking community debate over OpenAI's open‑source strategy, the future of AI agents, and the strategic implications of this partnership.

AI agentsOpenAIOpenClaw

0 likes · 4 min read

OpenClaw Joins OpenAI: Sam Altman Moves Faster Than Zuckerberg

HyperAI Super Neural

Feb 3, 2026 · Artificial Intelligence

Walrus: 1.3B Transformer Model Beats Prior Foundations Across 19 Physics Domains

Walrus, a 1.3 billion‑parameter Transformer built by Polymathic AI, is pretrained on 19 diverse physics scenarios—including astrophysics, geoscience, rheology, plasma physics and acoustics—using techniques like patch jittering, adaptive compute tokenization and space‑time factorized attention, and consistently outperforms earlier foundation models on both short‑ and long‑term continuum dynamics predictions.

Scientific AITransformerWalrus

0 likes · 13 min read

Walrus: 1.3B Transformer Model Beats Prior Foundations Across 19 Physics Domains

HyperAI Super Neural

Jan 29, 2026 · Artificial Intelligence

Skild AI Secures $1.4B Funding to Build a General‑Purpose Robot Brain

Skild AI raised about $1.4 billion in a C‑round led by SoftBank, with participation from Nvidia, Sequoia, Bezos Expeditions and others, to develop a universal foundation model—Skild Brain—that can be deployed across diverse robot platforms, leveraging large‑scale visual data and a hierarchical control architecture.

General AISkild AIfoundation model

0 likes · 11 min read

Skild AI Secures $1.4B Funding to Build a General‑Purpose Robot Brain

Alimama Tech

Jan 7, 2026 · Artificial Intelligence

How Bid2X Revolutionizes Online Ad Bidding with a Universal Foundation Model

Bid2X introduces a bidding‑environment foundation model that unifies heterogeneous ad‑bidding data, leverages variable and time attention mechanisms, handles zero‑inflated distributions, and demonstrates superior offline performance across eight large‑scale datasets and significant online gains in GMV and ROI when deployed on a major e‑commerce platform.

Advertisingbiddingfoundation model

0 likes · 20 min read

How Bid2X Revolutionizes Online Ad Bidding with a Universal Foundation Model

Kuaishou Tech

Dec 4, 2025 · Artificial Intelligence

Can a Tree‑Reasoned Model Master Video Emotion Understanding?

The paper introduces VidEmo, a multimodal video foundation model that uses a two‑stage emotion‑clue‑guided reasoning framework and a large emotion‑centric dataset (Emo‑CFG) to achieve state‑of‑the‑art performance on facial attribute, expression, and fine‑grained emotion tasks, surpassing Gemini 2.0.

AIMultimodalcomputer vision

0 likes · 15 min read

Can a Tree‑Reasoned Model Master Video Emotion Understanding?

HyperAI Super Neural

Nov 24, 2025 · Artificial Intelligence

Introducing AION-1: The First Astronomical Multimodal Foundation Model Trained on 200M Targets

AION-1, developed by a consortium including UC Berkeley, Cambridge and Oxford, is the first large‑scale multimodal foundation model for astronomy that unifies images, spectra and catalog data via an early‑fusion backbone, achieving zero‑shot and linear‑probe performance that rivals or surpasses task‑specific models across diverse scientific tasks.

Multimodal AITokenizationastronomy

0 likes · 18 min read

Introducing AION-1: The First Astronomical Multimodal Foundation Model Trained on 200M Targets

Bighead's Algorithm Notes

Oct 23, 2025 · Artificial Intelligence

FinCast: A Foundation Model for Financial Time‑Series Forecasting

FinCast introduces a decoder‑only Transformer foundation model for financial time‑series forecasting that tackles non‑stationarity, multi‑domain diversity, and multi‑resolution challenges through input chunking with frequency embeddings, a sparse MoE decoder, and a PQ‑loss, achieving zero‑shot and supervised gains over state‑of‑the‑art baselines while running five times faster on consumer GPUs.

PQ lossSparse MoETransformer

0 likes · 12 min read

FinCast: A Foundation Model for Financial Time‑Series Forecasting

Network Intelligence Research Center (NIRC)

Oct 17, 2025 · Artificial Intelligence

LucaOne: Unified Nucleic Acid & Protein Language Model Surpasses Other Models

Researchers present LucaOne, a Transformer‑based foundation model that unifies DNA/RNA and protein sequences using a 39‑token vocabulary, rotary positional encoding, and molecule‑type embeddings, and demonstrate through extensive multi‑task benchmarks that it outperforms domain‑specific models across seven biological tasks.

DNAMultimodalTransformer

0 likes · 5 min read

LucaOne: Unified Nucleic Acid & Protein Language Model Surpasses Other Models

Bighead's Algorithm Notes

Sep 20, 2025 · Artificial Intelligence

Recent Time-Series Paper Summaries (Sep 13‑19, 2025)

This article summarizes four recent time‑series forecasting papers, covering a universal delay‑embedding foundation model, a dual causal network that leverages exogenous variables, a distribution‑aware alignment plug‑in called TimeAlign, and a shapelet‑based framework for interpretable directional forecasting in noisy financial markets.

causal networkfinancial marketsforecasting

0 likes · 9 min read

Recent Time-Series Paper Summaries (Sep 13‑19, 2025)

Amazon Cloud Developers

Sep 19, 2025 · Artificial Intelligence

DeepSeek‑V3.1 Launches on Amazon Bedrock: Fully Managed Model with Dual Reasoning Modes

DeepSeek‑V3.1 arrives on Amazon Bedrock as a fully managed foundation model offering two inference modes, improved benchmark performance over DeepSeek‑R1, support for over 100 languages, enhanced tool‑calling and agent capabilities, and detailed guidance for secure enterprise deployment.

Amazon BedrockBenchmarkDeepSeek-V3.1

0 likes · 7 min read

DeepSeek‑V3.1 Launches on Amazon Bedrock: Fully Managed Model with Dual Reasoning Modes

Bighead's Algorithm Notes

Sep 7, 2025 · Artificial Intelligence

Paper Review: Kronos – A Temporal Foundation Model for Financial Market Language

This article reviews Kronos, a unified and scalable pre‑training framework designed for financial K‑line data, detailing its tokenization approach, autoregressive architecture, large‑scale pre‑training on 12 billion records, and experimental results that show substantial gains in price prediction, volatility forecasting, synthetic data generation, and investment simulation.

KronosTokenizationautoregressive pretraining

0 likes · 9 min read

Paper Review: Kronos – A Temporal Foundation Model for Financial Market Language

Tencent Advertising Technology

Sep 3, 2025 · Artificial Intelligence

Boosting Ads Revenue: LFM4Ads’ Full‑Representation Multi‑Granular Transfer Raises GMV 2.45%

Tencent's LFM4Ads introduces a full‑representation, multi‑granular knowledge transfer framework that moves user, item, and cross representations from a large foundation model to downstream tasks, achieving up to 2.45% platform GMV uplift across more than ten advertising scenarios.

Knowledge TransferLarge-Scale Dataads recommendation

0 likes · 12 min read

Boosting Ads Revenue: LFM4Ads’ Full‑Representation Multi‑Granular Transfer Raises GMV 2.45%

Tencent Advertising Technology

Aug 31, 2025 · Artificial Intelligence

LFM4Ads: Full-Representation Multi-Granular Transfer Boosts Ad Recommendation

Tencent's LFM4Ads foundation model introduces a full-representation, multi-granular knowledge transfer framework that moves user, item, and cross representations to downstream tasks, dramatically improving ad recommendation metrics across dozens of business scenarios.

Knowledge TransferLarge‑Scale Trainingad recommendation

0 likes · 10 min read

LFM4Ads: Full-Representation Multi-Granular Transfer Boosts Ad Recommendation

Data Party THU

Aug 24, 2025 · Artificial Intelligence

Can a ‘Centaur’ AI Model Truly Predict Human Decisions? A Deep Dive

This article reviews the Centaur foundation model—fine‑tuned from Llama 3‑70B on the Psych‑101 dataset—to assess its ability to predict human choices, brain activity, and decision rationales across diverse psychological experiments, while discussing generalization, over‑fitting, and future research limits.

Centaurcognitive modelingdecision prediction

0 likes · 17 min read

Can a ‘Centaur’ AI Model Truly Predict Human Decisions? A Deep Dive

Ma Wei Says

Mar 4, 2025 · Artificial Intelligence

Microsoft’s Open‑Source Multimodal AI Agent Model Magma: Capabilities and Innovations

On February 25 2025, Microsoft open‑sourced its first multimodal AI agent foundation model, Magma, which extends multimodal processing to images, video, and text, introduces Set‑of‑Mark and Trace‑of‑Mark techniques for spatial‑temporal reasoning, optimizes modular inference for edge devices, and integrates reinforcement learning for adaptive task execution.

MagmaMultimodal AISet-of-Mark

0 likes · 6 min read

Microsoft’s Open‑Source Multimodal AI Agent Model Magma: Capabilities and Innovations

AntTech

Mar 1, 2024 · Artificial Intelligence

Ant Group Unveils SkySense: 2.06‑Billion‑Parameter Multimodal Remote‑Sensing Foundation Model Accepted at CVPR 2024

Ant Group introduced SkySense, a 2.06‑billion‑parameter multimodal remote‑sensing foundation model that outperformed 18 international rivals across 17 benchmark tasks, was accepted to CVPR 2024, and aims to support applications such as agriculture, urban planning, and disaster response.

Ant GroupCVPR 2024Multimodal AI

0 likes · 6 min read

Ant Group Unveils SkySense: 2.06‑Billion‑Parameter Multimodal Remote‑Sensing Foundation Model Accepted at CVPR 2024