Machine Heart
Author

Machine Heart

Professional AI media and industry service platform

207
Articles
0
Likes
3
Views
0
Comments
Recent Articles

Latest from Machine Heart

100 recent articles max
Machine Heart
Machine Heart
Apr 24, 2026 · Artificial Intelligence

Audio-Omni: A Unified Multimodal Model for Understanding, Generating, and Editing Audio Across Sound, Music, and Speech

Audio-Omni, a unified multimodal audio model presented at SIGGRAPH 2026, combines a frozen large multimodal language model with a trainable diffusion generator to achieve state‑of‑the‑art understanding, generation, and instruction‑based editing across general sounds, music, and speech, leveraging a million‑scale AudioEdit dataset and a hybrid conditioning architecture.

Audio-OmniAudioEditDiffusion Generation
0 likes · 11 min read
Audio-Omni: A Unified Multimodal Model for Understanding, Generating, and Editing Audio Across Sound, Music, and Speech
Machine Heart
Machine Heart
Apr 24, 2026 · Artificial Intelligence

Vision Banana Shows That Image Generation Equals Understanding – DeepMind’s GPT‑like Leap

DeepMind’s Vision Banana model demonstrates that large‑scale image‑generation pre‑training can produce powerful, universal visual representations, achieving state‑of‑the‑art results on segmentation, depth, and normal estimation without task‑specific heads, thereby supporting the hypothesis that generation and understanding are fundamentally linked.

DeepMindGenerative AIImage Generation
0 likes · 13 min read
Vision Banana Shows That Image Generation Equals Understanding – DeepMind’s GPT‑like Leap
Machine Heart
Machine Heart
Apr 24, 2026 · Artificial Intelligence

DeepSeek V4 Unveiled: Dual Versions with 1M Token Context and New Mixed‑Attention Architecture

DeepSeek V4 launches two models—Flash and Pro—both supporting up to 1 million token context and 384 K output tokens, offering non‑thinking and thinking modes with a reasoning_effort parameter, and featuring mixed attention, manifold‑constrained hyperconnections, a Muon optimizer, massive training data, and up to 73% FLOPs reduction versus V3.

AI modelCambriconDeepSeek V4
0 likes · 5 min read
DeepSeek V4 Unveiled: Dual Versions with 1M Token Context and New Mixed‑Attention Architecture
Machine Heart
Machine Heart
Apr 23, 2026 · Artificial Intelligence

First Survey of Attention Sink: From Utilization and Understanding to Elimination in Transformers

This survey reviews over 180 papers on the Attention Sink phenomenon in Transformers, outlining its three-stage evolution—from early exploitation to mechanistic interpretation and finally strategic mitigation—while detailing utilization tactics, theoretical explanations, removal techniques, and promising future research directions.

Attention SinkMitigationModel Interpretability
0 likes · 9 min read
First Survey of Attention Sink: From Utilization and Understanding to Elimination in Transformers
Machine Heart
Machine Heart
Apr 23, 2026 · Artificial Intelligence

DeepSeek Unveils Tile Kernels and DeepEP V2 – Is V4 on the Horizon?

DeepSeek recently opened the Tile Kernels repository and released DeepEP V2, detailing new GPU kernel features, a fully JIT-enabled expert parallelism redesign that boosts peak performance by up to 1.3× while cutting SM usage fourfold, and hinting at an upcoming V4 release.

DeepEP V2DeepSeekExpert Parallelism
0 likes · 6 min read
DeepSeek Unveils Tile Kernels and DeepEP V2 – Is V4 on the Horizon?
Machine Heart
Machine Heart
Apr 23, 2026 · Artificial Intelligence

UniLS: End-to-End Audio-Driven Framework Eliminates the ‘Poker Face’ in Digital Human Dialogue

UniLS, the first end‑to‑end audio‑driven framework that jointly generates speaking and listening facial motions for digital humans, achieves state‑of‑the‑art speaking accuracy, improves listening naturalness by 44.1 %, and runs at over 500 FPS, as demonstrated on the CVPR 2026‑accepted paper with extensive quantitative and user studies.

CVPR 2026audio-driven animationdigital humans
0 likes · 9 min read
UniLS: End-to-End Audio-Driven Framework Eliminates the ‘Poker Face’ in Digital Human Dialogue
Machine Heart
Machine Heart
Apr 23, 2026 · Artificial Intelligence

Google's TPU 8t and 8i: Training Powerhouse vs. Inference Specialist

Google unveiled its eighth‑generation TPU line at Cloud Next 2026, introducing the training‑focused TPU 8t with a 2.7× performance boost and massive scaling, and the inference‑optimized TPU 8i featuring three‑times more on‑chip SRAM and an 80% performance uplift for agentic AI workloads, while positioning the chips as a complement—not a replacement—to Nvidia's offerings.

AI hardwareAgentic AIGoogle Cloud
0 likes · 9 min read
Google's TPU 8t and 8i: Training Powerhouse vs. Inference Specialist
Machine Heart
Machine Heart
Apr 23, 2026 · Artificial Intelligence

FlowWAM Leads WorldArena: Chinese Embodied World Model Wins Dual First Place

The newly released FlowWAM model from China’s Institute of Computing Technology tops the WorldArena embodied world‑model benchmark, securing first place in both Physics Adherence and 3D Accuracy, and demonstrates a shift from visual rendering toward true spatial understanding for robotics.

3D AccuracyFlowWAMPhysics Adherence
0 likes · 5 min read
FlowWAM Leads WorldArena: Chinese Embodied World Model Wins Dual First Place