
Claude Finally Gets Voice: Anthropic Adds Speech to Its AI Assistant

Anthropic has introduced a voice mode for Claude that lets English-language users switch between speaking and typing and choose from five voice personalities. Also in this issue: the new 3D AI startup SpAItial showcases photorealistic room generation, and researchers present INTUITOR, a confidence‑driven training method that improves AI reasoning.

ShiZhen AI

May 28, 2025


Anthropic announced a voice mode for its Claude mobile app, making the company one of the last major AI labs to offer natural speech interaction. The feature will roll out to English-language users in the coming weeks and runs on the new Claude Sonnet 4 model. Users can switch between speaking and typing within a conversation, choose from five voice personalities, and see real‑time transcription. Voice mode also integrates with Google Workspace, so paid subscribers can issue voice commands that access Calendar, Docs, and Gmail. Free users receive 20‑30 voice messages per month, while paid tiers have substantially higher limits.

Importance: With all leading labs now providing voice capabilities, competition shifts to execution factors such as latency, integration depth, and underlying model quality. On all three, these assistants hold a clear advantage over legacy assistants like Siri.

Postman’s Agent Generator offers a turnkey infrastructure that eliminates server setup, allowing developers to instantly build and deploy AI agents. It supports immediate workflow launch, compatibility with OpenAI and LangChain, and testing, debugging, and deployment directly within Postman.

Synthesia co‑founder Matthias Niessner launched SpAItial, a startup focused on spatial foundation models (SFMs) that natively understand 3D space, geometry, physics, and material properties. The founding team includes former leaders from Synthesia, Google, and Meta, bringing expertise in 3D AI and neural rendering. Early demos generate photorealistic 3D rooms from simple text prompts, targeting games, architecture, VR, and robotics.

Importance: While AI has mastered 2D image and video generation, creating coherent, spatially aware 3D worlds remains challenging; such models could let anyone craft complex virtual environments with a few sentences, representing a next frontier for AI.

Researchers from UC Berkeley and Yale introduced INTUITOR, a training method that measures a model’s confidence in each token it generates and uses this “intuition” as a learning signal. Unlike traditional training, which requires correct answers to score against, INTUITOR rewards responses the model itself is confident in. On math benchmarks it matches conventional methods, while on programming tasks it outperforms them. The trained models also exhibit human‑like reasoning behaviors such as problem decomposition, planning, and step‑by‑step explanation.

Importance: The study shows that self‑directed confidence‑driven learning can succeed without explicit correct answers, offering value for domains where ground‑truth data or expert knowledge is limited.
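
To make the idea of a confidence-based reward concrete, here is a minimal Python sketch. The summary above does not specify how confidence is measured, so the metric used here is an assumption chosen for illustration: the KL divergence from a uniform distribution to the model's next-token distribution, averaged over the tokens of a response. The function names and the toy logits are hypothetical and do not come from the researchers' code.

```python
import math

def log_softmax(logits):
    """Numerically stable log-softmax over one token's logits."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_z for x in logits]

def token_confidence(logits):
    """Assumed confidence metric: KL(U || p), where U is uniform over the
    vocabulary and p is the softmax of the logits. It is 0 when the model
    is maximally unsure and grows as the distribution becomes more peaked."""
    log_probs = log_softmax(logits)
    vocab_size = len(log_probs)
    # KL(U || p) = -log(V) - (1/V) * sum_i log p_i
    return -math.log(vocab_size) - sum(log_probs) / vocab_size

def sequence_confidence(per_token_logits):
    """Average per-token confidence over one generated response."""
    scores = [token_confidence(logits) for logits in per_token_logits]
    return sum(scores) / len(scores)

# Toy example: two candidate responses, each two tokens long, over a
# four-word vocabulary. No reference answer is consulted anywhere.
confident_response = [[6.0, 0.0, 0.0, 0.0], [0.0, 5.0, 0.0, 0.0]]
hesitant_response = [[1.0, 0.8, 0.9, 1.1], [0.5, 0.4, 0.6, 0.5]]

rewards = {
    "confident": sequence_confidence(confident_response),
    "hesitant": sequence_confidence(hesitant_response),
}
best = max(rewards, key=rewards.get)
```

In a reinforcement-learning loop, scores like these would stand in for the correctness-based reward, so responses the model is more certain about get reinforced. How the actual method normalizes or combines such scores is not covered by this sketch.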

Quick browse of other AI news:

Claude Code – Anthropic’s agentic coding tool is now generally available.

Nemotron AceReason – Nvidia’s new math and code reasoning model.

Llama‑Factory – Open‑source LLM fine‑tuning without writing code.

OpusClip Thumbnail – One‑click AI thumbnail generator.

Meta is reorganizing its AI division into product‑focused and AGI‑focused teams.

Anthropic’s Claude Sonnet 4 set a new state of the art on the ARC‑AGI‑2 benchmark.

DeepMind previewed SignGemma, a model that translates sign language to text.

Salesforce acquired Informatica for $8 billion to strengthen its data‑management platform.

The Browser Company announced it will cease Arc development and focus on the AI‑first Dia browser.

