Tagged articles
2 articles
Page 1 of 1
AI Engineering
AI Engineering
May 13, 2026 · Artificial Intelligence

First End‑to‑End Voice Agent Benchmark Shows Grok Leads with 52% Real‑World Success Rate

Artificial Analysis released the τ‑Voice benchmark, testing speech‑to‑speech agents across 278 real‑world customer‑service scenarios, and found the top‑performing Grok Voice Think Fast 1.0 achieves only a 52.1% task‑completion rate while average dialogue lengths stay under seven minutes.

BenchmarkGrok Voicespeech-to-speech
0 likes · 7 min read
First End‑to‑End Voice Agent Benchmark Shows Grok Leads with 52% Real‑World Success Rate
AI Engineering
AI Engineering
May 8, 2026 · Artificial Intelligence

How GPT‑Realtime‑2 Leverages GPT‑5‑Level Reasoning to Redefine Voice AI Architecture

OpenAI’s GPT‑Realtime‑2 embeds GPT‑5‑class reasoning into a continuous‑audio loop, achieving 96.6% accuracy on Big Bench Audio, offering adjustable inference intensity with latency from 1.12 s to 2.33 s, a 128 K context window, and demonstrable gains in real‑world call success rates, while prompting industry debate over pricing and competitive impact.

GPT-5GPT-Realtime-2Latency
0 likes · 5 min read
How GPT‑Realtime‑2 Leverages GPT‑5‑Level Reasoning to Redefine Voice AI Architecture