← Back to home

Voice

50 items tagged with this topic

Recent

Older

Official Sourcesfrom OpenAI News

What Codex unlocks for Notion

How Notion uses Codex to one-shot specs, build AI Voice Input for the web, and multiply engineering power across small teams.

Podcasts & Newslettersfrom Import AI

Import AI 458: Reckoning with the future; and a singularity story

Welcome to Import AI, a newsletter about AI research. Import AI runs on arXiv, cappuccinos, and feedback from readers. If you’d like to support this, please subscribe. This issue consists of a lengthy essay based on a speech I recently gav…

Podcasts & Newslettersfrom Training Data

Suno's Mikey Shulman: Everyone Can Make Music Now

Speaker 1 | 00:00 - 00:25 Before Suno, basically everybody was a consumer of music. You know, compared to the 8,000,000,000 people on the planet, there are very few people who make music and the rest of us consume it. The crazy thing about…

Official Sourcesfrom OpenAI News

Parloa builds service agents customers want to talk to

Parloa leverages OpenAI models to power scalable, voice-driven AI customer service agents, enabling enterprises to design, simulate, and deploy reliable, real-time interactions.

Official Sourcesfrom Microsoft Research Blog

Microsoft at NSDI 2026: Advances in large-scale networked systems

Microsoft researchers share advances in building and operating large-scale distributed systems, spanning datacenters, networking, and the growing intersection with AI during NSDI ’26. The post Microsoft at NSDI 2026: Advances in large-scal…

Podcasts & Newslettersfrom Training Data

OpenAI's Greg Brockman: Why Human Attention Is the New Bottleneck

Speaker 1 | 00:02 - 00:24 So Greg, thank you for coming back here. I don't think we ever charge you for rent. So maybe I'll send you an invoice later. But Greg, you've been part of like two really spectacular companies, Stripe as employee…

Official Sourcesfrom Mistral AI Blog

Speaking of Voxtral | Mistral AI

Voxtral TTS: A frontier, open-weights text-to-speech model that’s fast, instantly adaptable, and produces lifelike speech for voice agents.