Overview Leading voice AI frameworks power realistic, fast, and scalable conversational agents across enterprise, consumer, ...
NVIDIA doubles down on open speech AI with ultra-low-latency automatic speech recognition and multilingual text-to-speech models.
Google introduces MedASR, an open-weight medical speech-to-text model positioned as a foundational layer for healthcare AI ...
Chatterbox local TTS ElevenLabs Alternative adds markup cues for pauses, laughter, and emphasis, giving precise control over ...
Pipit is a free Mac dictation app that works offline. It can be used to do more than just transcribe speech—it can launch apps, toggle settings, and even launch a web search or query an AI service.
Pocket TTS is an open-source text-to-speech model that runs on CPUs, clones voices from 5 seconds of audio, and keeps voice ...
This week's stories show how fast attackers change their tricks, how small mistakes turn into big risks, and how the same old ...
The release of the open-source AI models marks the next step in the Mountain View-based tech giant's push in the healthcare ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results