AI Awesome
Home
Discover the Best AI Tools
Your ultimate directory for finding the right artificial intelligence solutions for any task.
Search
Voice Generation & Conversion
(49)
J
unknown
j3k0/speech.sh
A tool that allows an agent to speak things out loud and notify when done working with a summary.
AI Text-to-Speech
text-to-speech functionality
notification system
L
unknown
Leximo-AI/leximo-ai-call-assistant-mcp-server
An AI-powered system that makes phone calls on behalf of users for tasks like booking reservations and scheduling appointments.
AI Voice Assistants
AI-powered phone calls
booking reservations
scheduling appointments
+1
I
unknown
Introducing Voicebox
Voicebox is a generative AI model for speech that generalizes across tasks with state-of-the-art performance.
AI Text-to-Speech
generative AI for speech
state-of-the-art performance
cross-task generalization
V
open source
voicetest
An open-source test harness for voice agents with support for Retell, VAPI, Bland, and LiveKit.
AI Voice Generator
Run autonomous simulations
Evaluate with LLM judges
�
open source
🎙️ OpenSource Voice Dictation Agent (like Wispr Flow
An open-source voice dictation agent for converting speech to text.
AI Speech Recognition
speech recognition
dictation
text conversion
Y
open source
ybouhjira/claude-code-tts
A Python library for generating speech from text using Claude's TTS model.
AI Text-to-Speech
text-to-speech conversion
Claude model integration
customizable speech output
T
open source
transcribe-app/mcp-transcribe
A tool for transcribing audio files into text.
AI Transcription
audio transcription
text conversion
voice recognition
M
open source
mberg/kokoro-tts-mcp
Kokoro-TTS-MCP is an open-source text-to-speech system designed for generating natural-sounding speech from text.
AI Text-to-Speech
text-to-speech conversion
natural voice generation
open-source software
M
open source
mbailey/voice-mcp
A tool for generating voice from text using AI.
AI Text-to-Speech
text-to-speech conversion
voice cloning
multiple voice options
O
open source
ovlabs/mcp-server-originalvoices
MCP-Server is an open-source project for generating original voices from text.
AI Text-to-Speech
Text-to-speech conversion
Supports multiple languages
Customizable voice parameters
V
open source
Vaibhavs10/insanely-fast-whisper
A fast and efficient implementation of the Whisper model for text-to-speech conversion.
AI Text-to-Speech
high-quality speech synthesis
fast processing speed
open-source model
S
open source
shashikg/WhisperS2T
WhisperS2T is an open-source tool for speech-to-text conversion using the Whisper model.
AI Speech-to-Text
accurate transcription
supports multiple languages
command-line interface
G
open source
ggerganov/whisper.cpp
whisper.cpp is an open-source library for real-time speech recognition and text-to-speech conversion.
AI Text-to-Speech
real-time speech recognition
text-to-speech conversion
open-source
I
unknown
Introducing Universal-1
Universal-1 is a new speech recognition model introduced by AssemblyAI.
AI Speech Recognition
high accuracy
real-time processing
multi-language support
S
unknown
Speech Studio - Microsoft Azure
Speech Studio - Microsoft Azure is a tool for converting text to speech using AI.
AI Text-to-Speech
converts text to natural-sounding speech
integrates with Microsoft Azure
customizable voice options
�
unknown
🔥] [Eleven Labs Beta
Eleven Labs Beta is an AI tool that specializes in text-to-speech capabilities.
AI Text-to-Speech
laughing AI voices
high-quality speech synthesis
beta version
A
unknown
AI Voice Generator
AI Voice Generator is a tool that converts text into speech using artificial intelligence.
AI Voice Generator
text-to-speech conversion
multiple voice options
natural-sounding audio
V
unknown
Voice-Swap
Voice-Swap.ai is a platform for voice generation and conversion.
AI Voice Changer
voice changer
audio manipulation
P
open source
p0n1/epub_to_audiobook
A tool that converts text to audiobook format using AI.
AI Text-to-Speech
epub conversion
text-to-speech conversion
audiobook generation
P
open source
Parler-TTS
Parler-TTS is an open-source text-to-speech AI model that generates high-quality speech from text.
AI Text-to-Speech
high-quality speech synthesis
open-source model
text-to-speech conversion
C
unknown
COVAL
COVAL provides AI-powered text-to-speech solutions for voice generation and conversion.
AI Text-to-Speech
high-quality voice synthesis
multiple voice options
easy integration
G
unknown
Github
A tool for generating speech from text using AI technology.
AI Text-to-Speech
text-to-speech conversion
AI-driven voice generation
G
unknown
Github
EmotiVoice is an AI tool for generating emotional and expressive voiceovers from text.
AI Text-to-Speech
text-to-speech conversion
emotionally expressive voices
customizable speech styles
G
open source
Github
A text-to-speech tool for generating voice from text.
AI Text-to-Speech
text-to-speech conversion
voice generation
open-source
Previous
Page 1 of 3
Next