AI Awesome
Home
Discover the Best AI Tools
Your ultimate directory for finding the right artificial intelligence solutions for any task.
Search
Voice Generation & Conversion
(52)
S
free
Spix-HQ/spix-mcp
Spix-MCP provides AI agents with real phone numbers and voice capabilities for outbound and inbound calls, email, and contact management.
AI Voice Generator
Real phone numbers for AI agents
Outbound and inbound call handling
Email management
+2
F
open source
fasuizu-br/brainiall-mcp-server
AI-powered speech tools for pronunciation assessment, speech-to-text, and text-to-speech.
AI Text-to-Speech
pronunciation assessment
speech-to-text with language detection
text-to-speech with multiple voices
S
unknown
samson-art/transcriptor-mcp
Transcriptor MCP is an AI tool for generating transcripts and metadata from audio content.
AI Transcriber
audio transcription
metadata generation
AI-powered analysis
J
unknown
j3k0/speech.sh
A tool that allows an agent to speak things out loud and notify when done working with a summary.
AI Text-to-Speech
text-to-speech functionality
notification system
L
unknown
Leximo-AI/leximo-ai-call-assistant-mcp-server
An AI-powered system that makes phone calls on behalf of users for tasks like booking reservations and scheduling appointments.
AI Voice Assistants
AI-powered phone calls
booking reservations
scheduling appointments
+1
I
unknown
Introducing Voicebox
Voicebox is a generative AI model for speech that generalizes across tasks with state-of-the-art performance.
AI Text-to-Speech
generative AI for speech
state-of-the-art performance
cross-task generalization
V
open source
voicetest
An open-source test harness for voice agents with support for Retell, VAPI, Bland, and LiveKit.
AI Voice Generator
Run autonomous simulations
Evaluate with LLM judges
�
open source
🎙️ OpenSource Voice Dictation Agent (like Wispr Flow
An open-source voice dictation agent for converting speech to text.
AI Speech Recognition
speech recognition
dictation
text conversion
Y
open source
ybouhjira/claude-code-tts
A Python library for generating speech from text using Claude's TTS model.
AI Text-to-Speech
text-to-speech conversion
Claude model integration
customizable speech output
T
open source
transcribe-app/mcp-transcribe
A tool for transcribing audio files into text.
AI Transcription
audio transcription
text conversion
voice recognition
M
open source
mberg/kokoro-tts-mcp
Kokoro-TTS-MCP is an open-source text-to-speech system designed for generating natural-sounding speech from text.
AI Text-to-Speech
text-to-speech conversion
natural voice generation
open-source software
M
open source
mbailey/voice-mcp
A tool for generating voice from text using AI.
AI Text-to-Speech
text-to-speech conversion
voice cloning
multiple voice options
O
open source
ovlabs/mcp-server-originalvoices
MCP-Server is an open-source project for generating original voices from text.
AI Text-to-Speech
Text-to-speech conversion
Supports multiple languages
Customizable voice parameters
V
open source
Vaibhavs10/insanely-fast-whisper
A fast and efficient implementation of the Whisper model for text-to-speech conversion.
AI Text-to-Speech
high-quality speech synthesis
fast processing speed
open-source model
S
open source
shashikg/WhisperS2T
WhisperS2T is an open-source tool for speech-to-text conversion using the Whisper model.
AI Speech-to-Text
accurate transcription
supports multiple languages
command-line interface
G
open source
ggerganov/whisper.cpp
whisper.cpp is an open-source library for real-time speech recognition and text-to-speech conversion.
AI Text-to-Speech
real-time speech recognition
text-to-speech conversion
open-source
I
unknown
Introducing Universal-1
Universal-1 is a new speech recognition model introduced by AssemblyAI.
AI Speech Recognition
high accuracy
real-time processing
multi-language support
S
unknown
Speech Studio - Microsoft Azure
Speech Studio - Microsoft Azure is a tool for converting text to speech using AI.
AI Text-to-Speech
converts text to natural-sounding speech
integrates with Microsoft Azure
customizable voice options
�
unknown
🔥] [Eleven Labs Beta
Eleven Labs Beta is an AI tool that specializes in text-to-speech capabilities.
AI Text-to-Speech
laughing AI voices
high-quality speech synthesis
beta version
A
unknown
AI Voice Generator
AI Voice Generator is a tool that converts text into speech using artificial intelligence.
AI Voice Generator
text-to-speech conversion
multiple voice options
natural-sounding audio
V
unknown
Voice-Swap
Voice-Swap.ai is a platform for voice generation and conversion.
AI Voice Changer
voice changer
audio manipulation
P
open source
p0n1/epub_to_audiobook
A tool that converts text to audiobook format using AI.
AI Text-to-Speech
epub conversion
text-to-speech conversion
audiobook generation
P
open source
Parler-TTS
Parler-TTS is an open-source text-to-speech AI model that generates high-quality speech from text.
AI Text-to-Speech
high-quality speech synthesis
open-source model
text-to-speech conversion
C
unknown
COVAL
COVAL provides AI-powered text-to-speech solutions for voice generation and conversion.
AI Text-to-Speech
high-quality voice synthesis
multiple voice options
easy integration
Previous
Page 1 of 3
Next