Kokoro Speech
Summary
Koroko Speech MCP Server provides text-to-speech capabilities using the Kokoro TTS model, enabling AI assistants to generate and play audio from text input. The implementation offers multiple tools for basic speech synthesis, customizable playback speed, and voice selection from a curated list of high-quality voices. It features robust error handling with automatic retry mechanisms for model initialization failures and temporary file management for audio playback. The server runs on stdio transport for seamless integration with Claude Desktop, making it particularly valuable for adding natural-sounding voice output to AI assistant conversations without requiring external applications.
Available Actions(4)
text_to_speech
Converts text to speech using the default settings. Parameters: text (string), voice (optional string)
text_to_speech_with_options
Converts text to speech with customizable parameters. Parameters: text (string), voice (optional string), speed (optional float, 0.5 to 2.0)
list_voices
Lists all available voices for text-to-speech. No parameters required.
get_model_status
Check the current status of the TTS model initialization. No parameters required.
Community Reviews
No reviews yet. Be the first to review!
Sign in to join the conversation