Koroko Speech MCP Server provides text-to-speech capabilities using the Kokoro TTS model, enabling AI assistants to generate and play audio from text input. The implementation offers multiple tools for basic speech synthesis, customizable playback speed, and voice selection from a curated list of high-quality voices. It features robust error handling with automatic retry mechanisms for model initialization failures and temporary file management for audio playback. The server runs on stdio transport for seamless integration with Claude Desktop, making it particularly valuable for adding natural-sounding voice output to AI assistant conversations without requiring external applications.
Converts text to speech using the default settings. Parameters: text (string), voice (optional string)
Converts text to speech with customizable parameters. Parameters: text (string), voice (optional string), speed (optional float, 0.5 to 2.0)
Lists all available voices for text-to-speech. No parameters required.
Check the current status of the TTS model initialization. No parameters required.
No reviews yet. Be the first to review!
Sign in to join the conversation