Kokoro Speech
Summary
Kokoro Speech MCP Server provides text-to-speech capabilities using the Kokoro TTS model, letting AI assistants generate and play audio from text input. It exposes tools for basic speech synthesis, adjustable playback speed, and voice selection from a curated list of voices. Model initialization failures are retried automatically, and audio is played back from managed temporary files. The server runs over stdio transport for integration with Claude Desktop, so it can add natural-sounding voice output to assistant conversations without requiring external applications.
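The retry and temporary-file behavior described in the summary can be illustrated with a minimal Python sketch. This is not the server's actual code: the helper names init_with_retry and with_temp_audio, the attempt count, the delay, and the .wav suffix are all assumptions chosen for illustration.

```python
import os
import tempfile
import time


def init_with_retry(load_model, attempts=3, delay=1.0):
    """Call load_model() until it succeeds or the attempts run out.

    load_model is a hypothetical callable standing in for whatever
    loads the TTS model; attempts and delay are illustrative defaults.
    """
    last_error = None
    for attempt in range(1, attempts + 1):
        try:
            return load_model()
        except Exception as exc:  # a real server would catch narrower errors
            last_error = exc
            if attempt < attempts:
                time.sleep(delay)  # wait before the next attempt
    raise RuntimeError(
        f"model initialization failed after {attempts} attempts"
    ) from last_error


def with_temp_audio(audio_bytes, play):
    """Write audio to a temporary file, pass its path to play(), then delete it.

    play is a hypothetical callable that plays the file at the given path.
    """
    fd, path = tempfile.mkstemp(suffix=".wav")
    try:
        with os.fdopen(fd, "wb") as handle:
            handle.write(audio_bytes)
        return play(path)
    finally:
        os.remove(path)  # clean up even if playback raises
```

The try/finally in with_temp_audio is the point of the sketch: the temporary file is removed whether playback succeeds or fails, so repeated calls do not leave audio files behind.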
Available Actions (4)
text_to_speech
Converts text to speech using the default settings. Parameters: text (string), voice (optional string)
text_to_speech_with_options
Converts text to speech with customizable parameters. Parameters: text (string), voice (optional string), speed (optional float, range: 0.5 to 2.0)
list_voices
Lists all available voices for text-to-speech.
get_model_status
Checks the current initialization status of the TTS model. No parameters required.
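As a rough illustration of how a client might invoke these actions, the Python sketch below builds an MCP tools/call request (JSON-RPC 2.0) for text_to_speech_with_options and checks the documented speed range before sending. The helper name build_tool_call and the voice value "example-voice" are placeholders, not names taken from the server; the real voice names come from list_voices.

```python
import json


def build_tool_call(request_id, text, voice=None, speed=None):
    """Build a JSON-RPC 2.0 'tools/call' request for text_to_speech_with_options.

    Optional arguments are left out of the request when not given.
    """
    arguments = {"text": text}
    if voice is not None:
        arguments["voice"] = voice
    if speed is not None:
        # The listing documents speed as a float in the range 0.5 to 2.0.
        if not 0.5 <= speed <= 2.0:
            raise ValueError("speed must be between 0.5 and 2.0")
        arguments["speed"] = float(speed)
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {
            "name": "text_to_speech_with_options",
            "arguments": arguments,
        },
    }


# "example-voice" is a placeholder; use a name returned by list_voices.
request = build_tool_call(1, "Hello from Kokoro", voice="example-voice", speed=1.25)
line = json.dumps(request)  # one JSON message on a single line
print(line)
```

In practice an MCP client library or Claude Desktop constructs and sends these messages over stdio itself; the sketch only shows the shape of the request and where the parameter check could sit.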