Kokoro Speech MCP Server provides text-to-speech capabilities using the Kokoro TTS model, letting AI assistants generate and play audio from text input. It exposes tools for basic speech synthesis, synthesis with adjustable playback speed, voice selection from a list of available voices, and checking model initialization status. If model initialization fails, the server retries automatically, and it manages the temporary files it uses for audio playback. The server runs over the stdio transport for integration with Claude Desktop, so assistant conversations can have natural-sounding voice output without a separate external application.
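The retry and temporary-file behavior described above could look roughly like the following. This is a minimal sketch, not the server's actual code: the names `init_with_retry` and `play_via_temp_file`, the attempt count, the linear backoff, and the `.wav` suffix are all assumptions made for illustration.

```python
import os
import tempfile
import time
from typing import Callable, TypeVar

T = TypeVar("T")


def init_with_retry(
    load: Callable[[], T],
    attempts: int = 3,          # assumed default, not taken from the server
    delay_s: float = 0.5,       # assumed base delay between attempts
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call `load` until it succeeds or `attempts` is exhausted."""
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    last_error = None
    for attempt in range(1, attempts + 1):
        try:
            return load()
        except Exception as exc:  # any initialization failure triggers a retry
            last_error = exc
            if attempt < attempts:
                sleep(delay_s * attempt)  # simple linear backoff
    raise RuntimeError(
        f"model initialization failed after {attempts} attempts"
    ) from last_error


def play_via_temp_file(audio: bytes, play: Callable[[str], None]) -> None:
    """Write audio to a temporary file, play it, then always delete the file."""
    fd, path = tempfile.mkstemp(suffix=".wav")  # file format is an assumption
    try:
        with os.fdopen(fd, "wb") as handle:
            handle.write(audio)
        play(path)
    finally:
        os.remove(path)  # clean up even if playback raises
```

Passing the loader and the player in as callables keeps the sketch independent of any particular TTS or audio library; a real server would supply its own model loader and playback function.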
Converts text to speech using the default settings. Parameters: text (string), voice (optional string)
Converts text to speech with customizable parameters. Parameters: text (string), voice (optional string), speed (optional float, range 0.5 to 2.0)
Lists all available voices for text-to-speech. Parameters: None
Checks the current initialization status of the TTS model. Parameters: None
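As an illustration of the parameters listed above, a client might validate arguments and build an MCP `tools/call` request as sketched below. The listing does not give the tool names, so `text_to_speech_with_options` is a hypothetical placeholder, as are the helper names; rejecting empty text is also an assumption. Only the parameter names and the 0.5 to 2.0 speed range come from the descriptions above.

```python
import json
from typing import Optional

MIN_SPEED, MAX_SPEED = 0.5, 2.0  # speed range stated in the tool description


def build_speech_arguments(
    text: str, voice: Optional[str] = None, speed: Optional[float] = None
) -> dict:
    """Validate the documented parameters and return a tool arguments dict."""
    if not isinstance(text, str) or not text.strip():
        # Assumption: empty text is treated as invalid input.
        raise ValueError("text must be a non-empty string")
    arguments = {"text": text}
    if voice is not None:
        arguments["voice"] = voice
    if speed is not None:
        if not MIN_SPEED <= speed <= MAX_SPEED:
            raise ValueError(
                f"speed must be between {MIN_SPEED} and {MAX_SPEED}"
            )
        arguments["speed"] = float(speed)
    return arguments


def build_tool_call(request_id: int, tool_name: str, arguments: dict) -> str:
    """Serialize a JSON-RPC 2.0 `tools/call` request as used by MCP."""
    return json.dumps(
        {
            "jsonrpc": "2.0",
            "id": request_id,
            "method": "tools/call",
            "params": {"name": tool_name, "arguments": arguments},
        }
    )


# Hypothetical tool name; check the server's tool listing for the real one.
request = build_tool_call(
    1,
    "text_to_speech_with_options",
    build_speech_arguments("Hello there", speed=1.25),
)
```

Over the stdio transport, a request like this is written to the server's standard input as a single line of JSON. Validating the speed range on the client side is optional, since the server presumably enforces its own limits.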