This MCP server implementation provides voice interaction capabilities for AI assistants, enabling speech-to-text and text-to-speech functionality. It uses faster-whisper for improved speech recognition performance and integrates with PyAudio for audio processing. The server offers a simplified API for starting conversations and replying to user input, making it suitable for applications requiring natural language voice interfaces with AI models.
Narrate a conversation using a script in JSON or Markdown format. Parameters: script (string), output_path (string), script_format (string - 'json' or 'markdown')
Convert text to speech, either from a string passed directly or from a text file. Parameters: text (string), text_file_path (string - optional), output_path (string)
Transcribe speech from audio or video files. Parameters: file_path (string), include_timestamps (boolean - optional), detect_speakers (boolean - optional)
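As a rough illustration of how a client might invoke one of the tools above, the sketch below builds an MCP `tools/call` request (JSON-RPC 2.0) for the transcription tool. The listing does not give the tool identifiers, so the name `transcribe` is an assumption, as are the helper `build_tool_call` and the file name `meeting.mp4`. Only the parameter names come from the descriptions above.

```python
import json
from itertools import count

# Simple incrementing request ids, as JSON-RPC expects a unique id per request.
_ids = count(1)


def build_tool_call(name, arguments):
    """Build a JSON-RPC 2.0 `tools/call` request, omitting optional arguments left as None."""
    return {
        "jsonrpc": "2.0",
        "id": next(_ids),
        "method": "tools/call",
        "params": {
            "name": name,
            "arguments": {k: v for k, v in arguments.items() if v is not None},
        },
    }


# "transcribe" is a hypothetical tool name; the parameter names are from the listing.
request = build_tool_call(
    "transcribe",
    {
        "file_path": "meeting.mp4",   # hypothetical input file
        "include_timestamps": True,   # optional
        "detect_speakers": None,      # optional, left unset here
    },
)

print(json.dumps(request, indent=2))
```

An MCP client library would normally build and send this message itself. The dictionary is shown only to make the parameter mapping concrete, and the same helper would work for the narration and text-to-speech tools with their respective parameters.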