Speech Interface (Faster Whisper)
Summary
This MCP server implementation provides voice interaction capabilities for AI assistants, enabling speech-to-text and text-to-speech functionality. It uses faster-whisper for improved speech recognition performance and integrates with PyAudio for audio processing. The server offers a simplified API for starting conversations and replying to user input, making it suitable for applications requiring natural language voice interfaces with AI models.
Available Actions(3)
narrate
Convert text directly to speech. Parameters: text (string), output_path (string). Alternatively, convert text from a file using text_file_path (string).
narrate_conversation
Generate audio files with multiple voices for stories and dialogues. Parameters: script (string), output_path (string), script_format (string).
transcribe
Transcribe speech from various audio and video formats. Parameters: file_path (string), include_timestamps (optional boolean), detect_speakers (optional boolean).
コミュニティレビュー
まだレビューはありません. 最初のレビューを投稿しましょう!
会話に参加するにはサインインしてください