Speech Interface (Faster Whisper)
Summary
This MCP server implementation provides voice interaction capabilities for AI assistants, enabling speech-to-text and text-to-speech functionality. It uses faster-whisper for improved speech recognition performance and integrates with PyAudio for audio processing. The server offers a simplified API for starting conversations and replying to user input, making it suitable for applications requiring natural language voice interfaces with AI models.
Available Actions(3)
narrate_conversation
Generate audio files with multiple voices for stories and dialogues. Parameters: script (string), output_path (string), script_format (string)
narrate
Convert text directly to speech. Parameters: text (string), output_path (string) or text_file_path (string)
transcribe
Transcribe speech from various audio and video formats. Parameters: file_path (string), include_timestamps (optional boolean), detect_speakers (optional boolean)
커뮤니티 리뷰
아직 리뷰가 없습니다. 첫 번째 리뷰를 작성해 보세요!
대화에 참여하려면 로그인하세요