Speech Interface (Faster Whisper)

by kvadratni

29.8k

Summary

This MCP server gives AI assistants voice interaction through speech-to-text and text-to-speech. It uses faster-whisper for improved speech recognition performance and PyAudio for audio processing. The server exposes a simplified API for starting conversations and replying to user input, which suits applications that need a natural-language voice interface to AI models.

Available Actions (3)

narrate_conversation

Generate audio files for a conversation from a script supplied in JSON or Markdown format. Parameters: script (string), output_path (string), script_format (string)
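
A minimal sketch of how a client might invoke this action, assuming the server is reached through the standard MCP `tools/call` JSON-RPC method. The parameter names come from the listing above; the `"markdown"` value for `script_format`, the script text, the output path, and the `buildToolCall` helper are illustrative assumptions, since the listing does not document the accepted values or the script schema.

```typescript
// Parameters as listed for narrate_conversation; all three are strings.
interface NarrateConversationArgs {
  script: string;
  output_path: string;
  script_format: string; // listing mentions JSON or Markdown; exact spelling is an assumption
}

// Shape of an MCP tools/call request (JSON-RPC 2.0).
interface ToolCallRequest<A> {
  jsonrpc: "2.0";
  id: number;
  method: "tools/call";
  params: { name: string; arguments: A };
}

// Hypothetical helper: wrap tool arguments in a tools/call envelope.
function buildToolCall<A>(id: number, name: string, args: A): ToolCallRequest<A> {
  return { jsonrpc: "2.0", id, method: "tools/call", params: { name, arguments: args } };
}

const narrateConversationRequest = buildToolCall<NarrateConversationArgs>(
  1,
  "narrate_conversation",
  {
    script: "Alice: Hello!\nBob: Hi there.", // placeholder; the real script schema is not documented here
    output_path: "conversation.wav",          // hypothetical path
    script_format: "markdown",                // assumed value
  },
);

console.log(JSON.stringify(narrateConversationRequest, null, 2));
```

In practice an MCP client library would normally build and send this envelope; the sketch only shows where the three listed parameters end up.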

narrate

Convert text to speech, taking the text either directly or from a text file. Parameters: text (string, optional), text_file_path (string, optional), output_path (string)
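
Because both `text` and `text_file_path` are marked optional, a caller may want to check its arguments before sending them. The sketch below assumes that exactly one of the two should be supplied; the listing does not state this, and the server may behave differently. `checkNarrateArgs` is a hypothetical client-side helper, not part of the server.

```typescript
// Parameters as listed for narrate.
interface NarrateArgs {
  text?: string;
  text_file_path?: string;
  output_path: string;
}

// Return a list of problems; an empty list means the arguments look usable.
// The "exactly one source" rule is an assumption, not documented behavior.
function checkNarrateArgs(args: NarrateArgs): string[] {
  const problems: string[] = [];
  const hasText = typeof args.text === "string" && args.text.length > 0;
  const hasFile =
    typeof args.text_file_path === "string" && args.text_file_path.length > 0;

  if (!hasText && !hasFile) problems.push("provide either text or text_file_path");
  if (hasText && hasFile) problems.push("provide only one of text or text_file_path");
  if (args.output_path.length === 0) problems.push("output_path is required");
  return problems;
}

// Direct text input with a hypothetical output path.
console.log(checkNarrateArgs({ text: "Hello, world.", output_path: "hello.wav" }));
```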

transcribe

Transcribe speech from various audio and video formats. Parameters: file_path (string), include_timestamps (boolean, optional), detect_speakers (boolean, optional)
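
The two boolean flags are optional, so one reasonable approach is to send them only when the caller sets them and otherwise leave the server's defaults in place. The sketch below shows that pattern; `transcribeArgs`, its option names, and the file name are hypothetical, and the server's actual defaults are not stated in the listing.

```typescript
// Parameters as listed for transcribe.
interface TranscribeArgs {
  file_path: string;
  include_timestamps?: boolean;
  detect_speakers?: boolean;
}

// Hypothetical helper: build the argument object, omitting flags the caller did not set.
function transcribeArgs(
  filePath: string,
  options: { timestamps?: boolean; speakers?: boolean } = {},
): TranscribeArgs {
  const args: TranscribeArgs = { file_path: filePath };
  if (options.timestamps !== undefined) args.include_timestamps = options.timestamps;
  if (options.speakers !== undefined) args.detect_speakers = options.speakers;
  return args;
}

console.log(JSON.stringify(transcribeArgs("meeting.mp4", { timestamps: true })));
// prints {"file_path":"meeting.mp4","include_timestamps":true}
```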

Last Updated: April 17, 2025

Documentation

View GitHub Repository

Language

TypeScript
