Speech Interface (Faster Whisper)

Summary

This MCP server gives AI assistants voice interaction capabilities: speech-to-text and text-to-speech. It uses faster-whisper for speech recognition and PyAudio for audio processing. The server exposes a small API for starting conversations and replying to user input, which makes it suitable for applications that need a natural-language voice interface to an AI model.

Available Actions (3)

narrate_conversation

Generate audio files with multiple voices for stories and dialogues. Parameters: script (string: path to JSON or Markdown script), output_path (string: path to save the output audio), script_format (string: 'json' or 'markdown').
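
As a rough illustration of how a client might invoke this action, the sketch below builds an MCP tools/call request (JSON-RPC 2.0) for narrate_conversation. The helper name, the file names, and the client-side check on script_format are assumptions made for this example; the listing does not describe the script schema or the shape of the server's response.

```python
import json

# Formats named in the action description above.
ALLOWED_FORMATS = {"json", "markdown"}


def narrate_conversation_request(script, output_path, script_format, request_id=1):
    """Build a JSON-RPC 2.0 `tools/call` payload for narrate_conversation.

    Hypothetical helper: it only assembles the request, it does not send it.
    """
    if script_format not in ALLOWED_FORMATS:
        raise ValueError(f"script_format must be one of {sorted(ALLOWED_FORMATS)}")
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {
            "name": "narrate_conversation",
            "arguments": {
                "script": script,
                "output_path": output_path,
                "script_format": script_format,
            },
        },
    }


if __name__ == "__main__":
    # "story.json" and "story.mp3" are placeholder paths.
    request = narrate_conversation_request("story.json", "story.mp3", "json")
    print(json.dumps(request, indent=2))
```

Validating script_format before sending keeps an obviously bad value from reaching the server, though the server may apply its own, different checks.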

narrate

Convert text directly to speech. Parameters: text (string: the text to convert), output_path (string: path to save the output audio). Alternatively, use text_file_path (string: path to a file containing text).
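
The description offers two ways to supply the text. A minimal sketch of assembling the arguments is shown below; it assumes that text and text_file_path are mutually exclusive and that output_path is required in both cases, neither of which the listing states outright. The function name is hypothetical.

```python
def narrate_arguments(output_path, text=None, text_file_path=None):
    """Build the `arguments` object for a narrate call.

    Assumption: exactly one of `text` or `text_file_path` should be given.
    """
    if (text is None) == (text_file_path is None):
        raise ValueError("provide exactly one of text or text_file_path")
    args = {"output_path": output_path}
    if text is not None:
        args["text"] = text
    else:
        args["text_file_path"] = text_file_path
    return args


if __name__ == "__main__":
    # Placeholder values for illustration only.
    print(narrate_arguments("hello.mp3", text="Hello from the assistant."))
    print(narrate_arguments("chapter.mp3", text_file_path="chapter1.txt"))
```

The resulting dictionary would go in the params.arguments field of an MCP tools/call request whose params.name is "narrate".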

transcribe

Transcribe speech from various audio and video formats. Parameters: file_path (string: path to the audio/video file), include_timestamps (optional boolean), detect_speakers (optional boolean).
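
Because include_timestamps and detect_speakers are optional, a client can simply leave them out when it has no preference. The sketch below shows one way to do that; the helper name is hypothetical, and the server's defaults for omitted flags are not documented in this listing.

```python
def transcribe_arguments(file_path, include_timestamps=None, detect_speakers=None):
    """Build the `arguments` object for a transcribe call.

    Optional flags left as None are omitted so the server's own defaults apply.
    """
    args = {"file_path": file_path}
    optional = {
        "include_timestamps": include_timestamps,
        "detect_speakers": detect_speakers,
    }
    for name, value in optional.items():
        if value is not None:
            args[name] = bool(value)
    return args


if __name__ == "__main__":
    # "meeting.mp4" is a placeholder path.
    print(transcribe_arguments("meeting.mp4", include_timestamps=True))
```

As with the other actions, this dictionary would be sent as params.arguments in an MCP tools/call request, here with params.name set to "transcribe".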

Last Updated: April 17, 2025

Language

TypeScript
