A server providing text-to-speech and speech-to-text functionalities using Windows' native speech services without external dependencies.
暂无评论. 成为第一个评论的人!
登录以参与讨论
Converts text to speech using Windows SAPI. Parameters: text (required) - The text to convert to speech; voice (optional) - The voice to use (e.g., 'Microsoft David Desktop'); speed (optional) - Speech rate from 0.5 to 2.0 (default: 1.0).
Records audio and converts it to text using Windows Speech Recognition. Parameters: duration (optional) - Recording duration in seconds (default: 5, max: 60).