
Media
Model Context Protocol servers for processing images, videos, and multimedia content with AI
Speech Interface (Faster Whisper)
Integrates voice interaction capabilities using faster-whisper and PyAudio for speech recognition and synthesis, enabling natural language voice interfaces for AI models.
YouTube
Extract and analyze video subtitle data for content understanding.
YouTube Transcripts
Extract and analyze video captions and subtitles in multiple languages.
Video Editor
Integrates with Video Jungle API to enable AI-powered video editing and content searching through natural language queries and automated clip generation.
Web Fetcher
Fetches and extracts web content using Playwright's headless browser capabilities, delivering clean, readable content from JavaScript-heavy websites in HTML or Markdown format for research and information gathering.
ElevenLabs
Integrates with ElevenLabs to provide high-quality text-to-speech, voice cloning, and conversational capabilities with customizable voice profiles and audio processing features.
Comfy (Stable Diffusion)
Integrates with ComfyUI to enable text-to-image generation using customizable Stable Diffusion workflows.
HuggingFace Spaces Connector
MCP server that seamlessly integrates Hugging Face Spaces with AI assistants, enabling easy access to diverse AI models and tools without manual configuration.
EverArt
Integrates with EverArt API to generate images from text prompts using multiple AI models for creative and visual design tasks.
YouTube Downloader
Integrates with YouTube using yt-dlp to enable downloading of videos and subtitles for content analysis and processing tasks.
Fetch with Images
Integrates web scraping and image processing capabilities to fetch, extract, and optimize web content.
Jina AI
Integrates with Jina AI's web services to enable web content extraction, search, and fact-checking through natural language interactions.
Fetch (Web Content & YouTube Transcripts)
Fetches web content and YouTube video transcripts, converting HTML to Markdown and extracting timestamps for reference in conversations.
YouTube Data
Integrates with YouTube Data API to retrieve and analyze video content, transcripts, channel statistics, and engagement metrics across different regions and categories without leaving the conversation interface.
Email Sender
Enables language models to compose and send emails with attachments through SMTP servers, supporting multiple providers and secure transmission for automated email workflows.
WeChat Moments
Enables publishing content to WeChat Moments on macOS through AppleScript automation and mouse event emulation, providing a server interface for social media management workflows.
Maigret OSINT
OSINT Maigret integration to gather user info across social networks.
Chinese Trends Hub
Provides real-time access to trending topics and content from major Chinese platforms including Weibo, Zhihu, Douyin, Bilibili, Douban, Toutiao, and 36kr through separate tools with temporary caching for improved performance.
X (Twitter)
Interact with X (Twitter) by posting tweets and searching for tweets through the X API.
Replicate Flux
Integrates with Replicate's Flux image generation model, enabling image creation capabilities within conversation interfaces through a simple API token setup and TypeScript implementation available as both an npm module and Docker container.
Strapi CMS
Integrates with Strapi CMS to enable creating, reading, updating, and deleting content entries with support for filtering, pagination, sorting, and media uploads through URI-based resource patterns.
Strapi CMS
Integrates Strapi CMS content into workflows, enabling manipulation of data for content management and querying in Strapi-powered applications.
YouTube Subtitles
Integrates YouTube subtitle retrieval for natural language queries about video content.
PyAutoGUI
Enables automated GUI testing and control across operating systems by wrapping PyAutoGUI to perform mouse movements, keyboard input, screenshot capture, and image recognition tasks.
Redis
Bridge to Redis databases, enabling fast in-memory data operations for AI workflows.
Grok2 Image Generator
Enables AI assistants to generate images through the Grok2 model using stdio transport for seamless integration into existing workflows.
Lightning Nostr
Integrates with Nostr to enable posting notes and interacting with relays, simplifying decentralized social network engagement and content publishing.
Hubble AI (Solana)
Provides a bridge to Solana blockchain data through natural language queries, enabling analytics searches, chart generation, and image downloads for visualizing transaction patterns, price movements, and token distributions.
Document Forge
Integrates document processing libraries to enable extraction, conversion, and manipulation across multiple file formats including PDF, DOCX, HTML, CSV, and EPUB.
Placid Image Generator
Integrates with Placid's API to generate dynamic images from templates for tasks like social media posts and marketing materials.
Eyevinn Open Source Cloud
EyevinnOSC's MCP server enables AI assistants to provision and manage vendor-independent cloud infrastructure for databases, storage, and media processing through an open source API.
FFmpeg Media Tools
Provides a simplified interface for common FFmpeg media operations like video speed adjustment and audio extraction without requiring complex command-line syntax
Miro
Integrates with Miro's collaborative whiteboard platform, providing over 80 tools for managing boards, creating and manipulating various item types, and handling enterprise features for visual collaboration workflows.
Webcam/Screenshot Capture
Enables capturing and analyzing live webcam images and screenshots for real-time visual context in AI applications.
Kokoro Speech
Provides text-to-speech capabilities using the Kokoro TTS model, enabling natural-sounding voice output with customizable playback speed and voice selection through robust error handling and temporary file management.
Face Generator
Generates customizable human face images using thispersondoesnotexist.com, offering options for shape, dimensions, and batch processing for UI prototyping and dataset creation.
X (Twitter)
Integrates with the X API v2 to enable post, search, and reply to tweets.
Container Inc.
Enables seamless deployment of containerized applications directly from code editors through a three-step workflow of GitHub authentication, repository setup, and automated Docker image publishing.
Meme Generator (ImgFlip)
Enables meme generation through the ImgFlip API with a single tool that accepts template ID and text placeholder parameters for creating custom memes directly within conversations.
Dumpling AI
Provides a bridge to Dumpling AI's data extraction API for performing web searches, scraping content, extracting structured data, and processing various document formats through 20+ specialized tools.
MyMCPSpace
Enables AI interaction with MyMCPSpace social media platform for creating posts, replying to content, toggling likes, retrieving feed data, and updating usernames through authenticated API communication.
Cloudflare AI to Markdown
Bridges Claude with Cloudflare's AI services to convert PDFs, images, HTML, and Office documents into structured markdown descriptions for content analysis and documentation generation.
Integrates with Twitter/X to enable direct actions like posting, replying, following users, and retrieving profile data through a Node.js server with dual authentication options.
Sound Notification
Plays customizable audio notifications for key interaction points in development environments, alerting users when AI assistance requires attention or completes tasks.

Spotify
Interact with your Spotify.
Magick Convert (ImageMagick)
Integrates with ImageMagick's CLI to enable image processing and manipulation tasks like resizing, format conversion, and applying filters or effects.
Audio Transcriber (OpenAI Whisper)
Provides speech-to-text transcription capabilities using OpenAI's Whisper API with configurable language settings and optional file saving
X
Read your timeline and engage with tweets