Media

238

Model Context Protocol servers for processing images, videos, and multimedia content with AI

ElevenLabs

Integrates with ElevenLabs to provide high-quality text-to-speech, voice cloning, and conversational capabilities with customizable voice profiles and audio processing features.

5.0

690

...

YouTube Transcript Extractor

Extracts transcripts from YouTube videos using various URL formats, enabling automated content analysis and summarization.

5.0

...

MiniMax-MCP

Official MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech and video/image generation APIs. This server allows MCP clients like Claude Desktop, Cursor, Windsurf, OpenAI Agents and others to generate speech, clone voices, generate video, generate image and more.

964

...

Kubernetes

Integrates with Kubernetes clusters to enable direct pod, deployment, and service management operations through specialized FastMCP tools, eliminating the need to switch between AI conversations and command-line interfaces for DevOps workflows.

...

Local File Organizer

Automatically organizes files by categorizing them based on extensions while preserving project structures for efficient directory management and backup preparation.

...

NASA Astronomy Picture of the Day

Integrates with NASA's Astronomy Picture of the Day API to retrieve daily space images and descriptions directly within the Cursor IDE development environment.

...

LetzAI

Integrates with LetzAI to enable image generation and upscaling through natural language commands with customizable parameters like dimensions, quality, and creativity.

...

Bing Search

Enables web, news, and image searches through Microsoft's Bing Search API, providing access to up-to-date information from the internet.

...

ComfyUI

Integrates with ComfyUI to enable natural language-driven image generation using customizable Stable Diffusion workflows

...

Comfy (Stable Diffusion)

Integrates with ComfyUI to enable text-to-image generation using customizable Stable Diffusion workflows.

...

Strapi CMS

Integrates with Strapi CMS to enable creating, reading, updating, and deleting content entries with support for filtering, pagination, sorting, and media uploads through URI-based resource patterns.

...

Speech Interface (Faster Whisper)

Integrates voice interaction capabilities using faster-whisper and PyAudio for speech recognition and synthesis, enabling natural language voice interfaces for AI models.

...

Video Editor (FFMpeg)

Integrates FFmpeg for video editing operations, enabling tasks like trimming, merging, and format conversion through natural language commands.

...

Multichat

Enables parallel communication with multiple unichat-based servers, allowing users to query different language models simultaneously and compare their responses through organized message routing and storage.

...

PancakeSwap PoolSpy

Tracks newly created PancakeSwap liquidity pools in real-time, providing detailed metrics like token pairs, transaction counts, volume, and TVL for DeFi traders and analysts on BNB Smart Chain.

...

Crypto Sentiment (Santiment)

Delivers cryptocurrency sentiment analysis by leveraging Santiment's social media and news data, enabling traders to retrieve sentiment metrics, monitor mentions, detect volume shifts, identify trending topics, and measure asset dominance in real-time.

...

CryptoPanic

Integrates with CryptoPanic to provide real-time cryptocurrency news, analysis, and video content with configurable pagination for financial analysis and investment decision support.

...

Image Dimensions

Retrieves image dimensions from URLs and local files with optional compression capabilities through the Tinify API for image analysis and processing tasks.

...

Sound Notification

Plays customizable audio notifications for key interaction points in development environments, alerting users when AI assistance requires attention or completes tasks.

...

MarkItDown

Converts diverse file formats to Markdown using MarkItDown utility, enabling unified text-based workflows for content migration, documentation, and analysis.

...

Miro

Integrates with Miro's collaborative whiteboard platform, providing over 80 tools for managing boards, creating and manipulating various item types, and handling enterprise features for visual collaboration workflows.

...

Image Toolkit

Provides image manipulation capabilities through Gemini models and third-party APIs for generating images from text, modifying existing images, and removing backgrounds with automatic FreeImage.host uploading.

...

YouTube Downloader

Integrates with YouTube using yt-dlp to enable downloading of videos and subtitles for content analysis and processing tasks.

...

Bluesky

Query and analyze data from the decentralized social network.

...

AivisSpeech

Enables AI systems to generate and play speech audio from text input through the AivisSpeech API, with configurable speaker settings for voice output applications.

...

Playwright-Lighthouse

Combines Playwright's browser automation with Lighthouse's auditing capabilities to analyze website performance, generate detailed reports, and capture screenshots for web development optimization.

...

KaiaFun

Enables interaction with the KaiaFun memecoin platform for listing, buying, selling, and managing tokens on the Kaia blockchain.

...

X (Twitter)

Integrates with X using real browser APIs to bypass rate limits and enable extensive social media operations and data analysis.

...

ComfyUI

Integrates ComfyUI's stable diffusion interface to enable programmatic image generation through customizable node-based workflows.

...

Containerd

Enables container management through natural language commands by bridging Containerd's Container Runtime Interface for listing, creating, and removing containers and pods without complex CLI syntax.

...

ComfyUI

Integrates ComfyUI with WebSocket communication for on-demand image generation, enabling customizable requests with parameters like prompt, width, and height.

...

Jina AI

Integrates with Jina AI's web services to enable web content extraction, search, and fact-checking through natural language interactions.

...

Flux Studio

Bridges Flux's image generation capabilities to coding environments, enabling text-to-image, image-to-image, inpainting, and structural control operations directly within IDEs through TypeScript-to-CLI command translation.

...

Media Automation Hub (YARR)

Integrates with popular media automation services including Sonarr, Radarr, and Plex to provide a unified interface for searching, monitoring, and managing media collections without switching between multiple web interfaces.

...

Overseerr

Integrates with Overseerr to enable media searching, retrieval, request management, and personalized recommendations for Plex libraries.

...

Pokemon TCG Card Search

Enables searching and displaying Pokemon Trading Card Game cards with rich filtering capabilities for name, type, legality, and more through the Pokemon TCG API.

...

YouTube Transcripts

Integrates with YouTube's transcript API to retrieve and process captions from video URLs, enabling content analysis and information extraction from spoken video content.

...

Florence-2

Integrates with Florence-2 to enable advanced image analysis and manipulation tasks like visual question answering, image captioning, and content-based image retrieval.

...

Jina AI

Integrates with Jina's AI services to provide efficient handling of language and multimodal AI requests for applications requiring natural language processing and image analysis.

...

OpenSCAD 3D Model Generator

Transforms natural language descriptions into parametric 3D models through a pipeline of image generation, object segmentation, 3D modeling, and OpenSCAD code conversion for customizable 3D printing.

...

DALL-E

Integrates with OpenAI's DALL-E API to enable image generation with fine-grained control over parameters including model selection, size, quality, and style through a command-line interface.

...

Goose Extensions

Extends Goose AI assistant with five specialized servers for Plex Media Server interaction, Rotten Tomatoes scraping, eBay sales data retrieval, SearxNG web searches, and Taskwarrior task management.

...

Zoom

Enables AI to create and manage Zoom meetings with customizable settings through server-to-server OAuth authentication with the Zoom API.

...

Unsplash

Integrates with Unsplash's photo library to enable image search and retrieval with customizable parameters including search terms, pagination, ordering, color filtering, and orientation preferences.

...

Vibe Worldbuilding

Guides users through systematic worldbuilding with structured prompts and Google Imagen integration for generating visual representations of fictional universe elements

...

Draw Things

Integrates with the Draw Things API to convert text prompts or JSON inputs into JSON-RPC requests, enabling AI image generation capabilities with automatic saving and error handling.

...

Image Generation (Cloudflare)

Integrates with Cloudflare's AI image generation capabilities, leveraging Workers for serverless deployment to enable on-demand image creation for content generation and visual design tasks.

...

VseGPT Image Generator

Integrates with VseGPT API to generate images from English-language prompts and store them locally with timestamp-based filenames.

...