Media

Model Context Protocol servers for processing images, videos, and multimedia content with AI

ElevenLabs - AI server banner image
ElevenLabs - MCP server logo (Official)

ElevenLabs

Integrates with ElevenLabs to provide high-quality text-to-speech, voice cloning, and conversational capabilities with customizable voice profiles and audio processing features.

5.0
690
...
YouTube Transcript Extractor - Productivity server banner image
YouTube Transcript Extractor - MCP server logo (Official)

YouTube Transcript Extractor

Extracts transcripts from YouTube videos using various URL formats, enabling automated content analysis and summarization.

5.0
1
...
MiniMax-MCP - Developer Tools server banner image
MiniMax-MCP - MCP server logo (Official)

MiniMax-MCP

Official MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech and video/image generation APIs. This server allows MCP clients like Claude Desktop, Cursor, Windsurf, OpenAI Agents and others to generate speech, clone voices, generate video, generate image and more.

964
...
Kubernetes - MCP server logo

Kubernetes

Integrates with Kubernetes clusters to enable direct pod, deployment, and service management operations through specialized FastMCP tools, eliminating the need to switch between AI conversations and command-line interfaces for DevOps workflows.

1
...
Local File Organizer - MCP server logo

Local File Organizer

Automatically organizes files by categorizing them based on extensions while preserving project structures for efficient directory management and backup preparation.

0
...
NASA Astronomy Picture of the Day - MCP server logo

NASA Astronomy Picture of the Day

Integrates with NASA's Astronomy Picture of the Day API to retrieve daily space images and descriptions directly within the Cursor IDE development environment.

0
...
LetzAI - MCP server logo

LetzAI

Integrates with LetzAI to enable image generation and upscaling through natural language commands with customizable parameters like dimensions, quality, and creativity.

1
...
Bing Search - MCP server logo

Bing Search

Enables web, news, and image searches through Microsoft's Bing Search API, providing access to up-to-date information from the internet.

25
...
ComfyUI - MCP server logo

ComfyUI

Integrates with ComfyUI to enable natural language-driven image generation using customizable Stable Diffusion workflows

9
...
Comfy (Stable Diffusion) - MCP server logo

Comfy (Stable Diffusion)

Integrates with ComfyUI to enable text-to-image generation using customizable Stable Diffusion workflows.

31
...
Strapi CMS - MCP server logo

Strapi CMS

Integrates with Strapi CMS to enable creating, reading, updating, and deleting content entries with support for filtering, pagination, sorting, and media uploads through URI-based resource patterns.

20
...
Speech Interface (Faster Whisper) - MCP server logo

Speech Interface (Faster Whisper)

Integrates voice interaction capabilities using faster-whisper and PyAudio for speech recognition and synthesis, enabling natural language voice interfaces for AI models.

31
...
Video Editor (FFMpeg) - MCP server logo

Video Editor (FFMpeg)

Integrates FFmpeg for video editing operations, enabling tasks like trimming, merging, and format conversion through natural language commands.

21
...
Multichat - MCP server logo

Multichat

Enables parallel communication with multiple unichat-based servers, allowing users to query different language models simultaneously and compare their responses through organized message routing and storage.

0
...
PancakeSwap PoolSpy - MCP server logo

PancakeSwap PoolSpy

Tracks newly created PancakeSwap liquidity pools in real-time, providing detailed metrics like token pairs, transaction counts, volume, and TVL for DeFi traders and analysts on BNB Smart Chain.

5
...
Crypto Sentiment (Santiment) - MCP server logo

Crypto Sentiment (Santiment)

Delivers cryptocurrency sentiment analysis by leveraging Santiment's social media and news data, enabling traders to retrieve sentiment metrics, monitor mentions, detect volume shifts, identify trending topics, and measure asset dominance in real-time.

6
...
CryptoPanic - MCP server logo

CryptoPanic

Integrates with CryptoPanic to provide real-time cryptocurrency news, analysis, and video content with configurable pagination for financial analysis and investment decision support.

54
...
Image Dimensions - MCP server logo

Image Dimensions

Retrieves image dimensions from URLs and local files with optional compression capabilities through the Tinify API for image analysis and processing tasks.

2
...
Sound Notification - MCP server logo

Sound Notification

Plays customizable audio notifications for key interaction points in development environments, alerting users when AI assistance requires attention or completes tasks.

1
...
MarkItDown - MCP server logo

MarkItDown

Converts diverse file formats to Markdown using MarkItDown utility, enabling unified text-based workflows for content migration, documentation, and analysis.

13
...
Miro - MCP server logo

Miro

Integrates with Miro's collaborative whiteboard platform, providing over 80 tools for managing boards, creating and manipulating various item types, and handling enterprise features for visual collaboration workflows.

22
...
Image Toolkit - MCP server logo

Image Toolkit

Provides image manipulation capabilities through Gemini models and third-party APIs for generating images from text, modifying existing images, and removing backgrounds with automatic FreeImage.host uploading.

14
...
YouTube Downloader - MCP server logo

YouTube Downloader

Integrates with YouTube using yt-dlp to enable downloading of videos and subtitles for content analysis and processing tasks.

84
...
Bluesky - MCP server logo

Bluesky

Query and analyze data from the decentralized social network.

19
...
AivisSpeech - MCP server logo

AivisSpeech

Enables AI systems to generate and play speech audio from text input through the AivisSpeech API, with configurable speaker settings for voice output applications.

0
...
Playwright-Lighthouse - MCP server logo

Playwright-Lighthouse

Combines Playwright's browser automation with Lighthouse's auditing capabilities to analyze website performance, generate detailed reports, and capture screenshots for web development optimization.

1
...
KaiaFun - MCP server logo

KaiaFun

Enables interaction with the KaiaFun memecoin platform for listing, buying, selling, and managing tokens on the Kaia blockchain.

1
...
X (Twitter) - MCP server logo

X (Twitter)

Integrates with X using real browser APIs to bypass rate limits and enable extensive social media operations and data analysis.

5
...
ComfyUI - MCP server logo

ComfyUI

Integrates ComfyUI's stable diffusion interface to enable programmatic image generation through customizable node-based workflows.

3
...
Containerd - MCP server logo

Containerd

Enables container management through natural language commands by bridging Containerd's Container Runtime Interface for listing, creating, and removing containers and pods without complex CLI syntax.

50
...
ComfyUI - MCP server logo

ComfyUI

Integrates ComfyUI with WebSocket communication for on-demand image generation, enabling customizable requests with parameters like prompt, width, and height.

32
...
Jina AI - MCP server logo

Jina AI

Integrates with Jina AI's web services to enable web content extraction, search, and fact-checking through natural language interactions.

17
...
Flux Studio - MCP server logo

Flux Studio

Bridges Flux's image generation capabilities to coding environments, enabling text-to-image, image-to-image, inpainting, and structural control operations directly within IDEs through TypeScript-to-CLI command translation.

20
...
Media Automation Hub (YARR) - MCP server logo

Media Automation Hub (YARR)

Integrates with popular media automation services including Sonarr, Radarr, and Plex to provide a unified interface for searching, monitoring, and managing media collections without switching between multiple web interfaces.

2
...
Overseerr - MCP server logo

Overseerr

Integrates with Overseerr to enable media searching, retrieval, request management, and personalized recommendations for Plex libraries.

0
...
Pokemon TCG Card Search - MCP server logo

Pokemon TCG Card Search

Enables searching and displaying Pokemon Trading Card Game cards with rich filtering capabilities for name, type, legality, and more through the Pokemon TCG API.

8
...
YouTube Transcripts - MCP server logo

YouTube Transcripts

Integrates with YouTube's transcript API to retrieve and process captions from video URLs, enabling content analysis and information extraction from spoken video content.

19
...
Florence-2 - MCP server logo

Florence-2

Integrates with Florence-2 to enable advanced image analysis and manipulation tasks like visual question answering, image captioning, and content-based image retrieval.

4
...
Jina AI - MCP server logo

Jina AI

Integrates with Jina's AI services to provide efficient handling of language and multimodal AI requests for applications requiring natural language processing and image analysis.

3
...
OpenSCAD 3D Model Generator - MCP server logo

OpenSCAD 3D Model Generator

Transforms natural language descriptions into parametric 3D models through a pipeline of image generation, object segmentation, 3D modeling, and OpenSCAD code conversion for customizable 3D printing.

19
...
DALL-E - MCP server logo

DALL-E

Integrates with OpenAI's DALL-E API to enable image generation with fine-grained control over parameters including model selection, size, quality, and style through a command-line interface.

0
...
Goose Extensions - MCP server logo

Goose Extensions

Extends Goose AI assistant with five specialized servers for Plex Media Server interaction, Rotten Tomatoes scraping, eBay sales data retrieval, SearxNG web searches, and Taskwarrior task management.

5
...
Zoom - MCP server logo

Zoom

Enables AI to create and manage Zoom meetings with customizable settings through server-to-server OAuth authentication with the Zoom API.

7
...
Unsplash - MCP server logo

Unsplash

Integrates with Unsplash's photo library to enable image search and retrieval with customizable parameters including search terms, pagination, ordering, color filtering, and orientation preferences.

7
...
Vibe Worldbuilding - MCP server logo

Vibe Worldbuilding

Guides users through systematic worldbuilding with structured prompts and Google Imagen integration for generating visual representations of fictional universe elements

7
...
Draw Things - MCP server logo

Draw Things

Integrates with the Draw Things API to convert text prompts or JSON inputs into JSON-RPC requests, enabling AI image generation capabilities with automatic saving and error handling.

8
...
Image Generation (Cloudflare) - MCP server logo

Image Generation (Cloudflare)

Integrates with Cloudflare's AI image generation capabilities, leveraging Workers for serverless deployment to enable on-demand image creation for content generation and visual design tasks.

2
...
VseGPT Image Generator - MCP server logo

VseGPT Image Generator

Integrates with VseGPT API to generate images from English-language prompts and store them locally with timestamp-based filenames.

3
...