BrowserCat MCP Server provides cloud-based browser automation capabilities through the Model Context Protocol, enabling LLMs to interact with web pages without local browser installation. Developed by BrowserCat, it offers tools for navigation, screenshot capture, element interaction (clicking, hovering, form filling), and JavaScript execution in a real browser environment. The server connects to BrowserCat's cloud browser service via WebSocket, authenticates with an API key, and exposes both console logs and screenshots as resources. This implementation is particularly valuable for AI assistants that need to perform web-based tasks like data extraction, form submission, or visual analysis of websites.
Navigate to any URL in the browser. Input: `url` (string): URL to navigate to
Capture screenshots of the entire page or specific elements. Inputs: `name` (string, required), `selector` (string, optional), `width` (number, optional, default: 800), `height` (number, optional, default: 600)
Click elements on the page. Input: `selector` (string): CSS selector for element to click
Hover over elements on the page. Input: `selector` (string): CSS selector for element to hover over
Fill out input fields. Inputs: `selector` (string): CSS selector for input field; `value` (string): Value to fill
Select an option from a dropdown menu. Inputs: `selector` (string): CSS selector for select element; `value` (string): Value to select
Execute JavaScript in the browser console. Input: `script` (string): JavaScript code to execute
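To illustrate how a client might drive the tools listed above, here is a minimal sketch that builds MCP `tools/call` requests as JSON-RPC 2.0 messages. The tool names (`browsercat_navigate` and so on) are assumptions, since this listing does not state them; confirm the real names from the server's `tools/list` response. The `tool_call` and `screenshot_args` helpers are hypothetical and exist only for this example. The argument names and the screenshot defaults (800 by 600) come from the descriptions above.

```python
import json
from itertools import count

# Assumed tool names: the listing above does not give them, so verify
# them against the server's `tools/list` response before relying on them.
NAVIGATE = "browsercat_navigate"
FILL = "browsercat_fill"
CLICK = "browsercat_click"
SCREENSHOT = "browsercat_screenshot"

_ids = count(1)  # JSON-RPC request ids, starting at 1


def tool_call(name, arguments):
    """Build an MCP `tools/call` request as a JSON-RPC 2.0 message (hypothetical helper)."""
    return {
        "jsonrpc": "2.0",
        "id": next(_ids),
        "method": "tools/call",
        "params": {"name": name, "arguments": arguments},
    }


def screenshot_args(name, selector=None, width=800, height=600):
    """Assemble screenshot arguments using the defaults documented above."""
    args = {"name": name, "width": width, "height": height}
    if selector is not None:
        # Omit `selector` entirely to capture the whole page.
        args["selector"] = selector
    return args


# A small form-submission flow: open a page, fill a field, submit, capture the result.
calls = [
    tool_call(NAVIGATE, {"url": "https://example.com"}),
    tool_call(FILL, {"selector": "#email", "value": "user@example.com"}),
    tool_call(CLICK, {"selector": "button[type=submit]"}),
    tool_call(SCREENSHOT, screenshot_args("after-submit")),
]

for call in calls:
    print(json.dumps(call))
```

In practice an MCP client library would send these messages over the server's transport after the initialization handshake; the sketch only shows the shape of the payloads.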