MCP-PyAutoGUI-Server provides automated GUI testing and control through a Python-based interface. Developed by He Tao, this server wraps the PyAutoGUI library so that AI assistants can move the mouse, simulate keyboard input, take screenshots, and find images on screen across Windows, macOS, and Linux. It exposes tools for precise cursor positioning, clicking, typing text, pressing hotkeys, and screen analysis through the Model Context Protocol (MCP). It is particularly useful for automating repetitive GUI tasks, creating test scripts, or letting AI systems interact directly with desktop applications through their visual interfaces.
Move mouse to specific coordinates.
Click at current or specified position.
Perform drag and drop operations.
Retrieve the current position of the mouse.
Simulate typing of text.
Simulate pressing individual keys.
Simulate pressing hotkey combinations.
Capture a screenshot of the screen.
Retrieve the size of the screen.
Locate an image on the screen.
Retrieve the color of a specific pixel.
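The tools listed above correspond closely to functions in the PyAutoGUI library that the server wraps. The following is a minimal sketch of that correspondence, not the server's actual implementation: the tool names on the left and the `call_tool` helper are hypothetical, chosen only for illustration, while the function names on the right are real PyAutoGUI calls. PyAutoGUI is imported lazily because it needs a desktop session to load.

```python
from typing import Any

# Hypothetical tool names (left) mapped to the real PyAutoGUI function
# (right) that each tool would plausibly wrap. The tool names are
# assumptions for illustration, not the server's documented identifiers.
TOOL_TO_PYAUTOGUI = {
    "move_to": "moveTo",                   # move mouse to coordinates
    "click": "click",                      # click at current or given position
    "drag_to": "dragTo",                   # drag and drop
    "get_position": "position",            # current mouse position
    "type_text": "write",                  # type a string
    "press_key": "press",                  # press a single key
    "hotkey": "hotkey",                    # press a key combination
    "screenshot": "screenshot",            # capture the screen
    "get_screen_size": "size",             # screen width and height
    "locate_on_screen": "locateOnScreen",  # find an image on screen
    "pixel_color": "pixel",                # color of one pixel
}


def call_tool(name: str, *args: Any, backend: Any = None, **kwargs: Any) -> Any:
    """Dispatch a (hypothetical) tool name to the matching backend function.

    `backend` defaults to the pyautogui module; passing another object with
    the same method names allows a dry run without touching the real desktop.
    """
    if name not in TOOL_TO_PYAUTOGUI:
        raise ValueError(f"unknown tool: {name}")
    if backend is None:
        import pyautogui  # lazy import: requires a display to be available
        backend = pyautogui
    return getattr(backend, TOOL_TO_PYAUTOGUI[name])(*args, **kwargs)
```

On a machine with a display, `call_tool("move_to", 100, 200)` would move the cursor and `call_tool("hotkey", "ctrl", "c")` would press the copy shortcut. Routing every call through one table keeps the set of permitted actions explicit, which is a reasonable precaution when an AI assistant is driving the desktop.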