MCP-PyAutoGUI-Server provides automated GUI testing and control capabilities through a Python-based interface. Developed by He Tao, this server wraps the PyAutoGUI library to enable AI assistants to control mouse movements, simulate keyboard input, take screenshots, and find images on screen across Windows, macOS, and Linux. The implementation offers tools for precise cursor positioning, clicking, typing text, pressing hotkeys, and screen analysis through a standardized protocol. It's particularly useful for automating repetitive GUI tasks, creating test scripts, or allowing AI systems to interact directly with desktop applications through visual interfaces.
Move mouse to specific coordinates.
Click at the current or specified position.
Perform drag and drop operations.
Retrieve the current position of the mouse.
Simulate typing of specified text.
Simulate pressing an individual key.
Simulate pressing a combination of keys as a hotkey.
Capture a screenshot of the current screen.
Retrieve the size of the screen.
Locate specified image locations on the screen.
Retrieve the color of a specific pixel on the screen.
No reviews yet. Be the first to review!
Sign in to join the conversation