MCP-PyAutoGUI-Server provides automated GUI testing and control capabilities through a Python-based interface. Developed by He Tao, this server wraps the PyAutoGUI library to enable AI assistants to control mouse movements, simulate keyboard input, take screenshots, and find images on screen across Windows, macOS, and Linux. The implementation offers tools for precise cursor positioning, clicking, typing text, pressing hotkeys, and screen analysis through a standardized protocol. It's particularly useful for automating repetitive GUI tasks, creating test scripts, or allowing AI systems to interact directly with desktop applications through visual interfaces.
No reviews yet. Be the first to review!
Sign in to join the conversation
Move mouse to specific coordinates. Parameters: x (int), y (int)
Click at current or specified position. Parameters: position (optional tuple of int)
Perform drag and drop operations. Parameters: start_position (tuple of int), end_position (tuple of int)
Get current mouse position.
Simulate typing text. Parameters: text (string)
Press individual keys. Parameters: key (string)
Simulate hotkey combinations. Parameters: keys (list of strings)
Take a screenshot.
Retrieve the current screen size.
Locate images on the screen. Parameters: image_path (string)
Get pixel colors at specific coordinates. Parameters: x (int), y (int)