PyAutoGUI
Summary
MCP-PyAutoGUI-Server provides automated GUI testing and control capabilities through a Python-based interface. Developed by He Tao, this server wraps the PyAutoGUI library to enable AI assistants to control mouse movements, simulate keyboard input, take screenshots, and find images on screen across Windows, macOS, and Linux. The implementation offers tools for precise cursor positioning, clicking, typing text, pressing hotkeys, and screen analysis through a standardized protocol. It's particularly useful for automating repetitive GUI tasks, creating test scripts, or allowing AI systems to interact directly with desktop applications through visual interfaces.
Available Actions(11)
Move mouse
Move mouse to specific coordinates. Parameters: x (int), y (int)
Click mouse
Click at current or specified position. Parameters: position (optional tuple of int, int)
Drag and drop
Perform drag and drop operations. Parameters: start_position (tuple of int, int), end_position (tuple of int, int)
Get current mouse position
Retrieve the current position of the mouse cursor.
Type text
Simulate keyboard input to type text. Parameters: text (string)
Press key
Press individual keys on the keyboard. Parameters: key (string)
Hotkey combinations
Simulate pressing hotkey combinations. Parameters: hotkeys (list of strings)
Take screenshot
Capture a screenshot of the current screen.
Get screen size
Retrieve the current screen size.
Find image on screen
Locate image positions on the screen. Parameters: image_path (string)
Get pixel color
Retrieve the color of a pixel at specified coordinates. Parameters: x (int), y (int)
Reseñas de la Comunidad
Aún no hay reseñas. ¡Sé el primero en reseñar!
Inicia sesión para unirte a la conversación