MCP-PyAutoGUI-Server provides automated GUI testing and control capabilities through a Python-based interface. Developed by He Tao, this server wraps the PyAutoGUI library to enable AI assistants to control mouse movements, simulate keyboard input, take screenshots, and find images on screen across Windows, macOS, and Linux. The implementation offers tools for precise cursor positioning, clicking, typing text, pressing hotkeys, and screen analysis through a standardized protocol. It's particularly useful for automating repetitive GUI tasks, creating test scripts, or allowing AI systems to interact directly with desktop applications through visual interfaces.
まだレビューはありません. 最初のレビューを投稿しましょう!
会話に参加するにはサインインしてください
Move mouse to specific coordinates. Parameters: x (int), y (int)
Click at current or specified position. Parameters: x (optional int), y (optional int)
Perform drag and drop operations. Parameters: start_x (int), start_y (int), end_x (int), end_y (int)
Get current mouse position. Returns current coordinates as (x, y)
Type text. Parameters: text (string)
Press individual keys. Parameters: key (string)
Perform hotkey combinations. Parameters: keys (list of strings)
Take a screenshot. Returns an image object or file path
Get screen size. Returns dimensions as (width, height)
Find image locations on screen. Parameters: image_path (string), confidence (optional float)
Get pixel colors. Parameters: x (int), y (int) - Returns the color of the pixel at the specified coordinates