Phone MCP is a Python-based server that enables AI assistants to control Android phones through ADB commands. Developed by hao, it provides tools for making calls, sending text messages, taking screenshots, recording screens, managing contacts, launching apps, and retrieving system information. The implementation includes specialized modules for different phone functionalities and supports map-related features when an API key is configured. It's particularly useful for scenarios requiring remote phone control, automated testing, or accessibility assistance, allowing AI agents to interact with mobile devices without direct physical manipulation.
No reviews yet. Be the first to review!
Sign in to join the conversation
Make a call to the specified phone number. Parameters: phone number (string)
End the current active call.
Send an SMS to the specified phone number with the given message. Parameters: phone number (string), message (string)
Get received messages with pagination options. Parameters: --limit (integer)
Get sent messages with pagination options. Parameters: --limit (integer)
Get phone contacts with pagination options. Parameters: --limit (integer)
Create a new contact with the specified name and phone number using UI automation. Parameters: name (string), phone (string)
Take a screenshot of the current screen.
Record the screen for the specified duration. Parameters: --duration (integer)
Launch the specified app by its name. Parameters: app name (string)
Launch the specified app using an alternative method. Parameters: app name (string)
Close the specified app by its package name. Parameters: package name (string)
List installed apps with options for pagination and detail. Parameters: --page (integer), --page-size (integer), --detailed (boolean)
Launch a specific activity of an app using its package and activity name. Parameters: package name (string), activity name (string)
Open the specified URL in the device's default browser. Parameters: URL (string)
Analyze the current screen and provide structured information about UI elements.
Execute various screen interaction actions. Parameters: action (string), params (dictionary)
Search for nearby points of interest (POIs) based on geographical coordinates and keywords. Parameters: coordinates (string), keywords (string), radius (integer)
Monitor the user interface for changes with specified options. Parameters: --interval (float), --duration (integer), --watch-for (string), --text (string), --id (string), --class-name (string), --raw (boolean)