feat: add push_notification, set_permission, and get_clipboard tools#69
feat: add push_notification, set_permission, and get_clipboard tools#69saen-ai wants to merge 12 commits into
Conversation
…dep CVEs - record_video: was hardcoded to "booted", ignoring the udid param — now uses getBootedDeviceId() consistently with all other tools; also adds udid to the tool schema so callers can target a specific simulator - ui_view: JSON.parse on idb output had no error handling — server would crash on malformed output; wrapped in try/catch with a clear error message; also validates frame dimensions are positive numbers before use - ui_view: temp PNG/JPEG files now deleted immediately after reading instead of accumulating until server exit; file names include a random suffix to prevent collisions on rapid successive calls - record_video: improved start detection — now rejects properly if the process exits early, increased timeout from 3s to 5s, tracks resolved state to avoid double-settling the promise - deps: updated @modelcontextprotocol/sdk to latest (fixes CVE ReDoS, cross-client data leak, DNS rebinding — all high severity); ran npm audit fix for 6 additional moderate/low vulns in ajv, body-parser, minimatch, path-to-regexp, qs, diff Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
idb ui text only supports ASCII keycodes and throws 'No keycode found' for any emoji or non-ASCII character. This adds a new ui_paste tool that works around the limitation using the macOS pasteboard: 1. Copies text to the Mac clipboard via pbcopy 2. Syncs it to the simulator pasteboard via xcrun simctl pbsync 3. Long-presses at the given coordinates to trigger the paste menu 4. Finds the Paste button in the accessibility tree and taps it This enables typing emoji, Arabic, Chinese, and any Unicode text into simulator inputs — essential for testing apps with international users or emoji-heavy content. ui_type is unchanged and remains the right tool for ASCII text.
1.5s was triggering iOS system gestures (app switcher / home screen), dismissing the app before the paste menu appeared. 0.8s is long enough to trigger the contextual paste menu without conflicting with system gestures.
idb ui tap requires integer x/y values — passing floats like 55.166... causes 'invalid int value' error. Round the calculated center coordinates.
terminate_app: kills a running app by bundle ID without having to relaunch it — useful for testing cold-start flows and crash recovery open_url: opens any URL or deep link in the simulator — essential for testing universal links, custom URL schemes, and OAuth redirect flows list_apps: lists all installed apps with their bundle IDs and display names, sorted alphabetically — removes the need to manually look up bundle IDs before calling launch_app or terminate_app
- max_size: resizes screenshot proportionally using sips when the image exceeds the given pixel dimension (width or height). Solves the Claude 2000px API limit issue (joshuayoes#42). - force: prevents silent overwrites by erroring when the output file already exists. Defaults to false (joshuayoes#19). Fixes joshuayoes#42, Fixes joshuayoes#19
Instead of pkill-ing all simctl recordVideo processes, we now store each recording's ChildProcess in a Map keyed by UDID. stop_recording sends SIGINT to that specific PID, leaving any other simulators or concurrent idb operations untouched. Falls back to pkill if no tracked process is found (e.g. server restarted mid-recording) so behaviour is never worse than before. Also adds an optional udid param to stop_recording for multi-simulator setups. Fixes joshuayoes#20
Adds an optional timeout (1–3600 seconds) to record_video. When set, a setTimeout fires after the given duration and sends SIGINT to the tracked recording process — same targeted kill used by stop_recording. If omitted, behaviour is unchanged: recording runs until stop_recording is called. Fixes joshuayoes#5
Adds a rotate_device tool that rotates the iOS Simulator left (counter-clockwise) or right (clockwise) using the Simulator app's built-in keyboard shortcuts via osascript. Supports an optional `times` param (1–3) for multi-increment rotations without multiple tool calls. A 500ms delay between increments lets the simulator animate each step. Contributes to joshuayoes#49
Replaces the generic prompt list with five concrete end-to-end flows that reflect how the tool is actually used: bug reproduction recording, post-implementation feature validation, React Native Redbox debugging, Unicode/emoji text input testing, and rotation-based layout checks. Also adds missing tool entries to the Tools section for ui_paste, rotate_device, terminate_app, open_url, and list_apps. Fixes joshuayoes#40
push_notification: sends a simulated APNs push to any app via xcrun simctl push. Accepts title, body, badge, and optional custom data payload. Writes payload to a temp file (cleaned up after use) and validates the 4096-byte limit before sending. set_permission: grants, revokes, or resets any iOS privacy permission via xcrun simctl privacy. Supports all 13 services (camera, photos, location, microphone, etc). Validates that bundle_id is provided for grant/revoke actions. get_clipboard: reads the simulator clipboard via xcrun simctl pbpaste. Useful for verifying copy behaviour in apps like snippet managers where clipboard correctness is a core feature.
|
Thanks — all three tools are well-scoped wrappers that follow the existing patterns (
Content-wise this is the most straightforward of your open PRs. Once rebased and the nits addressed, I'm ready to land it. |
Summary
Adds three new tools that cover common testing scenarios that were previously impossible with this MCP server.
New Tools
push_notificationSend a simulated APNs push notification to any app on the simulator.
title,body,badge, and optionaluser_infofor custom data payloads (deep link routing etc.)xcrun simctl pushunder the hoodset_permissionGrant, revoke, or reset any iOS privacy permission without touching the UI.
Supports all 13 services:
all,calendar,contacts-limited,contacts,location,location-always,photos-add,photos,media-library,microphone,motion,reminders,siriUses
xcrun simctl privacy.get_clipboardRead what's currently on the simulator clipboard — essential for testing copy behaviour.
Uses
xcrun simctl pbpaste.Test plan
push_notificationwith title + body → notification appears on simulatorpush_notificationwithuser_info→ custom data included in payloadpush_notificationwith payload > 4096 bytes → returns error before sendingset_permissionwithaction: "grant"→ permission granted without system dialogset_permissionwithaction: "revoke"→ permission deniedset_permissionwithaction: "reset"→ app prompts again on next useset_permissionwithaction: "grant"and nobundle_id→ returns validation errorget_clipboardafter copying text in app → returns correct clipboard contentget_clipboardwith empty clipboard → returns "(clipboard is empty)"