AI-powered iOS app testing framework

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

These details have not been verified by PyPI

Project description

MobileTestAI

AI-powered iOS app testing framework. Describe what you want tested in plain English, and an LLM agent autonomously navigates your app on iOS simulators.

Why MobileTestAI?

There are tools that let AI agents tap buttons on a phone. MobileTestAI is the only one that coordinates multiple AI agents across multiple simulators simultaneously.

Testing a multiplayer game? A chat app? A collaborative tool? You need two (or more) devices interacting with each other — Player 1 creates a room, Player 2 joins with the room code, both verify they're in the lobby. No other open-source tool does this.

What makes it different

	MobileTestAI	Appium	XCUITest	Arbigent
Natural language goals	Yes	No	No	Yes
Multi-device orchestration	Yes	Manual	No	No
Cross-device variable passing	Yes	Manual	No	No
AI-driven navigation	Yes	No	No	Yes
iOS support	Yes	Yes	Yes	Yes
No test code required	Yes	No	No	Yes
Local LLM support	Yes	N/A	N/A	No

Use cases

Multiplayer apps — test game lobbies, matchmaking, real-time interactions across devices
Chat/messaging — verify message delivery between users on separate simulators
Collaborative tools — test shared documents, whiteboards, or workspaces
Single-device testing — works great for standard UI testing too
Regression testing — define YAML scenarios and rerun them after every build

How It Works

Each step of the agent loop:

Observe — captures the UI accessibility tree (and optionally a screenshot) from the simulator
Reason — sends the UI state to an LLM with the goal and action history
Act — executes the chosen action (tap, swipe, type, etc.) on the simulator
Repeat — continues until the agent reports "done", "fail", or hits the step limit

The agent uses structured text from the accessibility tree rather than vision alone — this is faster, cheaper, and more reliable than screenshot-only approaches.

Prerequisites

macOS with Xcode 26+ and iOS simulators
Python 3.11+ and uv
At least one device backend (see below)
At least one LLM provider (see below)

Installation

# Clone the repo
git clone https://github.com/TomMcGrath7/iOSTestAgents.git
cd iOSTestAgents

# Install Python dependencies
uv sync --extra dev

# Install XcodeBuildMCP (recommended device backend)
npm install -g xcodebuildmcp@latest

# Verify everything is set up
uv run iostestagents doctor

Setting Up an LLM Provider

MobileTestAI needs an LLM to read the UI and decide actions. It supports three providers and auto-detects which one to use based on available API keys.

Option 1: Ollama (local, free)

Run models locally on your Mac. No API key, no cost, no data leaves your machine.

# Install Ollama: https://ollama.com
brew install ollama
ollama serve

# Pull a model
ollama pull qwen3:8b

MobileTestAI auto-detects Ollama when it's running on localhost:11434.

Note on local models: Local models work but performance scales with model capability. Smaller models (7-8B parameters) can handle simple navigation tasks like "go to Settings > General". For complex multi-step flows, onboarding sequences, or apps with non-obvious UI patterns, cloud models (GPT-5.4, Claude) perform significantly better. We recommend starting with Ollama to try things out, then switching to a cloud provider for production use.

Option 2: OpenAI

Best balance of speed and capability. GPT-5.4 is the recommended model (current generation at the old gpt-4o price point); use gpt-5.5 for the premium flagship.

export OPENAI_API_KEY="sk-..."

Get your API key at platform.openai.com/api-keys.

Option 3: Anthropic (Claude)

The default model is claude-opus-4-8. For cheaper per-step costs on simple navigation tasks, use --model claude-haiku-4-5 ($1/$5 per 1M tokens vs $5/$25 for Opus).

export ANTHROPIC_API_KEY="sk-ant-..."

Get your API key at console.anthropic.com.

Choosing a provider

Provider	Cost	Speed	Quality	Privacy
Ollama (qwen3:8b)	Free	~15-20s/step	Good for simple tasks	Full — runs locally
OpenAI (gpt-5.4)	~$0.01-0.03/step	~3-5s/step	Excellent	Data sent to OpenAI
Anthropic (claude-opus-4-8)	~$0.01-0.05/step	~3-5s/step	Excellent	Data sent to Anthropic
Anthropic (claude-haiku-4-5)	~$0.005-0.01/step	~2-4s/step	Good	Data sent to Anthropic

You can override the auto-detected provider with --provider and --model:

uv run iostestagents run --provider ollama --model qwen3:8b ...
uv run iostestagents run --provider openai --model gpt-5.4 ...
uv run iostestagents run --provider anthropic --model claude-haiku-4-5 ...  # cheap per-step option

Quick Start

Run a single-device test:

uv run iostestagents run \
  --device "iPhone 17" \
  --app com.apple.Preferences \
  --goal "Navigate to General > About"

Run with a specific provider and screen recording:

uv run iostestagents run \
  --device "iPhone 17" \
  --app com.apple.mobilesafari \
  --goal "Open Safari, tap the address bar, type apple.com, and verify the page loads" \
  --provider openai --model gpt-5.4 \
  --record

Run a multi-device scenario:

uv run iostestagents scenario scenarios/multiplayer_example.yaml

Multi-Device Scenarios

This is MobileTestAI's unique capability. Scenarios are YAML files that coordinate multiple agents across multiple simulators:

name: multiplayer_join_test
app_bundle_id: com.example.myapp
app_path: "/path/to/MyApp.app"
players: 2
device: "iPhone 17"
backend: xcodebuildmcp
provider: openai
model: gpt-5.4
max_steps: 30

steps:
  - player: 1
    action: "Create a new game room"
    capture: room_code

  - player: 2
    action: "Join game using room code {room_code}"

  - all_players:
    action: "Verify both players are in the lobby"
    parallel: true

Orchestration Features

Cross-device variable passing — capture extracts a value from the screen (like a room code), {var} injects it into another player's step
Barriers — all_players steps block until all devices reach that point
Sequential or parallel — steps run one at a time by default, or parallel: true for simultaneous actions
Failure handling — on_failure: continue to keep going when a step fails

See the scenarios/ directory for more examples.

CLI Reference

iostestagents run
  --device, -d    Simulator device name (required)
  --app, -a       App bundle identifier (required)
  --goal, -g      Natural language test goal (required)
  --max-steps     Maximum agent steps (default: 20)
  --backend, -b   Device backend: xcodebuildmcp or testbridge (default: testbridge)
  --provider      LLM provider: openai, anthropic, ollama (auto-detect if omitted)
  --model         LLM model to use (default per provider)
  --output, -o    Output directory (default: output)
  --verbose, -v   Verbose logging
  --step-delay    Delay between steps in seconds (default: 1.5)
  --record        Record screen video
  --no-reset      Skip app reset before run
  --app-path      Path to .app bundle for install
  --no-vision     Skip screenshots, use accessibility tree only (faster)

iostestagents scenario <path>
  Run a multi-device YAML scenario file

iostestagents doctor
  Check environment setup and backend availability

Writing Good Goals

Goals should be specific and include any gates (onboarding, login) the agent needs to get through:

# Good — specific with clear completion criteria
"Tap Get Started, complete onboarding by tapping Continue on each screen, then navigate to Settings"

# Bad — no clear completion criteria
"Explore the app"

# Bad — assumes the agent can skip onboarding
"Go to Settings"

Think of goals as instructions for someone who has never seen the app before.

Device Backends

Backend	How it works	Multi-device	Setup
XcodeBuildMCP (recommended)	Stateless CLI calls	Yes	`npm install -g xcodebuildmcp@latest`
TestBridge	HTTP server via XCUITest	Single device only	Just Xcode (included in repo)

TestBridge is a custom XCUITest HTTP bridge included in the testbridge/ directory. It starts an HTTP server on localhost:8615 giving the Python agent full control of a running simulator. No external dependencies — just Xcode.

XcodeBuildMCP is a third-party CLI by Sentry with 59+ tools for Xcode automation, including real device support.

Architecture

src/iostestagents/
├── cli.py                    # Typer CLI
├── agent/
│   ├── loop.py               # Core observe → reason → act loop
│   ├── prompts.py            # LLM prompt templates
│   ├── models.py             # Pydantic action/result models
│   └── ui_parser.py          # Accessibility tree parser
├── device/
│   ├── base.py               # DeviceBackend protocol
│   ├── xcodebuildmcp.py      # XcodeBuildMCP backend
│   ├── bridge.py             # TestBridge backend
│   ├── simulator.py          # Simulator lifecycle (simctl)
│   └── idb.py                # Legacy idb fallback
├── llm/
│   ├── base.py               # LLM provider protocol
│   ├── openai.py             # OpenAI provider
│   ├── anthropic.py          # Anthropic provider
│   ├── ollama.py             # Ollama (local) provider
│   └── registry.py           # Provider auto-detection
├── orchestrator/             # Multi-device coordination
│   ├── coordinator.py
│   ├── scenario.py
│   └── sync.py
└── util/
    └── logging.py

Development

# Run tests
uv run pytest

# Run tests with verbose output
uv run pytest -v

# Check environment
uv run iostestagents doctor

Limitations

macOS only (requires Xcode and iOS simulators)
Each simulator uses ~2GB RAM — a MacBook can handle 3-4 concurrent simulators, a Mac Studio 8+
LLM agents are non-deterministic — the same goal may produce different action sequences across runs
System dialogs (Mail compose, Share Sheet) are generally not automatable
SwiftUI views may need .accessibilityIdentifier() for reliable element targeting
describe_ui can take 1-3 seconds on complex view hierarchies
Local models (7-8B) struggle with complex navigation — use cloud models for production

Related Projects

XcodeBuildMCP — MCP server and CLI for Xcode automation (used as a backend)
DroidRun — Similar concept for Android
Arbigent — AI agent for testing Android, iOS, and Web apps
Xcode MCP Bridge — Apple's native MCP for coding workflows (different niche — builds/tests, not UI automation)

License

Apache License 2.0 — see LICENSE for details.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

TomMcGrath

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.2.0

Jun 12, 2026

0.1.0

Mar 23, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

iostestagents-0.2.0.tar.gz (140.3 kB view details)

Uploaded Jun 12, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

iostestagents-0.2.0-py3-none-any.whl (49.6 kB view details)

Uploaded Jun 12, 2026 Python 3

File details

Details for the file iostestagents-0.2.0.tar.gz.

File metadata

Download URL: iostestagents-0.2.0.tar.gz
Upload date: Jun 12, 2026
Size: 140.3 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for iostestagents-0.2.0.tar.gz
Algorithm	Hash digest
SHA256	`75b9ddd9c9e4319d9c1cab8e792ef00a3512eefe4f714348b7eab39e12d0fb3b`
MD5	`bd17b98ee1d4b625599f113e1aa4ad5f`
BLAKE2b-256	`2379302d73d96cd6e2407e7ba93076127ce2dd3de3c6741acaa1d60f3698d915`

See more details on using hashes here.

Provenance

The following attestation bundles were made for iostestagents-0.2.0.tar.gz:

Publisher: publish.yml on TomMcGrath7/iOSTestAgents

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: iostestagents-0.2.0.tar.gz
- Subject digest: 75b9ddd9c9e4319d9c1cab8e792ef00a3512eefe4f714348b7eab39e12d0fb3b
- Sigstore transparency entry: 1805348316
- Sigstore integration time: Jun 12, 2026
Source repository:
- Permalink: TomMcGrath7/iOSTestAgents@c4454a549b15974bcf5f0a4b30c021cd220d33f9
- Branch / Tag: refs/tags/v0.2.0
- Owner: https://github.com/TomMcGrath7
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@c4454a549b15974bcf5f0a4b30c021cd220d33f9
- Trigger Event: release

File details

Details for the file iostestagents-0.2.0-py3-none-any.whl.

File metadata

Download URL: iostestagents-0.2.0-py3-none-any.whl
Upload date: Jun 12, 2026
Size: 49.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for iostestagents-0.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`51a3263a793a42c0ad19b8187de712c71976b30fed7e3b93501a03dc5c655446`
MD5	`767652e7ad24c0900fdb72659357a4c0`
BLAKE2b-256	`c0a55905d9a3c77571bd8beb140b247eb762421a720c25987c5a9d5ae5d3f3c6`

See more details on using hashes here.

Provenance

The following attestation bundles were made for iostestagents-0.2.0-py3-none-any.whl:

Publisher: publish.yml on TomMcGrath7/iOSTestAgents

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: iostestagents-0.2.0-py3-none-any.whl
- Subject digest: 51a3263a793a42c0ad19b8187de712c71976b30fed7e3b93501a03dc5c655446
- Sigstore transparency entry: 1805348427
- Sigstore integration time: Jun 12, 2026
Source repository:
- Permalink: TomMcGrath7/iOSTestAgents@c4454a549b15974bcf5f0a4b30c021cd220d33f9
- Branch / Tag: refs/tags/v0.2.0
- Owner: https://github.com/TomMcGrath7
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@c4454a549b15974bcf5f0a4b30c021cd220d33f9
- Trigger Event: release

iostestagents 0.2.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

MobileTestAI

Why MobileTestAI?

What makes it different

Use cases

How It Works

Prerequisites

Installation

Setting Up an LLM Provider

Option 1: Ollama (local, free)

Option 2: OpenAI

Option 3: Anthropic (Claude)

Choosing a provider

Quick Start

Multi-Device Scenarios

Orchestration Features

CLI Reference

Writing Good Goals

Device Backends

Architecture

Development

Limitations

Related Projects

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance