DroidAI

AI-Powered Android Device Controller
Complete User Guide & Tutorial

Control one phone or an entire farm of devices using natural language. Just tell the AI what to do — it sees the screen, taps, types, and scrolls for you.

Version 1.0 • March 2026

1 Introduction & System Requirements
2 Installation & First-Time Setup
3 Connecting Your First Device
4 Configuring LLM Providers & API Keys
5 Interface Overview
6 Your First Task — Quick Start
7 Core Features Deep Dive
8 Multi-Device & Phone Farm Setup
9 App Cards & Playbooks
10 Mirror Mode
11 Loop & Repeat Mode
12 Stealth Mode
13 Personas, AI Rules & Presets
14 Macros, Workflows & Triggers
15 Screenshot Modes & Vision
16 Telegram Bot Remote Control
17 Settings Reference
18 Troubleshooting & FAQ

1. Introduction & System Requirements

What is DroidAI?

DroidAI is a desktop application that lets you control Android devices using natural language. Instead of manually tapping through apps, you simply describe what you want done — and DroidAI's AI agent takes over. It observes the device screen, decides which actions to take, executes them, and repeats until the task is complete.

DroidAI supports controlling multiple devices simultaneously, making it ideal for phone farms, testing labs, and automation workflows at scale.

How It Works

You type a command → AI reads the device screen (UI tree + screenshot)
↓
AI decides an action (tap, type, scroll, etc.) → Executes via ADB
↓
AI observes the result → Decides next action → Repeats until done

System Requirements

Component	Requirement
Operating System	Windows 10/11 (64-bit)
RAM	4 GB minimum, 8 GB recommended
GPU	OpenGL ES 2.0 compatible (for device mirroring)
Android Devices	Android 9 (API 28)+
USB	USB 2.0+ cable (data-capable, not charge-only)
Internet	Required for LLM API calls
LLM API Key	Anthropic, OpenAI, Gemini, DeepSeek, Grok, or custom

2. Installation & First-Time Setup

Step 1: Install DroidAI

Download the latest installer from droidai.live
Run the installer and follow on-screen instructions
Launch DroidAI from the Start Menu or desktop shortcut
Sign in with your Google account when the login screen appears

Step 2: Install ADB (Android Debug Bridge)

ADB is required for DroidAI to communicate with your Android devices. DroidAI bundles its own ADB, but if you encounter issues:

Download Android SDK Platform Tools from developer.android.com
Extract to a folder (e.g., C:\platform-tools)
Add the folder to your system PATH environment variable
Open a terminal and verify: adb version

Step 3: Enable USB Debugging on Your Phone

Open Settings on your Android device
Go to About Phone (or Software Information)
Tap Build Number 7 times until you see "You are now a developer"
Go back to Settings → Developer Options
Enable USB Debugging
Connect your phone to the PC via USB cable
Tap Allow when the USB debugging authorization prompt appears. Check "Always allow from this computer".

Use a data-capable USB cable. Many cables that come with chargers are charge-only and will not work with ADB.

For Samsung devices, you may also need to enable "USB debugging (Security settings)" in Developer Options.

Step 4: Verify Connection

adb devices

You should see your device serial number followed by device:

List of devices attached
ABCD1234    device

If you see unauthorized, check your phone for the authorization popup.

3. Connecting Your First Device

USB Connection

Connect your phone via USB with debugging enabled
Launch DroidAI — your device should appear automatically in the device grid
You'll see a live screen mirror of your device
The device panel shows the device name and a numbered badge (#1)

WiFi Connection (Wireless ADB)

Connect your phone via USB first
Click the Device button in the top toolbar
Click WiFi Connect
Enter your phone's IP address (find it in Settings → WiFi → tap your network)
Click Connect — you can now unplug the USB cable

WiFi ADB is slower than USB and may cause video lag. USB is recommended for production use, especially with phone farms.

Installing Portal APK

The Portal APK is a companion app that provides:

Faster UI tree reading via accessibility service (vs. slow ADB uiautomator)
Unicode text input via custom IME keyboard (Korean, Chinese, emoji, etc.)
UI Inspector overlay for debugging element detection

Find the P button on your device panel
Click the P button to install Portal
Wait for installation (10-15 seconds) — button turns green when ready
The K button should also turn green (keyboard active)

Button	Color	Meaning
P	Green	Portal installed, accessibility ON, up to date
P	Yellow	Portal installed but outdated — click to update
P	Red	Installation failed — check USB debugging permissions
K	Green	Portal IME keyboard active and ready
K	Red	IME not available — click to enable

If Portal installation fails: (1) Enable "Install via USB" in Developer Options, (2) For Samsung, enable "USB debugging (Security settings)", (3) Click the R button to reconnect.

4. Configuring LLM Providers & API Keys

DroidAI requires an LLM API key to power its AI agent. The AI reads the device screen and decides what actions to take.

Supported Providers

Provider	Recommended Models	Vision	Notes
Anthropic	Claude Sonnet 4, Claude Haiku	Yes	Best overall. Supports prompt caching for cost savings.
OpenAI	GPT-4o, GPT-4o-mini	Yes	Good alternative with fast response times.
Google Gemini	Gemini 2.0 Flash	Yes	Cost-effective option.
DeepSeek	DeepSeek Chat	No	Budget option, text-only (no screenshots).
Grok	Grok-2	Yes	xAI
Ollama Cloud	Various	Varies	Self-hosted models via Ollama.
Kimi	Moonshot	No	Chinese provider.
Custom	Any OpenAI-compatible API	Varies	Use with any OpenAI-format provider.

Getting an API Key

Anthropic (Recommended)

Go to console.anthropic.com
Create an account or sign in
Navigate to API Keys in the dashboard
Click Create Key and copy it (starts with sk-ant-)
Add credits to your account (API is pay-per-use)

OpenAI

Go to platform.openai.com
Create an account or sign in
Navigate to API Keys
Click Create new secret key and copy it (starts with sk-)

Entering Your API Key in DroidAI

Click the Settings button (gear icon) in the top toolbar
In AI Model section, select your provider
Paste your API key
Select a model from the dropdown
Click Save

Enable Prompt Caching (Anthropic only) to significantly reduce API costs — saves up to 90% on repeated requests.

Using a Custom Provider

If you have an OpenAI-compatible API endpoint (e.g., local model, proxy, third-party):

Select Custom from the provider dropdown
Enter the base URL (e.g., http://localhost:11434/v1 for Ollama)
Enter your API key (if required)
Type your model name manually

5. Interface Overview

DroidAI has a split-panel layout: Chat Panel on the left and Device Grid on the right.

Top Toolbar

Button	Function
Device	Device management: connect/disconnect, Portal, WiFi, restart ADB, resize
Mirror	Toggle mirror mode (green = ON). Forwards mouse/keyboard to device.
Settings	Open settings (API keys, agent config, display, streaming)
Report	Submit bug reports with diagnostic logs
User Menu	Subscription status, email, sign out

Chat Panel (Left Side)

The chat panel has four tabs:

Activity

Live execution log. Shows commands, AI reasoning, tool calls, and results.

Presets

Manage saved commands, AI personas, AI rules, and app cards.

Advanced

Macros (record/replay), Workflows (multi-step chains), Triggers.

Log

Raw debug/diagnostic logs for troubleshooting.

Command Input Bar (Bottom)

Button	Function
Stealth	Toggle stealth mode (red = ON). Human-like delays and jitter.
Loop	Toggle repeat/loop mode. Set count and interval.
Screenshot	Cycle: OFF → AUTO → ALWAYS.
App Cards	Open app card grid for quick app-specific tasks.
All / None	Select or deselect all connected devices.

Device Grid (Right Side)

Each connected device appears as a panel with live screen mirror. Panel controls:

Button	Function
Selection badge	Click to select/deselect for commands
P	Portal APK status & install
K	Keyboard IME status & enable
R	Reconnect device
L	Open per-device activity log
Enlarge	Full-screen device view
×	Disconnect device

6. Your First Task — Quick Start

Let's walk through your very first DroidAI task, step by step.

Prerequisites Checklist

☑ DroidAI installed and running
☑ Signed in with Google account
☑ Phone connected via USB with USB debugging enabled
☑ Device visible in the device grid with live screen
☑ Portal installed (green P button)
☑ LLM API key configured in Settings

Example: Open YouTube and Search for a Video

Make sure your device is selected (click its selection badge)
Click the text input field at the bottom of the chat panel
Type: Open YouTube and search for "lofi hip hop radio"
Press Enter (or click the Send button)
Watch the Activity tab — the AI will launch YouTube, tap search, type the query, and press Enter

You can watch the AI's actions in real-time on both the device screen and in the Activity tab.

More Example Commands

Command	What It Does
`Open Instagram and like 3 posts in my feed`	Launches Instagram, scrolls feed, likes 3 posts
`Go to Settings and turn on WiFi`	Opens Settings, navigates to WiFi toggle
`Open Chrome and go to google.com`	Launches Chrome, navigates to URL
`Send "Hello!" to Mom on WhatsApp`	Opens WhatsApp, finds contact, sends message

Stopping a Task

Click the red Stop button during execution, or press Ctrl+C while the input field is focused.

7. Core Features Deep Dive

How the AI Agent Works

For each step of a task, the AI agent follows this cycle:

Observe — Read the UI tree and optionally take a screenshot
Think — Analyze the screen state, decide the best action
Act — Execute one action: tap, type, scroll, swipe, etc.
Repeat — Loop until task complete or max steps reached

Available Actions

Action	Description	Example
`tap(x, y)`	Tap at screen coordinates	Tap a button
`click(index)`	Click element by UI tree index	Click "Send" by index
`type(index, text)`	Type text into input field	Type a search query
`scroll(dir)`	Scroll up/down/left/right	Scroll feed
`swipe(dir)`	Swipe gesture	Swipe through stories
`long_tap(x, y)`	Long press at coordinates	Open context menu
`launch(pkg)`	Open app by package name	Launch Instagram
`back / home / enter`	System navigation buttons	Go back
`wait(ms)`	Pause execution	Wait for content
`done`	Mark task finished	Task complete

Planning Mode

When enabled, the AI generates a step-by-step plan before execution. Useful for complex tasks where you want to see the approach first.

Toggle via the Mode button in the command bar
The plan appears in the Activity tab before execution starts
Minimum 5 actions enforced before early completion

Max Steps

Maximum actions per task (default: 30, configurable: 10-100). Prevents infinite loops. Increase for long tasks; decrease for simple actions.

8. Multi-Device & Phone Farm Setup

DroidAI is built for scale. Connect and control dozens of Android devices simultaneously from a single PC.

Hardware Setup

USB Hub Selection

For multiple phones, you need a powered USB hub:

Hub	Ports	Best For	Notes
Sipolar A-423	20	Medium farm (10-20)	Industrial grade, dedicated power supply
Sipolar A-400	10	Small farm (5-10)	Compact, desk-friendly
Sipolar A-812	30	Large farm (20-30)	Rack-mountable
Anker USB 3.0	7-13	Small setups (3-7)	Consumer-grade, reliable

Always use a powered USB hub with external power adapter. Unpowered hubs cannot supply enough current for multiple phones.

Recommended Phones for Farms

Phone	Price (Used)	Android	Pros
Samsung Galaxy S8/S9	$40-60	9-10	Cheap, reliable ADB, common
Samsung Galaxy A series	$50-80	11-13	Good value, newer Android
Google Pixel 3/4	$50-70	12-13	Stock Android, fast ADB
Xiaomi Redmi Note	$40-60	11-13	Budget, good specs

For best compatibility, use Samsung Galaxy S8/S9 or Google Pixel devices.

Connecting Multiple Devices

Connect all phones to the USB hub via data cables
Connect the USB hub to your PC
Enable USB debugging on every device
Authorize USB debugging on each phone
Launch DroidAI — all devices should appear in the grid
Install Portal on all: Device menu → Install Portal All

Sending Commands to Multiple Devices

Option 1: All Devices

Click "All" to select every device, then type your command. Each device gets its own independent AI agent.

Option 2: Selected Devices

Click the selection badge on each device to target. Commands run only on selected devices.

Performance Considerations

Devices	Recommended PC	Notes
1-5	Any modern PC, 8 GB RAM	Runs smoothly
5-15	16 GB RAM, decent GPU	Lower resolution to 480p
15-30	32 GB RAM, dedicated GPU	Use 360p resolution
30+	Multiple PCs recommended	Split devices across PCs

9. App Cards & Playbooks

App Cards are pre-configured guides that help the AI understand specific apps. They contain navigation hints and app-specific tips that significantly improve accuracy.

How App Cards Work

When you send a command, DroidAI checks if the current app has an app card. If found, the card's instructions are injected into the AI's system prompt.

Using the App Card Grid

Click App Cards button in the command bar
A grid popup shows all available app cards (25+ pre-configured)
Click any card to open the detail dialog
View/edit card content, add instructions, then click Run

Pre-Configured Apps

Social Media

Instagram, Facebook, X (Twitter), Threads, TikTok, Snapchat, Reddit, Pinterest, LinkedIn

Messaging

WhatsApp, Telegram, Discord, KakaoTalk, LINE, WeChat

Media

YouTube, Spotify, Netflix

Utility

Chrome, Gmail, Google Maps, Play Store, Settings, Amazon, Naver

Creating Custom App Cards

Go to Presets tab → App Cards
Click New Card
Enter the package name
Write navigation tips and key UI elements
Click Save

10. Mirror Mode

Mirror Mode lets you manually control a device using your PC's mouse and keyboard.

Enabling Mirror Mode

Click Mirror in the top toolbar (turns green)
Click on a device panel to select it
Mouse clicks and keyboard input now go to the device

Controls in Mirror Mode

PC Input	Device Action
Left click	Tap at position
Click and drag	Swipe/drag gesture
Mouse wheel	Scroll on device
Keyboard typing	Text input
Right click	Back button

Mirror Mode + Multi-Device

When multiple devices are selected, input is forwarded to all selected devices simultaneously. Useful for setting up multiple phones with the same configuration.

Mirror Mode and AI agent commands can conflict. Turn off Mirror Mode before sending AI commands.

11. Loop & Repeat Mode

Loop Mode repeats the same command multiple times with optional delays between cycles. Essential for repetitive tasks.

Setting Up a Loop

Click Loop button in the command bar
Set Count: how many times to repeat (1-999)
Set Interval: minutes between cycles (0-999)
Type your command and send — it repeats automatically

Example: Social Media Engagement Loop

Loop: 10 cycles, 5 minute interval
Command: "Open Instagram, scroll feed, like 3 posts, then close the app"

Result: Every 5 minutes, each device opens Instagram,
likes 3 posts, and closes. Repeats 10 times over ~50 minutes.

Use longer intervals (5-15 min) for social media tasks to appear more natural.

12. Stealth Mode

Stealth Mode makes the AI's actions appear more human-like by introducing natural variations.

What Stealth Mode Does

Feature	Description
Tap Jitter	Random offset on tap coordinates (±12px)
Speed Variation	Random ±20% variation in action timing
Reading Pauses	Random pauses (0.5-3s) between actions
Action Delays	Variable delays between consecutive actions

When to Use Stealth Mode

Social media automation — Prevents detection by platform algorithms
Account farming — Makes bot behavior less detectable
Extended sessions — More natural interaction patterns

Stealth Mode is ON by default. For testing/debugging, turn it OFF for fast, precise actions.

13. Personas, AI Rules & Presets

Saved Commands

Save frequently-used commands for one-click execution:

Go to Presets tab → Saved Commands
Enter a name and command text
Click Add
Click Play next to any saved command to execute

AI Personas

Personas customize the AI agent's behavior. Only one persona can be active at a time.

Examples: Speed Runner (fast, skip verifications), Careful (verify before/after each action), Social Media Expert (navigate social apps expertly).

AI Rules

Rules are constraints always injected into the AI's system prompt. Multiple rules can be active simultaneously.

Examples: "Never purchase anything", "Close ads immediately", "Always use search instead of scrolling", "Skip sponsored content when liking".

14. Macros, Workflows & Triggers

Macros (Record & Replay)

Macros record AI actions and replay them without the LLM, saving API costs.

Recording a Macro

Go to Advanced tab → Macros
Click Record (turns red)
Send a command and let the AI execute
Click Stop Recording

Playing a Macro

Select the macro and click Play — replays exact actions without calling the LLM.

Macros replay exact coordinates. If the app UI has changed, the macro may fail. Re-record if needed.

Workflows

Chain multiple commands into sequential flows. Each step runs only after the previous one completes.

Go to Advanced tab → Workflows
Click New Workflow
Add steps with commands
Click Run to execute the chain

Triggers

Condition-based auto-execution. Set conditions and DroidAI evaluates them with the LLM to decide when to act.

Go to Advanced tab → Triggers
Create a trigger with condition and action
Enable the trigger — DroidAI monitors and acts when conditions are met

15. Screenshot Modes & Vision

DroidAI can send screenshots to the AI for visual understanding. Three modes available:

Mode	Description
OFF	UI tree only, no screenshots
AUTO	UI tree + screenshot when tree empty or agent stuck (≥2 failures)
ALWAYS	UI tree + screenshot every iteration

When to Use Each Mode

Reels/Shorts — Video players have minimal UI trees. Use ALWAYS.
Games — Canvas-rendered UIs. Screenshots essential.
Standard apps — OFF or AUTO is usually sufficient.

Screenshots increase API costs. Use AUTO for the best balance of accuracy and cost.

Non-vision LLM models automatically fall back to OFF mode.

16. Telegram Bot Remote Control

Control DroidAI remotely from your phone using a Telegram bot.

Setting Up the Telegram Bot

Open Telegram and search for @BotFather
Send /newbot and follow prompts to create a bot
Copy the bot token
Find your Chat ID via @userinfobot
In DroidAI: Settings → Telegram Bot
Paste token, enter Chat ID, enable Auto-start, click Save

Telegram Commands

Command	Description
`/help`	List all available commands
`/devices`	Show all connected devices
`/select [device]`	Set default device
`/run [command]`	Execute a task
`/screenshot`	Take and receive a screenshot
`/repeat [n] [interval] [cmd]`	Loop execution remotely
`/stop`	Stop current execution
`/status`	Check device and agent status

17. Settings Reference

AI Model

Setting	Default	Description
Provider	Anthropic	LLM provider selection
API Key	—	Your provider's API key
Model	—	Specific model to use
Prompt Caching	ON	Cache system prompts (Anthropic only)

Agent

Setting	Default	Range	Description
Max Steps	30	10-100	Maximum actions per task
Action Delay	0s	0-5s	Pause between actions
Conversation History	20	5-40	Messages kept in context
Action History	5	1-15	Prior actions in prompt
Full Context	3	1-5	UI tree detail level
UI Tree Filter	Concise	—	Concise or Detailed output

Display

Setting	Default	Description
Language	English	UI language (13 languages)
Device Panel Size	100%	Scale device panels (40-200%)
Font Scale	100%	UI text size (50-200%)

Streaming (Scrcpy)

Setting	Default	Range	Description
Resolution	720p	240-1080p	Device screen resolution
Bitrate	4 Mbps	1-12 Mbps	Stream quality
Max FPS	30	5-60	Frame rate

18. Troubleshooting & FAQ

Device Not Appearing

Symptom	Solution
Device not listed	Check USB cable (data, not charge-only). Try `adb devices`.
`unauthorized` in adb	Check phone for USB debugging popup. Tap "Allow".
`offline` in adb	Unplug and replug USB. Try different port.
Device appears then disappears	Faulty cable or USB port. Try different cable/port.

Portal Issues

Symptom	Solution
P button stays red	Enable "Install via USB" in Developer Options.
P button yellow	Click P to reinstall latest version.
Accessibility not enabling	Manually: Settings → Accessibility → DroidAI Portal → ON.
K button red	Settings → Language & Input → enable DroidAI Keyboard.

AI Agent Issues

Symptom	Solution
Agent does nothing	Check API key. Verify internet. Check Activity for errors.
Agent taps wrong elements	Enable Screenshot mode (AUTO or ALWAYS).
Agent stuck in loop	Click Stop. Try rephrasing your command.
"Max steps reached"	Increase Max Steps in Settings (up to 100).
Text input fails	Ensure Portal IME is active (green K).

Frequently Asked Questions

Q: How much does the LLM API cost?

A typical task (10-15 steps) costs ~$0.01-0.03 with Claude Sonnet, ~$0.005-0.01 with GPT-4o-mini. Screenshots add ~$0.01-0.02 each.

Q: Can I use DroidAI offline?

Internet is required for LLM APIs. However, you can use a local model via Ollama with the Custom provider for offline use.

Q: How many devices can I connect?

No hard limit. Practical limits depend on hardware. Users commonly run 10-30 devices per PC.

Q: Does it work with emulators?

Yes. Any ADB-compatible device works: BlueStacks, NoxPlayer, Android Studio AVD. Connect via adb connect localhost:PORT.

Q: Is my API key stored securely?

API keys are stored locally in settings.json in your AppData folder. Never sent to DroidAI servers.

Select Language

DroidAI

Table of Contents

1. Introduction & System Requirements

What is DroidAI?

How It Works

System Requirements

2. Installation & First-Time Setup

Step 1: Install DroidAI

Step 2: Install ADB (Android Debug Bridge)

Step 3: Enable USB Debugging on Your Phone

Step 4: Verify Connection

3. Connecting Your First Device

USB Connection

WiFi Connection (Wireless ADB)

Installing Portal APK

4. Configuring LLM Providers & API Keys

Supported Providers

Getting an API Key

Anthropic (Recommended)

OpenAI

Entering Your API Key in DroidAI

Using a Custom Provider

5. Interface Overview

Top Toolbar

Chat Panel (Left Side)

Activity

Presets

Advanced

Log

Command Input Bar (Bottom)

Device Grid (Right Side)

6. Your First Task — Quick Start

Prerequisites Checklist

Example: Open YouTube and Search for a Video

More Example Commands

Stopping a Task

7. Core Features Deep Dive

How the AI Agent Works

Available Actions

Planning Mode

Max Steps

8. Multi-Device & Phone Farm Setup

Hardware Setup

USB Hub Selection

Recommended Phones for Farms

Connecting Multiple Devices

Sending Commands to Multiple Devices

Option 1: All Devices

Option 2: Selected Devices

Performance Considerations

9. App Cards & Playbooks

How App Cards Work

Using the App Card Grid

Pre-Configured Apps

Social Media

Messaging

Media

Utility

Creating Custom App Cards

10. Mirror Mode

Enabling Mirror Mode

Controls in Mirror Mode

Mirror Mode + Multi-Device

11. Loop & Repeat Mode

Setting Up a Loop

Example: Social Media Engagement Loop

12. Stealth Mode

What Stealth Mode Does

When to Use Stealth Mode

13. Personas, AI Rules & Presets

Saved Commands

AI Personas

AI Rules

14. Macros, Workflows & Triggers

Macros (Record & Replay)

Recording a Macro

Playing a Macro

Workflows

Triggers