Overview¶

✅ Implemented 🧪 Tested

Current state: Navigation graph is fully implemented. Screen fingerprinting (4-tier strategy), navigateTo smart routing, explore (discover/validate/hybrid modes), and graph persistence in SQLite are all active. Benchmarked at ≤1ms. See the Status Glossary for chip definitions.

As AutoMobile explores an app it automatically maps what it observes into a navigation graph.

flowchart TD
    subgraph Navigation Graph
        Home["🏠 Home Screen"]
        Profile["👤 Profile Screen"]
        Settings["⚙️ Settings Screen"]
        EditProfile["✏️ Edit Profile"]
        Notifications["🔔 Notifications"]
        Privacy["🔒 Privacy Settings"]
    end

    Home -->|"👆 tapOn 'Profile'"| Profile
    Home -->|"👆 tapOn 'Settings'"| Settings
    Home -->|"👆 tapOn 'Notifications'"| Notifications
    Profile -->|"👆 tapOn 'Edit'"| EditProfile
    Profile -->|"🔘 pressButton 'back'"| Home
    EditProfile -->|"🔘 pressButton 'back'"| Profile
    Settings -->|"👆 tapOn 'Privacy'"| Privacy
    Settings -->|"🔘 pressButton 'back'"| Home
    Privacy -->|"🔘 pressButton 'back'"| Settings
    Notifications -->|"🔘 pressButton 'back'"| Home

    classDef screen fill:#525FE1,stroke-width:0px,color:white;
    class Home,Profile,Settings,EditProfile,Notifications,Privacy screen;

Upon every observation after a screen has reached UI stability:

Create unique screen signature by fingerprinting the observation paired with AutoMobile SDK navigation events
Compare current vs previous screen
If we’re on a different unique navigation fingerprint, record the tool call as the edge in the graph.

This process has been benchmarked to take at most 1ms and it is a project goal to keep it within the limit. The graph is persisted as exploration takes place whether by the user or AI. As its built you can take advantage of it:

Navigate to Screen¶

The 🗺️ navigateTo tool uses the graph to find paths:

Finds target screen in graph
Calculates shortest path from current node to the target
Executes recorded actions to reach target
Verifies arrival at destination

Explore Efficiently¶

The 🔍 explore tool uses the graph to:

Avoid revisiting known screens
Prioritize unexplored branches
Track coverage of app features

Edge Cases & Limitations¶

Known Limitations¶

Multiple similar screens without navigation IDs
Risk: May produce same fingerprint
Mitigation: Include static text for differentiation
Cache expiration during long keyboard sessions
Risk: Lost navigation ID reference
Mitigation: Adjust cacheTTL based on use case
Screens with identical structure and no selected state
Risk: Cannot differentiate
Mitigation: Encourage SDK integration for perfect identification

Handled Edge Cases¶

✅ Nested scrollable containers
✅ Scrollable tab rows (critical fix)
✅ Keyboard show/hide transitions
✅ Empty hierarchies
✅ Deeply nested structures

Best Practices¶

For SDK-Instrumented Apps¶

✅ Do: - Use unique navigation resource-ids for each screen - Follow navigation.* naming convention - Ensure navigation IDs persist during keyboard

✅ Consider: - Add navigation IDs even to modal/overlay screens - Use descriptive names: navigation.ProfileEditScreen

For Non-SDK Apps¶

✅ Do: - Rely on Tier 3 shallow scrollable strategy - Ensure screens have distinguishing static text or selected states - Test fingerprinting across different app states

⚠️ Watch for: - Screens with identical layout but different data - Heavy use of dynamic content without static labels

For All Apps¶

✅ Do: - Cache previous fingerprint results for stateful tracking - Monitor confidence levels - Log fingerprint method for debugging

❌ Don’t: - Assume 100% accuracy without navigation IDs - Ignore confidence levels in decision-making - Skip validation on critical navigation paths