Why We Built Taskmosis on Chrome DevTools Protocol
A technical look at how we achieve 85% accuracy and sub-2-minute execution times by combining AI with full browser control.
When we set out to build Taskmosis, we had a clear goal: create a browser automation tool that actually works reliably on real websites. After evaluating every approach in the market, we chose to build on the Chrome DevTools Protocol (CDP).
This wasn't an obvious choice. Some competitors avoid CDP entirely, claiming it's "detectable" or "insecure." Others go the opposite direction with screenshot-based vision models. We believe both extremes miss the mark.
Here's why CDP, when used correctly through a Chrome Extension, provides the best foundation for AI-powered browser automation.
Full Browser Control
Access every browser capability through the official Chrome debugging interface
- DOM manipulation
- Network control
- Input simulation
- File handling
Native Integration
Runs within your browser using your existing sessions and credentials
- Your cookies
- Your logins
- Your extensions
- Your settings
AI-Powered Decisions
Intelligent task planning with semantic understanding of web pages
- Natural language
- Context awareness
- Error recovery
- Smart retries
How Taskmosis Works
Comparing Automation Approaches
There are three main ways to build browser automation. Here's how they stack up:
CDP-Based (Taskmosis)
RECOMMENDEDThe Best of Both Worlds
Combines extension convenience with full browser control. Uses minimized accessibility tree sent to AI for planning, with screenshots when visual context needed. CDP commands execute locally.
Advantages
- Complete DOM access
- Native input events
- 90%+ smaller payload than raw DOM
- Works with your sessions
- Credentials never transmitted
Limitations
- Requires debugger permission
Extension-Only
Limited by Design
Uses only Chrome Extension APIs without debugger access. Simpler but significantly restricted.
Advantages
- No special permissions
- Simple architecture
Limitations
- Cannot access Shadow DOM
- Synthetic events detectable
- No file upload support
- No network control
- CSP restrictions
CUA (Cloud Screenshot Agents)
NOT RECOMMENDEDExpensive, Slow, Privacy Risk
Takes screenshots and sends them to external AI providers for interpretation. Fundamentally inefficient and exposes your screen content.
Advantages
- Works on any visual content
Limitations
- 10x slower execution
- 5-10x higher cost
- Screenshots sent to cloud
- Privacy concerns
- Hallucination prone
What CDP Enables
The Chrome DevTools Protocol provides access to powerful browser internals that extension-only approaches simply cannot reach.
DOM Domain
Full access to the document structure, including Shadow DOM and iframes
Input Domain
Native mouse and keyboard events that are indistinguishable from real user input
Network Domain
Monitor and intercept network requests for advanced automation scenarios
Page Domain
Control page lifecycle, capture screenshots, and handle navigation
Runtime Domain
Execute JavaScript in the page context for complex interactions
Accessibility
Read the accessibility tree for semantic understanding of page structure
Why Cloud Screenshot Agents Struggle
CUA (Cloud Screenshot) Approach
The Taskmosis Approach
Addressing the "CDP is Detectable" Myth
You may have heard that CDP-based automation is easily detected by websites. This is only true for headless browsers launched with remote debugging enabled.
When CDP is accessed through Chrome's extension APIs (using chrome.debugger), the situation is completely different:
- No navigator.webdriver flag - This flag is only set for automated browser launches, not extension debugger access
- No CDP JavaScript objects - The window.cdc_* objects only appear in remote debugging mode
- Identical browser fingerprint - Your browser looks exactly the same as manual browsing
- Your existing session - Uses your cookies, logins, and browser state
Bottom line: Taskmosis works on LinkedIn, banking sites, and other platforms with aggressive bot detection because it's indistinguishable from you browsing manually.
Frequently Asked Questions
Ready to Automate Your Browser Tasks?
Experience the power of CDP-based automation. 85% accuracy, ~1.2 minute execution, $0.10 per task.