Documentation v0.2.7

What is
Comet AI?

Comet AI Browser is an open-source, AI-native browser with permission-gated OS automation. It combines the speed of native applications with the intelligence of large language models to create a truly autonomous browsing experience.

Core Features

Built for Control

Architecture

System Design

Comet AI uses a layered architecture that separates concerns while maintaining tight integration between AI intelligence and platform capabilities.

Layer 1

User Interface Layer

Electron shell with native menus, SwiftUI sidebar on macOS, Flutter mobile app

Next.js FrontendSwiftUI macOS UIFlutter Mobile App
Layer 2

AI Orchestration Layer

Multi-model support (Gemini, Claude, GPT-4, Ollama) with RAG memory and thinking panels

AI Chat SidebarThinking PanelCommand ParserRAG Memory
Layer 3

Automation Engine

Task scheduling, shell command execution, browser automation, and screenshot capture

Scheduler ServiceShell ExecutorBrowser ControllerOCR Service
Layer 4

Security Layer

Triple-lock architecture: visual sandbox, syntactic firewall, human-in-the-loop

Permission StorePII ScrubberInjection DetectorQR Auth
Layer 5

Platform Integration

Cross-platform support for Windows, macOS, Linux, Android with native APIs

Electron Main ProcessFlutter BridgeFirebase SyncWiFi P2P

AI Integration

Multi-Model Support

Choose the right model for your task. Use cloud APIs for power or run locally for complete privacy.

G

Google Gemini

Cloud

Recommended
O

OpenAI GPT-4

Cloud

Supported
A

Anthropic Claude

Cloud

Supported
G

Groq

Cloud

Fastest
O

Ollama

Local

Private

Comparison

Why Comet AI?

FeatureComet AIStandard BrowserOther AI Browsers
AI Agent ControlLimited
Local LLM Support-
Shell Command Execution-
Background Scheduling-
Triple-Lock SecurityBasic
Cross-Device Control-
Open SourcePartial

Ready to get started?

Follow our comprehensive getting started guide to install Comet AI and configure your first AI-powered browsing session.

Edit on GitHub