CLAUDE.md

This file provides guidance to Claude Code (claude.ai/code) when working with code in this repository.

Project Overview

CanvasToAPI is a proxy server that wraps Gemini Canvas's web interface and exposes it as API endpoints compatible with OpenAI, Gemini, and Anthropic API formats. The system uses browser automation (Playwright with Camoufox/Firefox) to interact with the Canvas web interface and translates API requests into browser interactions.

Common Commands

Development

npm run dev              # Start dev server with hot reload (server + UI)
npm run dev:server       # Start only the server in dev mode
npm run dev:ui           # Build UI in watch mode

Production

npm start                # Build UI and start production server

Authentication Setup

npm run setup-auth       # Interactive auth setup (launches browser)
npm run save-auth        # Save authentication credentials

Code Quality

npm run lint             # Lint JavaScript and CSS
npm run lint:fix         # Auto-fix linting issues
npm run lint:js          # Lint only JavaScript files
npm run lint:css         # Lint only CSS/Less files
npm run format           # Format all files with Prettier
npm run format:check     # Check formatting without changes

UI Development

npm run build:ui         # Build Vue.js UI for production
npm run preview:ui       # Preview built UI

Architecture

Core System Components

The system follows a modular architecture with clear separation of concerns:

ProxyServerSystem (src/core/ProxyServerSystem.js)

Main orchestrator that integrates all modules
Manages HTTP/WebSocket servers
Coordinates between session routing, WebSocket connections, and request handling
Entry point: main.js instantiates and starts this system

SessionRegistry (src/core/SessionRegistry.js)

Manages WebSocket connections from browser contexts
Routes messages to appropriate MessageQueue instances
Implements grace period for reconnection attempts
Supports multiple concurrent connections (one per browser session)

RequestHandler (src/core/RequestHandler.js)

Processes incoming API requests
Coordinates retry logic and session switching
Delegates to session routing helpers for browser session management
Delegates to FormatConverter for API format translation

Session Routing

Uses ROUND to auto-select a browser session for each new request
Retries on the next available session for any non-user-aborted browser error
Uses SESSION_ERROR_THRESHOLD to disable unhealthy browser sessions after repeated browser / WebSocket errors

FormatConverter (src/core/FormatConverter.js)

Converts between API formats (OpenAI ↔ Gemini ↔ Anthropic)
Handles streaming and non-streaming responses

AuthSource (src/auth/AuthSource.js)

Loads authentication data from configs/auth/auth-N.json files
Validates and deduplicates accounts by email
Tracks loaded browser session material from configs/auth/auth-N.json

Request Flow

Client sends API request (OpenAI/Gemini/Anthropic format) → Express routes
RequestHandler receives request → FormatConverter normalizes to Gemini format
RequestHandler checks ConnectionRegistry for active WebSocket
If no connection: the request fails fast until a browser session reconnects
Request sent via WebSocket to browser context → injected script interacts with Canvas
Response streams back via WebSocket → FormatConverter translates to requested format
On selected failures: RequestHandler may retry on another browser session

Multi-Context Architecture

The system maintains multiple browser contexts simultaneously:

Each Google account gets its own browser context and page
Contexts are initialized on-demand or in background
Current session is chosen per request by the session registry
Immediate retry can switch to another live session when configured
Context pool rebalancing ensures optimal resource usage

UI Structure

Frontend: Vue.js 3 + Element Plus + Vite
Location: ui/ directory
Build output: ui/dist/ (served by Express)
Features: Session status monitoring and runtime settings

Configuration

Environment Variables

Key variables (see .env.example for full list):

PORT: API server port (default: 7861)
API_KEYS: Comma-separated API keys for client authentication
STREAMING_MODE: "real" or "fake" streaming
ROUND: Session selection strategy (round or random)
SESSION_ERROR_THRESHOLD: Disable a browser session after repeated browser / WebSocket errors (default: 3)
CAMOUFOX_EXECUTABLE_PATH: Custom browser executable path
MAX_CONTEXTS: Maximum number of accounts logged in simultaneously for faster switching (default: 1, memory usage: ~700MB per account)
LOG_LEVEL: Set to "DEBUG" for verbose logging

Model Configuration

Edit configs/models.json to customize available models and their settings.

Authentication Files

Location: configs/auth/auth-N.json (N = 0, 1, 2, ...)
Format: Playwright browser context state (cookies, localStorage, etc.)
Generated by: npm run setup-auth or VNC login in Docker

Key Technical Details

Browser Automation

Uses Playwright with Camoufox (privacy-focused Firefox fork)
Injects build.js script into the Canvas page for WebSocket communication
Script location: public/build.js (built from ui/app/)
Health monitoring via periodic checks and reconnection logic

WebSocket Communication

Each connection identified by a session id
MessageQueue pattern for request/response correlation
Grace period (60s) for reconnection before triggering callback

Account Switching

Automatic selection based on the configured session routing strategy
Supports immediate switching on specific HTTP status codes
System busy flag prevents concurrent switches
Lightweight reconnect attempts before full context switch

Streaming Modes

Real streaming: True SSE streaming from Canvas
Fake streaming: Buffer complete response, then stream to client

Development Notes

Testing

Test files in test/ directory
Client test scripts in scripts/client/
Auth test scripts in scripts/auth/

Linting & Formatting

ESLint for JavaScript (includes Vue plugin)
Stylelint for CSS/Less
Prettier for code formatting
Pre-commit hooks via Husky + lint-staged

Docker

Dockerfile supports VNC for browser interaction
Auth files mounted via volume: /app/configs/auth
Environment variables for configuration

Git Workflow

Main branch: main

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CLAUDE.md

Project Overview

Common Commands

Development

Production

Authentication Setup

Code Quality

UI Development

Architecture

Core System Components

Request Flow

Multi-Context Architecture

UI Structure

Configuration

Environment Variables

Model Configuration

Authentication Files

Key Technical Details

Browser Automation

WebSocket Communication

Account Switching

Streaming Modes

Development Notes

Testing

Linting & Formatting

Docker

Git Workflow

FilesExpand file tree

CLAUDE.md

Latest commit

History

CLAUDE.md

File metadata and controls

CLAUDE.md

Project Overview

Common Commands

Development

Production

Authentication Setup

Code Quality

UI Development

Architecture

Core System Components

Request Flow

Multi-Context Architecture

UI Structure

Configuration

Environment Variables

Model Configuration

Authentication Files

Key Technical Details

Browser Automation

WebSocket Communication

Account Switching

Streaming Modes

Development Notes

Testing

Linting & Formatting

Docker

Git Workflow