Docs/AI Chat

AI Chat

Gemini BYOKPremium for more providers

Conversational chat, agent mode with 37 tools and multi-file auto-iteration. The Free plan enables Gemini; Premium unlocks OpenAI, Claude and GitHub Copilot. In every case you connect your own account or API key (BYOK / OAuth) — Pocket Code does not provide AI credits and does not proxy requests.

Your account, your model

Pocket Code is not a proxy: requests go directly from your device to the provider. Supports API key or OAuth login through the official CLIs (Claude Code, Codex, Gemini) — including automatic token refresh.

Supported providers

Google Gemini

Gemini 2.5 Pro · 2.5 Flash · 2.0 Flash · Flash Lite

Free · BYOK

OpenAI

GPT-4.1 · GPT-4.1 Mini · GPT-4o · GPT-4o Mini · o3-mini

Premium · BYOK / OAuth

Anthropic Claude

Sonnet 4 · Opus 4 · Claude 3.5 Sonnet · 3.5 Haiku

Premium · BYOK / OAuth

GitHub Copilot

13 multi-vendor models (OpenAI, Anthropic, Meta Llama 4, Mistral, DeepSeek)

Premium · BYOK / OAuth

Every conversation remembers which provider created it and automatically restores the model when reopened. Individual messages also store their provider, so you can mix Claude, OpenAI and Gemini answers in the same thread.

Reasoning & effort

Every provider exposes its own extended-thinking scale. Pocket Code surfaces them through a single model so you pick how hard the model thinks per conversation — and per agent in auto-iteration.

Provider	Tiers
Claude	low · medium · high · xhigh · max
OpenAI / Codex	none · minimal · low · medium · high · xhigh
Gemini	off · low · medium · high · max
GitHub Copilot	Per model family (gpt-5: minimal→high; Sonnet: Claude tiers)

•Tiers are discovered dynamically by probing each official CLI at startup — not hardcoded — so new levels appear automatically as the CLIs update.
•A single value object carries both formats: the provider's native tier name and the equivalent integer thinking budget — each client uses whichever it needs.
•Wired the right way per path: thinking block (Claude REST), --effort flag (Claude CLI), reasoning_effort field (OpenAI), -c model_reasoning_effort (Codex), LSP params (Copilot).
•A pink Reasoning chip in the chat toolbar lists the active model's tiers with descriptions; it hides itself when the model exposes no reasoning.
•In AI Office auto-iteration each agent role can be assigned its own reasoning tier, falling back to the session default.

Conversation modes

Three levels selectable from the chat top bar:

Plan

The model describes the steps without executing anything. Useful for reviewing the approach before touching code.

Code

Applies changes automatically via tools, but asks for confirmation before destructive operations.

Agent

Autonomous: chains tool calls, edits multiple files and only stops to ask on critical operations.

37 tools across 11 categories

The agent can read, write, search, run commands, control Git and connect to external services. Each category has granular individual permissions per provider and model.

Read-only (6)

read_file, list_dir, file_search, grep_search, get_errors, semantic_search

Write (6)

create_file, insert_edit, replace_string, multi_replace, delete_file, rename_file

Terminal (1)

run_in_terminal capturing stdout/stderr and exit code

TODOs (1)

manage_todo_list — visible list in the UI

Git (6)

status, commit, log, diff, push, pull

Database (5)

query, execute, schema, list_tables, list_connections

API (2)

REST request and list saved collections

SSH (2)

Remote exec on configured hosts

Browser (4)

Navigate, page info, evaluate JS, read console

Design / Deploy (4)

Export designs and trigger deploys

Granular permissions

From Settings → AI → Tool Permissions you decide exactly what each model can do. Granularity per provider, per model or per category, with individual switches per tool. Quick Allow/Deny all by section.

•4 provider chips (OpenAI, Claude, Gemini, Copilot) with optional scope to a specific model
•Filtering applied at every layer: tool builders, MCP HTTP server and Copilot LSP
•Binary toggle to disable all Copilot Language Server tools at once
•Interactive confirmation on destructive operations (Code and Plan modes)

Attachments & mentions

Image

Multimodal vision (GPT-4o, Gemini, Sonnet 4) for screenshots or diagrams

Document

Attach PDFs and text files that get concatenated to the prompt

Project code

Built-in picker for the active project file tree

@file mentions

Inject a reference to the file in the prompt without loading the content

Multi-file auto-iteration

Describe a complete project ("a notes app with authentication and sync") and the orchestrator generates a plan, presents it for review, and executes the tasks in order with resumable checkpoints.

•Structured plan: architecture, tech stack, modules, task list with dependencies
•Task states (Backlog → Todo → In progress → Review → Done) with real-time visualization
•Specialized workers per task type (documents, mapping, refactor, optimization, audit)
•Persistent checkpoints: if the app closes, the session resumes where it left off
•Automatic lint + syntax validation of each file before continuing
•~40 settings: separate providers for planning/coding/review, parallelism, tests, docs, Git auto-commit

Automated workflows

Define automations triggered by IDE events: after saving a file, on commit, when a test fails, on cron, etc. Each workflow can chain tool invocations or LLM calls.

on_file_save

on_file_change

on_git_commit

on_git_push

on_test_fail

on_build_complete

on_deploy

scheduled

on_chat_message

Skills & custom agents

Quick access to pre-packaged prompts from the / button on the input. Ten predefined skills cover the most common cases:

/explain

/fix

/tests

/docs

/optimize

/simplify

/modernize

/git-help

/sql-query

/learn

Create your own agents in .pocketcode/agents/*.md with frontmatter (name, description, model, tools, systemPrompt). Compatible with the Claude Code convention and MCP servers.

Project memory

Persistent information the model reads every turn from .pocketcode/memory/. Four types:

user

Developer role, preferences and knowledge

feedback

Rules or corrections on how to approach tasks

project

State and context of the active project

reference

Pointers to external systems (Linear, Grafana, Slack)

INSTRUCTIONS.md, AGENTS.md, CLAUDE.md or .cursorrules files are also supported — they are automatically injected into the system prompt.

Conversation management

•Unified sheet: local conversations alongside Claude CLI, Codex CLI and Gemini CLI sessions
•Pin, rename, deep search (also searches inside message bodies)
•Auto-compression of context when reaching 85% of the model window
•Export to Markdown or JSON preserving code blocks and tool calls
•Stop button during streaming: cancels response and leaves it truncated in history
•Visible quota: rate-limit windows with countdown (Claude 5h/7d, Copilot monthly, etc.)

Free / Premium comparison

	Free	Premium
Gemini (API or CLI)	✓	✓
OpenAI (GPT-4.1, o-series)	—	✓
Claude (Sonnet/Opus 4)	—	✓
GitHub Copilot	—	✓
Plan / Code / Agent modes	✓	✓
Tool permissions	✓	✓
Reasoning / effort control	✓	✓
Auto-iteration	✓	✓
Workflows	✓	✓
MD / JSON export	✓	✓
Cloud sync of conversations	—	✓

Next: Code Editor