AI agent for your terminal. Everything stays local.
Written in Go. Runs everywhere. Full control over your data.
A no-nonsense CLI for talking to LLMs with actual superpowers. Chat in REPL mode or fire one-off questions. Optional agent mode lets the AI run shell commands, edit files, make HTTP calls, and manage local todos/memory—all with approval gates you control.
- REPL chat with slash commands for config on the fly
- One-shot mode for quick questions (pipes stdin too)
- Streaming markdown rendering as you type
- Agent mode with tool calling (shell, file ops, HTTP, clipboard, todos, memory)
- Approval gates per action (bypass with `--yolo` if you're in a hurry)
- Chat history persisted in SQLite
- Vector memory store for long-term context (add/view/update/delete)
- Named lists/todos stored locally (accessible to the agent)
- Shell completions for bash/zsh/fish
- Custom system prompts (load from file)
Go 1.25+ and a Gemini API key. Set it in your environment:

```bash
export GEMINI_API_KEY="your_key_here"
```

The program reads these environment variables at runtime:
| Variable | Required for | Notes |
|---|---|---|
| `GEMINI_API_KEY` | Core CLI and agent mode | Required to talk to the Gemini API. |
| `ASKCLI_SERVER_KEY` | Remote server mode | Server-side: API key that clients must provide to authenticate. Client-side: used as a fallback if `ASKCLI_CLIENT_KEY` is not set. |
| `ASKCLI_CLIENT_KEY` | Remote client mode | Client-side: API key to authenticate with a remote server (alternative to `ASKCLI_SERVER_KEY`). |
| `TELEGRAM_BOT_TOKEN` | Telegram background mode | Required when running `ask --background=true`. |
| `AGENT_MAIL_API_KEY` | AgentMail tool | Required for the mail tool. |
| `INBOX_NAME` | AgentMail tool | The inbox name used by the mail tool. |
| `ELEVEN_LABS_API_KEY` | TTS tool | Required for `text_to_speech_file`. |
| `DISPLAY` or `WAYLAND_DISPLAY` | Clipboard tool | Needed when using clipboard features in a graphical session. |
| `PORT` | Server mode | Port to run the server on (default: 3000). |
If you only use the local CLI, `GEMINI_API_KEY` is the only required variable.
```bash
git clone https://github.com/zephex/go-ask.git
cd go-ask
go build -o ask
./ask "Your question here"
```

Or install globally:

```bash
sudo mv ask /usr/local/bin/
```

One-shot mode:

```bash
ask "What is a goroutine?"
ask --model exp "Analyze this architecture"
cat main.go | ask "Explain this code"
```

Chat mode:

```bash
ask --chat
# or
ask chat
```

Agent mode (enable tool calling):

```bash
ask --chat --agent
```

Auto-approve tool actions (use with caution):

```bash
ask --chat --agent --yolo
```

Quick names for common models:
- `free` – `gemma-4-26b-a4b-it` (default, fast)
- `cheap` – `gemini-3.1-flash-lite-preview` (ultra-light)
- `exp` – `gemini-3-flash-preview` (more capable)
Or pass any full model name.
Dial up the thinking time (higher = slower, more accurate):
- `HIGH` – deep reasoning
- `MED` / `MEDIUM` / `MID`
- `LOW`
- `MIN` / `MINIMAL` – fast, lightweight
```
--chat              Start REPL mode
--agent             Enable tool calling
--yolo              Auto-approve all actions
--stream            Stream markdown as it renders (default: on)
--system <file>     Load a custom system prompt
--cache             Enable explicit Gemini context caching (system prompt + tools)
--cache-ttl <dur>   Explicit cache TTL (e.g. 30m, 2h); 0 uses the API default
--model <alias>     Pick a model
--reason <level>    Set the reasoning level
--clear             Nuke chat history on startup
--connect <url>     Connect to a remote ask server (e.g. http://host:3000)
--server-key <key>  API key for remote server authentication (overrides env vars)
--background        Run as a background Telegram bot
```
Run ask as a server that other clients can connect to. The server processes requests and returns responses via HTTP.
Server setup:

1. Set the API key that clients will need to provide:

   ```bash
   export ASKCLI_SERVER_KEY="your-secret-key-here"
   export GEMINI_API_KEY="your-gemini-key"
   ```

2. Start the server (runs on port 3000 by default, or set the `PORT` env var):

   ```bash
   ask --background=true
   # or directly (without Telegram):
   go run . 2>/dev/null &
   # The server listens on /ask (authenticated) and /health (no auth)
   ```
Client usage:

Connect to the remote server from another machine or terminal using `--connect`:

- One-shot query:

  ```bash
  ask --connect http://server:3000 --server-key YOUR_KEY "your question"
  ```

- Interactive chat:

  ```bash
  ask --connect http://server:3000 --server-key YOUR_KEY --chat
  ```

- Using env vars (on the client):

  ```bash
  export ASKCLI_CLIENT_KEY="your-secret-key-here"
  ask --connect http://server:3000 --chat
  ```
Notes:

- The `--server-key` flag overrides the environment variables (`ASKCLI_CLIENT_KEY`, `ASKCLI_SERVER_KEY`).
- The server validates the `x-askcli-api-key` header on each request.
- The server shares the same SQLite database and vector memory across all clients.
- Remote clients do not support streaming over `--connect` (server-side only for now).
- YOLO mode is set to `true` on the server, so the agent may perform any tool call without approval.
Drop into an interactive session with slash commands for everything:
```bash
ask --chat
```

Available commands:

- `/help` – show this list
- `/status` – show the active model/settings
- `/model <name>` – switch models on the fly
- `/reason <level>` – adjust reasoning (HIGH/MED/LOW/MIN)
- `/stream on|off` – toggle streaming output
- `/agent on|off` – enable/disable tool calling
- `/yolo on|off` – auto-approve tools
- `/pwd` – print working directory
- `/cd <path>` – change directory for tool commands
- `/history [n]` – show the last n messages
- `/clear` – wipe the current conversation
- `/memories` – open the memory manager
- `/exit` or `/quit` – leave
Store facts locally and let the AI access them across chats. Useful for storing coding patterns, project context, or anything you want the agent to remember.
Access:
- CLI: `ask memories` (list), `ask memories manage` (interactive editor)
- Agent tools: `memory_view`, `memory_add`, `memory_update`, `memory_delete`
Manager commands:
- `l` / `list` – show all
- `d <n>` / `del <n>` – delete entry n
- `da` / `delall` – nuke everything
- `q` / `quit` – exit manager
Storage: Chromem persistent DB in `~/db`. Each memory gets a stable hash-based ID.
Management: Explicit (for now). Memories don't auto-inject into every prompt. You manage them via CLI or the agent tools. Automatic extraction/saving is disabled by design—keep it simple.
Architecture:

- Memories live in a local vector DB under `~/db`
- Each entry has a stable `id` (content hash) and `content`
- Retrieval code exists but isn't wired into agent prompts yet
- Automatic per-turn saving is commented out (can be enabled if needed)
Status: Memory is read/write explicit only. No automatic context injection yet. Call memory tools in the agent to use them.
Enable with `--agent`. The AI can call these tools automatically (with approval, unless `--yolo`):
`run_shell_command` – Execute bash

- Runs in your selected directory
- Returns stdout, stderr, exit code, timing
- Approval required (unless `--yolo`)

`read_file` – Read file contents

- Supports `start_line`/`end_line` for partial reads
- No approval needed (read-only)

`write_file` – Edit files

- Exact string replacement (`old_str` → `new_str`)
- Shows a diff preview before confirming
- Approval required (unless `--yolo`)
`clipboard` – Read/write system clipboard

- Read: no approval
- Write: approval required (unless `--yolo`)

`lists` – Manage todos/lists

- Actions: `create_list`, `delete_list`, `get_lists`, `add_item`, `update_item`, `delete_item`, `get_items`
- Deletions need approval (unless `--yolo`)

`http_request` – Make HTTP calls

- Verbs: `GET`, `POST`, `PUT`, `PATCH`, `DELETE`
- GET: no approval
- Write ops (POST/PUT/PATCH/DELETE): approval required (unless `--yolo`)
`mail` – Manage AgentMail inbox threads and messages

- Actions: `get_threads`, `get_thread`, `send_email`, `reply_to_message`, `forward_message`, `delete_thread`
- Requires the `AGENT_MAIL_API_KEY` and `INBOX_NAME` environment variables
- Send/reply/forward/delete: approval required (unless `--yolo`)

`memory_view` – List stored memories

- No approval needed

`memory_add` – Store a new memory

- No approval needed

`memory_update` – Update an existing memory

- No approval needed

`memory_delete` – Delete a memory entry

- No approval needed

`text_to_speech_file` – Generate voice notes (MP3 audio)

- Converts plain text into an MP3 file using ElevenLabs
- Output can be sent over Telegram with `send_document_over_telegram`
- Requires `ELEVEN_LABS_API_KEY`

`send_document_over_telegram` – Send files over Telegram

- Sends any file (documents, MP3s, voice notes, etc.) directly to Telegram
- Works seamlessly with voice note generation for AI-to-user voice delivery

`send_image_over_telegram` – Send images over Telegram

- Sends image files directly to the Telegram chat
Run ask as a Telegram bot. Chat with the AI directly in Telegram with slash commands for config.
Setup:
- Create a bot with BotFather on Telegram (get your token)
- Set the env var: `export TELEGRAM_BOT_TOKEN="your_token_here"`
- Start the bot: `ask --background=true`
Shared Context: The Telegram bot uses the same SQLite database and vector memory as the CLI, so your chat history and memories persist seamlessly across both interfaces. Switch between Telegram and terminal—context is always there.
Available commands:
- `/start` – welcome message
- `/help` or `/about` – show commands
- `/model <name>` – switch AI model
- `/reasoning <level>` – adjust reasoning (HIGH/MEDIUM/LOW/MINIMAL)
Voice & File Features:
- Send voice notes: The agent can generate voice notes (MP3 audio) from text and send them back to you over Telegram using the `text_to_speech_file` and `send_document_over_telegram` tools.
- Receive voice notes: Send a voice note to the bot and it will transcribe it to text and respond to the content.
- Send images and documents: The agent can send images and document files directly to your Telegram chat, with full support for multimodal content.
- Reply context: When you reply to any message (text, image, voice note, or document), the content of the replied-to message is passed to the agent so it can respond with full context.
Just send regular messages, voice notes, images, or documents; they'll be processed by the AI and the responses saved locally in SQLite. Perfect for keeping an AI assistant in your pocket that can reply with voice, images, and files too.
Generate completions for your shell:
```bash
ask completion bash
ask completion zsh
ask completion fish
```

- Chat history & lists (SQLite): `~/.ask-go.db`
- Vector memory (chromem): `~/db/`
Everything stays on your machine.
- `--yolo` is dangerous. It auto-approves shell commands, file writes, and HTTP requests. Only use it in controlled environments, or when you fully trust the AI's behavior.
- Chat data is local. Your conversations aren't sent anywhere except to the model provider (Gemini API).
- No telemetry. This is just Go + SQLite + local vectors.
MIT (see LICENSE)