Patter SDK

Quickstart • Features • Templates • Configuration • Docs

Patter is the open-source SDK that gives your AI agent a phone number. Point it at any function that returns a string, and Patter handles the rest: telephony, speech-to-text, text-to-speech, and real-time audio streaming. You build the agent — we connect it to the phone.

Quickstart

Set the env vars your carrier and engine need:

export TWILIO_ACCOUNT_SID=ACxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
export TWILIO_AUTH_TOKEN=your_auth_token
export OPENAI_API_KEY=sk-xxxxxxxxxxxxxxxxxxxxxxxx

Python

pip install getpatter

from getpatter import Patter, Twilio, OpenAIRealtime

phone = Patter(carrier=Twilio(), phone_number="+15550001234")
agent = phone.agent(engine=OpenAIRealtime(), system_prompt="You are a friendly receptionist for Acme Corp.", first_message="Hello! How can I help?")
await phone.serve(agent, tunnel=True)

TypeScript

npm install getpatter

import { Patter, Twilio, OpenAIRealtime } from "getpatter";

const phone = new Patter({ carrier: new Twilio(), phoneNumber: "+15550001234" });
const agent = phone.agent({ engine: new OpenAIRealtime(), systemPrompt: "You are a friendly receptionist for Acme Corp.", firstMessage: "Hello! How can I help?" });
await phone.serve({ agent, tunnel: true });

tunnel: true spawns a Cloudflare tunnel and points your Twilio number at it. In production, pass webhook_url / webhookUrl to the constructor instead. Every carrier and provider reads its credentials from environment variables by default; see each SDK's README for the full catalog.

How Patter compares

Patter is purpose-built for production voice over real telephony. Out of the box you get Twilio + Telnyx parity (DTMF, transfer, AMD, voicemail drop, recording), both architectures from one API — speech-to-speech (Realtime / ConvAI engines) and the sandwich pipeline (STT → LLM → TTS) — and production-grade barge-in / VAD / IVR primitives that work the same on every carrier. Observability is vendor-neutral OpenTelemetry tracing, plus a built-in dashboard and tunnel; no extra collector required. The 4-line quickstart above replaces ~50 lines of glue you'd otherwise write against a generic voice-agent toolkit, and the Python and TypeScript SDKs are identical — same surface, same hooks, same events — so cross-runtime teams ship the same agent twice without rewriting it.

Features

Feature	Method	Template
Inbound calls	`phone.serve(agent)`	patter-inbound-agent
Outbound calls + AMD	`phone.call(to, machineDetection)`	patter-outbound-calls
Tool calling (webhooks)	`agent(tools=[...])`	patter-tool-calling
Custom STT + TTS	`agent(provider="pipeline")`	patter-custom-voice
Dynamic variables	`agent(variables={...})`	patter-dynamic-variables
Custom LLM (any model)	`serve(onMessage=handler)`	patter-custom-llm
Dashboard + metrics	`serve(dashboard=True)`	patter-dashboard
Output guardrails	`agent(guardrails=[...])`	docs
Call recording	`serve(recording=True)`	docs
Call transfer	`transfer_call` (auto-injected)	docs
Voicemail drop	`call(voicemailMessage="...")`	patter-outbound-calls
Test mode (no phone)	`phone.test(agent)`	docs
Built-in tunnel	Cloudflare (auto)	docs
Phone-as-a-tool (LangChain / OpenAI Assistants / Hermes)	`PatterTool(phone, agent).execute(...)`	docs

How It Works

From code to phone call — Your AI agent connects to real phone calls through the Patter SDK.

AI Agents		Patter SDK		Phone Calls
ChatGPT _OpenAI Claude _Anthropic Your AI Agent _{on_message}	→	Patter SDK Connect any AI to any phone `STT` · `TTS` · `WebSocket`	→	Twilio _Telephony Telnyx _Telephony

Templates

Each template is a self-contained repo — clone, add your .env, and run. Both Python and TypeScript included.

Template	Description	Repo
Inbound Agent	Answer calls as a restaurant booking assistant	patter-inbound-agent
Outbound Calls	Place calls with AMD and voicemail drop	patter-outbound-calls
Tool Calling	CRM lookup + ticket creation via webhook tools	patter-tool-calling
Custom Voice	Pipeline mode: Deepgram STT + ElevenLabs TTS	patter-custom-voice
Dynamic Variables	Personalize prompts per caller using CRM data	patter-dynamic-variables
Custom LLM	Bring your own model (Claude, Mistral, LLaMA)	patter-custom-llm
Dashboard	Real-time monitoring with cost + latency tracking	patter-dashboard
Production Setup	Everything enabled: tools, guardrails, recording, dashboard	patter-production

# Example: clone and run the inbound agent template
git clone https://github.com/PatterAI/patter-inbound-agent
cd patter-inbound-agent
cp .env.example .env    # fill in your keys
cd python && pip install -r requirements.txt && python main.py

Configuration

Environment Variables

Variable	Required	Description
`OPENAI_API_KEY`	Yes (Realtime mode)	OpenAI API key with Realtime access
`TWILIO_ACCOUNT_SID`	Yes	Twilio account SID
`TWILIO_AUTH_TOKEN`	Yes	Twilio auth token
`TWILIO_PHONE_NUMBER`	Yes	Your Twilio phone number (E.164)
`DEEPGRAM_API_KEY`	Pipeline mode	Deepgram STT key
`ELEVENLABS_API_KEY`	Pipeline mode	ElevenLabs TTS key
`ANTHROPIC_API_KEY`	Custom LLM	For bringing your own model
`WEBHOOK_URL`	No	Public URL (auto-tunneled via Cloudflare if omitted)

cp .env.example .env
# Edit .env with your API keys

Telnyx: Telnyx is a fully supported telephony provider alternative to Twilio. Both carriers receive equal support for DTMF, transfer, and metrics. Recording is Twilio-only.

Docker

docker compose up

See Dockerfile and docker-compose.yml for the default configuration.

Voice Modes

Mode	Quality	Best For
OpenAI Realtime (`engine=OpenAIRealtime()`)	High	Fluid, low-latency conversations
ElevenLabs ConvAI (`engine=ElevenLabsConvAI()`)	High	ElevenLabs-managed conversation flow
Pipeline (`stt=DeepgramSTT(), tts=ElevenLabsTTS()`)	High	Independent control over STT / LLM / TTS

Pipeline mode composes STT + LLM + TTS sequentially and inherits the latency of each provider plus the endpointing window. For the fastest turn UX pick engine=OpenAIRealtime(), or pair the pipeline with low-latency providers such as Cerebras/Groq for the LLM and ElevenLabs Turbo v2.5 for TTS.

Provider Notes

ElevenLabs free tier — the library voice catalog is not reachable via API (402 Payment Required). Set ELEVENLABS_VOICE_ID to a voice you own (cloned or generated) before phone.serve(). The SDK resolves ~45 well-known names ("rachel", "adam", …) to their UUIDs automatically; custom voices must be referenced by ID.
Telnyx outbound — calls will return 403 D38 until your connection has an "Outbound Profile" attached in the Telnyx portal. Inbound and Call Control answer flows work without it.
Google Gemini free tier — gemini-2.0-flash has a hard quota=0 on the free tier. Enable billing on the project before using Gemini as the LLM.
Whisper STT — on mulaw 8 kHz inputs Whisper routinely hallucinates short fillers ("you", ".", "thank you"). The pipeline drops these by default plus any duplicate / sub-500 ms back-to-back final, so you won't hear overlapping turns. For production prefer OpenAITranscribeSTT (gpt-4o-transcribe) — same OPENAI_API_KEY, ~10× faster, no hallucination floor.
Model IDs — keep these updated per vendor release notes. Examples currently in use: gpt-4o-mini-realtime-preview, claude-haiku-4-5, llama-3.3-70b-versatile (Groq), gpt-oss-120b (Cerebras default — pass model="llama3.1-8b" for the smaller free-tier alternative).

API Reference

`Patter` (Python & TypeScript)

Method	Description
`Patter(carrier=Twilio(), phone_number, ...)`	Create client bound to a carrier and phone number
`agent(engine=..., system_prompt, first_message?, tools?, ...)`	Create an agent configuration
`serve(agent, port?, tunnel?, dashboard?, ...)`	Start the embedded server and listen for calls
`call(to, agent?, machine_detection?, voicemail_message?, ring_timeout?, ...)`	Place an outbound call

call() accepts a ring_timeout (seconds) that maps to Twilio's Timeout dial parameter and Telnyx's timeout_secs. When the carrier reports no-answer, busy, or canceled, the outcome is forwarded to the dashboard's /webhooks/twilio/status callback so it appears in the call log even if no media frames were ever exchanged.

serve() options:

Option	Type	Description
`agent`	`Agent`	Agent configuration to use for calls
`port`	`int`	Port to listen on (default: 8000)
`dashboard`	`bool`	Enable the built-in monitoring dashboard
`recording`	`bool`	Enable call recording via the telephony provider
`onCallStart`	`callable`	Called when a call connects
`onCallEnd`	`callable`	Called when a call ends with transcript and metrics
`onTranscript`	`callable`	Called on each transcript turn

agent() options:

Option	Type	Description
`system_prompt`	`str`	Prompt with optional `{variable}` placeholders
`variables`	`dict`	Values substituted into prompts
`voice`	`str`	TTS voice name
`first_message`	`str`	Opening message (supports `{variable}` placeholders)
`tools`	`list`	Tool definitions with `name`, `description`, `parameters`, `webhook_url`
`guardrails`	`list`	Output guardrail rules

Contributing

Pull requests are welcome.

# Python SDK
cd libraries/python && pip install -e ".[dev]" && pytest tests/ -v

# TypeScript SDK
cd libraries/typescript && npm install && npm test

Please open an issue before submitting large changes so we can discuss the approach first.

License

MIT — see LICENSE.

Name		Name	Last commit message	Last commit date
Latest commit History 73 Commits
.agents/skills/line-voice-agent		.agents/skills/line-voice-agent
.crush/skills		.crush/skills
.githooks		.githooks
.github		.github
.goose/skills		.goose/skills
dashboard-app		dashboard-app
docs		docs
examples/notebooks		examples/notebooks
libraries		libraries
scripts		scripts
tests/parity		tests/parity
.env.example		.env.example
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
CHANGELOG.md		CHANGELOG.md
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
docker-compose.yml		docker-compose.yml
skills-lock.json		skills-lock.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Patter SDK

Quickstart

How Patter compares

Features

How It Works

Templates

Configuration

Environment Variables

Docker

Voice Modes

Provider Notes

API Reference

`Patter` (Python & TypeScript)

Contributing

License

About

Uh oh!

Releases 10

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Patter SDK

Quickstart

How Patter compares

Features

How It Works

Templates

Configuration

Environment Variables

Docker

Voice Modes

Provider Notes

API Reference

Patter (Python & TypeScript)

Contributing

License

About

Topics

Resources

License

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 10

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

`Patter` (Python & TypeScript)

Packages