fix: increase OpenAI client max_retries to handle 429 rate limits by KRRT7 · Pull Request #245 · microsoft/typeagent-py

KRRT7 · 2026-04-23T08:41:55Z

Summary

Bumps max_retries from the default (2) to 5 across all four AsyncOpenAI/AsyncAzureOpenAI construction sites (create_async_openai_client and _make_azure_provider)
The openai SDK's built-in exponential backoff handles transient 429s automatically — this just gives it more attempts
Three online tests (test_ingest_podcast, test_query_method_basic, test_transcript_knowledge_extraction_slow) were failing intermittently due to 429s

Test plan

make check (pyright) — 0 errors
pytest tests/test_model_adapters.py tests/test_utils.py — 29/30 pass (1 pre-existing failure fixed in test: add tests for resolve_azure_model_name #244)

The default max_retries=2 is too low for bursty workloads. Bump to 5 across all four AsyncOpenAI/AsyncAzureOpenAI construction sites so the built-in exponential backoff handles transient 429s.

## Summary - Adds 3 tests for `resolve_azure_model_name` covering: deployment name extraction from endpoint, fallback to provided model name, and default `AZURE_OPENAI_ENDPOINT` envvar usage - Fixes `test_create_embedding_model_uses_azure_deployment_name` — wasn't clearing `OPENAI_EMBEDDING_MODEL` from the environment, so the assertion failed when that envvar was set - Addresses review comment from #241 ## Test plan - [x] `make check` (pyright) — 0 errors - [x] `pytest tests/test_utils.py tests/test_model_adapters.py` — 33/33 pass - [x] `make test` — 496 pass, 3 fail (all transient 429 rate limits, addressed in #245)

fix: increase OpenAI client max_retries to handle 429 rate limits

3b20b7b

The default max_retries=2 is too low for bursty workloads. Bump to 5 across all four AsyncOpenAI/AsyncAzureOpenAI construction sites so the built-in exponential backoff handles transient 429s.

KRRT7 mentioned this pull request Apr 23, 2026

test: add tests for resolve_azure_model_name #244

Merged

3 tasks

gvanrossum approved these changes Apr 23, 2026

View reviewed changes

gvanrossum merged commit f707547 into microsoft:main Apr 23, 2026
16 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: increase OpenAI client max_retries to handle 429 rate limits#245

fix: increase OpenAI client max_retries to handle 429 rate limits#245
gvanrossum merged 1 commit intomicrosoft:mainfrom
KRRT7:fix/openai-retry-rate-limits

KRRT7 commented Apr 23, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

KRRT7 commented Apr 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Test plan

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

KRRT7 commented Apr 23, 2026 •

edited

Loading