OptPilot

OptPilot is a lightweight orchestration layer for iterative optimization studies. It connects a user-owned method to a user-owned environment, runs candidate solutions, records objective metrics, and keeps an auditable evidence trail.

OptPilot is not an optimizer, simulator, RL framework, or LLM agent framework. Those pieces remain yours. OptPilot standardizes the loop around them:

A method proposes one or more candidates.
OptPilot validates and materializes each candidate.
An environment evaluates the candidate and reports metrics.
OptPilot records trials, observations, saved output files, method calls, and run metadata.
The method can use the accumulated evidence to propose the next candidates.

flowchart LR
  Env["Environment\nwhat can be evaluated"]
  Method["Method\nhow candidates are produced"]
  Study["Study\nobjective + budget + runtime"]
  Runner["OptPilot runner\nvalidate + materialize + evaluate"]
  Evidence["Evidence\nobservations + artifacts"]

  Study --> Env
  Study --> Method
  Env --> Runner
  Method --> Runner
  Runner --> Evidence
  Evidence --> Method

The boundary between environment and method is the candidate contract. Start with docs/candidate-contracts.md after the quickstart if you are adding a new integration.

Current Surface

Users author three public YAML config files:

config: environment: candidate contract, evaluator, metrics, trial workspace, saved output-file rules, and optional records.
config: method: method entrypoint, protocol, settings, compatibility requirements, and optional method runtime.
config: study: the concrete run binding an environment config to a method config with objective, budget, execution, and evidence settings.

OptPilot validates those YAML files with packaged JSON Schemas, compiles them into an internal StudySpec, and writes the compiled spec into every run directory.

Included in the current release:

JSON Schema validation for public environment, method, and study configs
parameter, file, and opaque candidate contracts
Python and command environment evaluators
Python and command methods with batch protocol, plus Python session protocol
local thread, local subprocess, and Docker/Podman-compatible environment execution
Docker/Podman-compatible command-method runtime isolation
local JSONL evidence store with run summaries, trials, observations, candidate records, saved output files, method calls, and events
curated job-shop scheduling tutorial environment with shared validation cases, a shared objective, parameter/file candidate variants, JobShopLib-backed method wrappers, Stable-Baselines3 RL, and LLM file-candidate examples
strategic-airlift DEVS example using an external generated simulator
OptPilot Studio, a local UI for browsing reusable catalogs, opening workspaces, checking compatibility, launching studies, inspecting runs, and optionally using an OpenHands-backed assistant

Not included:

production Bayesian optimization, RL, LLM, or metaheuristic frameworks
remote execution backends
automatic dependency inference for study runtimes
multi-user UI authentication

Prerequisites

OptPilot currently supports Python 3.10 and newer.

Before running the examples below, install:

Python 3.10+
uv

Install

OptPilot uses uv.

uv sync
uv run optpilot --help

Quickstart

Start with the job-shop parameter baseline. It is the recommended first run, works from a fresh checkout, and does not require API keys or external solvers.

The job-shop examples are the main tutorial comparison set: environments declare what they can evaluate, methods declare how they produce candidates, and study files bind one environment, one method, objective, budget, and runtime.

Run the job-shop parameter baseline:

uv run optpilot run examples/studies/job_shop_rule_parameters_baseline.yaml

Validate a config without running it:

uv run optpilot validate examples/studies/job_shop_rule_parameters_baseline.yaml

Open the local UI:

uv run optpilot ui --open-browser

The UI scans examples/ and user_catalog/ by default. Stop the local server with Ctrl-C in the terminal when you are done.

For the assistant-enabled Studio workflow with OpenHands, embedded Code Server, and per-workspace containers, see UI.

Some advanced examples and integration templates, such as Strategic Airlift and upstream llm_heuristic_search repository wrappers, require extra setup. Use the job-shop example first, then continue with the example-specific docs.

Full Config Examples

The first tutorial shows the full environment, method, and study YAML files for a runnable job-shop baseline:

examples/environments/job_shop_scheduling/environment_rule_parameters.yaml
examples/methods/fixed_rule_parameters/method.yaml
examples/studies/job_shop_rule_parameters_baseline.yaml

Read Getting Started for the full configs and the explanation of how the three files fit together. Python evaluator references use module:function; Python method references use module:Class.

User-Owned Catalog

Put reusable environments, methods, and resources under user_catalog/. Study YAML files are concrete run plans; keep them where you draft or launch them rather than registering them as reusable catalog entries:

user_catalog/
  environments/my_environment/
    environment.yaml
    evaluator.py
    assets/
  methods/my_method/
    method.yaml
    method.py
    assets/
  resources/my_resource/
    README.md

Environment and method directories own reusable implementation code and reusable config variants. Resources are reusable reference folders. Study configs are project-specific bindings.

Container Runtime Example

Run environment trials in a container:

execution:
  backend: local
  runtime:
    sandbox: container
    network: disabled
    container:
      image: python:3.11-slim
      executable: docker

Run a command method in its own container:

entrypoint:
  command: [python, my_agent.py, "{input_file}", "{output_file}"]
  protocol: batch

runtime:
  sandbox: container
  network: disabled
  container:
    image: my-agent-image:latest
    executable: docker
    build:
      context: .
      dockerfile: Dockerfile.agent
      tag: my-agent-image:latest
  envFromHost: [OPENAI_API_KEY]

Documentation

Build the docs locally:

uv run --extra docs mkdocs serve

Development Checks

uv run python -m unittest discover -s tests -p 'test_*.py'
uv run python -m compileall src/optpilot
./scripts/smoke_test.sh

OptPilot is licensed under the Apache License 2.0. See LICENSE.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

OptPilot

Current Surface

Prerequisites

Install

Quickstart

Full Config Examples

User-Owned Catalog

Container Runtime Example

Documentation

Development Checks

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
.agents		.agents
.github/workflows		.github/workflows
docs		docs
examples		examples
resource		resource
scripts		scripts
src/optpilot		src/optpilot
tests		tests
user_catalog		user_catalog
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
mkdocs.yml		mkdocs.yml
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

OptPilot

Current Surface

Prerequisites

Install

Quickstart

Full Config Examples

User-Owned Catalog

Container Runtime Example

Documentation

Development Checks

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages