AutOffload

AutOffload is a hybrid local-cloud agent task delegator and hardware diagnostics runner built on Clean Architecture principles. It allows cloud-based developer agents (like Antigravity or Claude Code) to offload multi-turn, iterative coding tasks (such as syntax fixes, test cycles, and unit test generation) to a local Ollama model. This drastically reduces cloud token consumption and speeds up minor refactoring loops.

1. System Requirements

Hardware Guidelines

To run local coding models with comfortable generation speeds (~30–60 tokens/sec), the model must fit entirely within your GPU VRAM:

12GB VRAM (Sweet Spot - e.g., RTX 3060, RTX 4070): Recommended model: qwen2.5-coder:7b (extremely fast, low footprint) or qwen2.5-coder:14b (higher reasoning, tight fit).
8GB VRAM (Minimum GPU): Recommended model: qwen2.5-coder:7b or deepseek-r1:8b.
No GPU (CPU-only fallback): The tool will fall back to CPU execution. Note that running models on system RAM is highly sluggish (~2 tokens/sec) and not recommended for agentic iterative loops.

Software

Node.js: v22.15.0 or higher.
Ollama: Client and service installed and listening (typically on http://localhost:11434).

2. Step-by-Step Setup

Execute these steps in sequence to install the CLI tool and register the global agent skill.

Step 1: Clone the Repository

git clone <repository-url>
cd AutOffload

Step 2: Install Dependencies

npm install

Step 3: Compile the Project

npm run build

Step 4: Link the CLI Globally

Link the package globally on your OS so the autoffload command is available in any terminal session:

npm link

Verify installation by running:

autoffload specs

Step 5: Install the Global Antigravity Skill

Register the custom skill so that your Antigravity agent knows how and when to call this tool:

npm run install-skill

This copies the global skill definition directly to your local .gemini settings: C:\Users\<Username>\.gemini\config\skills\autoffload\SKILL.md

3. Ollama Preparation

Make sure the local Ollama instance is running and has the optimized coding model loaded.

Start Ollama: Ensure the Ollama app or system service is active.
Pull the Recommended Model:
```
ollama pull qwen2.5-coder:7b
```

4. Configuration (`autoffload.config.json`)

You can create an autoffload.config.json file in the root of your target project workspace to override settings:

{
  "ollamaUrl": "http://localhost:11434",
  "defaultModel": "qwen2.5-coder:7b",
  "maxRetries": 3
}

ollamaUrl: The HTTP API URL where your Ollama service is listening.
defaultModel: The model to fall back on if no model override -m parameter is specified in the CLI.
maxRetries: The number of self-correction code-compilation loops the agent executes before declaring failure.

5. CLI Usage Examples

A. Run Hardware Diagnostics

autoffload specs

Examines your CPU, total RAM, and GPU VRAM to output a compatibility report and suggest the best model for your hardware.

B. Run Coding Task with Compilation Verification Loop

autoffload run \
  -t "Fix spelling error 'rturn' to 'return' in the add function" \
  -f "test_workspace/calculator.ts" \
  -c "npx tsc --noEmit test_workspace/calculator.ts"

-t, --task: Detailed instructions of the coding task to execute.
-f, --files: Comma-separated list of target files (local model reads them and writes changes back).
-c, --test: Optional. The validation test command. If it returns a non-zero exit code, the compiler error logs are fed back to the model to correct the code in a loop.
-m, --model: Optional. Override the targeted model.
-r, --retries: Optional. Override the max self-correction attempts.

6. Project Architecture (Clean Architecture)

AutOffload is structured to isolate core business rules from infrastructure implementations:

src/
├── domain/                  # Core Models & Contracts (Zero dependencies)
│   ├── entities/            # SystemSpecs definitions and recommended rules
│   └── ports/               # Interfaces for FileSystem, ProcessExecutor, LLMProvider
│
├── application/             # Use Cases
│   └── use-cases/           # GetSpecsUseCase, RunTaskUseCase (Self-correction logic)
│
└── infrastructure/          # Adapters (Concrete implementations)
    ├── cli/                 # CLI entry flag parsing and stdout streaming
    ├── config/              # JSON config loader
    ├── spec-providers/      # Windows PowerShell specs extraction
    ├── llm-providers/       # Ollama REST client (HTTP JSON-lines parser)
    ├── executors/           # Subprocess runner (child_process)
    └── file-system/         # Node fs/promises file reader/writer

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
dist		dist
node_modules		node_modules
scripts		scripts
skills/autoffload		skills/autoffload
src		src
test_workspace		test_workspace
README.md		README.md
autoffload.config.json		autoffload.config.json
package-lock.json		package-lock.json
package.json		package.json
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AutOffload

1. System Requirements

Hardware Guidelines

Software

2. Step-by-Step Setup

Step 1: Clone the Repository

Step 2: Install Dependencies

Step 3: Compile the Project

Step 4: Link the CLI Globally

Step 5: Install the Global Antigravity Skill

3. Ollama Preparation

4. Configuration (`autoffload.config.json`)

5. CLI Usage Examples

A. Run Hardware Diagnostics

B. Run Coding Task with Compilation Verification Loop

6. Project Architecture (Clean Architecture)

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

AutOffload

1. System Requirements

Hardware Guidelines

Software

2. Step-by-Step Setup

Step 1: Clone the Repository

Step 2: Install Dependencies

Step 3: Compile the Project

Step 4: Link the CLI Globally

Step 5: Install the Global Antigravity Skill

3. Ollama Preparation

4. Configuration (autoffload.config.json)

5. CLI Usage Examples

A. Run Hardware Diagnostics

B. Run Coding Task with Compilation Verification Loop

6. Project Architecture (Clean Architecture)

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

4. Configuration (`autoffload.config.json`)

Packages