feat: add Ollama provider support with streaming inference by independenter · Pull Request #162 · nat/openplayground

independenter · 2026-04-28T13:55:23Z

Description

This PR adds comprehensive support for the Ollama LLM provider, enabling users to run local models through Ollama with full streaming support.

Changes Made

✅ Implemented ollama_text_generation and ollama_chat_generation methods in [server/lib/inference/init.py](server/lib/inference/init.py)
✅ Added api_url field to ProviderDetails and Provider entities in [server/lib/entities.py](server/lib/entities.py) for custom API endpoints
✅ Registered Ollama routes in [server/app.py](server/app.py)
✅ Updated [server/models.json](server/models.json) with Ollama model configurations (e.g., gemma2, llama3)
✅ Added comprehensive documentation in Chinese: docs/添加大模型提供商指南.md

Features

🔄 Full streaming support for real-time token generation
🎯 Support for both text generation and chat modes
⚙️ Configurable API URL (default: http://localhost:11434)
📊 Comprehensive parameter support (temperature, top_p, top_k, etc.)
❌ Graceful cancellation support

How to Test

Install Ollama: https://ollama.com/
Pull a model: ollama pull llama3 or ollama pull gemma2
Ensure Ollama is running (default: http://localhost:11434)
Start the OpenPlayground server
Select an Ollama model from the dropdown
Start chatting and verify streaming responses work correctly

Configuration Example

Add to [server/models.json](server/models.json):

{
  "ollama": {
    "models": {
      "llama3": {
        "enabled": true,
        "status": "ready",
        "parameters": {
          "temperature": 0.7,
          "topP": 0.9,
          "topK": 40,
          "maximumLength": 512
        }
      }
    },
    "requiresAPIKey": false,
    "remoteInference": true,
    "apiURL": "http://localhost:11434"
  }
}

- Add ollama dependency handling in lib/inference/__init__.py - Implement __ollama_text_generation__ and __ollama_chat_generation__ methods - Add api_url field to ProviderDetails and Provider entities - Register Ollama routes in server/app.py - Update models.json with Ollama model configurations (gemma4:e4b) - Add comprehensive documentation in docs/添加大模型提供商指南.md - Support both text generation and chat generation with streaming responses

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add Ollama provider support with streaming inference#162

feat: add Ollama provider support with streaming inference#162
independenter wants to merge 1 commit into
nat:mainfrom
independenter:feature/add-ollama-provider

independenter commented Apr 28, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

independenter commented Apr 28, 2026

Description

Changes Made

Features

How to Test

Configuration Example

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant