Ollama
Ollama is the easiest way to run open models locally. It serves an OpenAI-compatible API at http://localhost:11434/v1, so it drops into routeplane.yaml as one provider block.
Prerequisites
Section titled “Prerequisites”-
Routeplane installed, with a
routeplane.yaml(scaffold one withrouteplane init). -
Ollama running, with a model pulled:
Terminal window ollama serve & # default port 11434ollama pull llama3.1
Add Ollama to Routeplane
Section titled “Add Ollama to Routeplane”providers: ollama: api_base: http://localhost:11434/v1 api_protocol: - "*": chat_completions models: - id: llama3.1Each models entry is a name you’ve pulled with ollama pull. List what’s available with ollama list.
Route to it
Section titled “Route to it”routeplane route ollama:llama3.1Then start Routeplane and send a request. Use the provider-qualified id ollama:llama3.1 to pin the request, or the bare llama3.1 to let Routeplane cascade.
Learn more
Section titled “Learn more”- Ollama — OpenAI compatibility
- Model fallback — fail over from local Ollama to a hosted model.