# llm_ollama

A RocketRide LLM node that routes pipeline traffic through a locally-hosted Ollama server.

## What it does

Provides text generation against an Ollama server running on your own hardware. The node acts as an `llm` invoke connection for agents and other nodes that need an LLM, and can also be driven directly via its `questions` / `answers` lane pair. Because all inference happens on-premise, no external API key is required, making it a natural fit for privacy-sensitive or air-gapped deployments.

Internally, the node talks to Ollama through its **OpenAI-compatible `/v1` endpoint** using **langchain-openai** (`ChatOpenAI`), with temperature fixed at `0`. If the configured `serverbase` URL does not end in `/v1`, the node appends it automatically, so both `http://localhost:11434` and `http://localhost:11434/v1` are accepted. The OpenAI client requires a non-empty API key field; the node sends the placeholder string `dummy-key`, which Ollama ignores.

---

## Configuration

### Lanes

| Lane in     | Lane out  | Description                                          |
| ----------- | --------- | ---------------------------------------------------- |
| `questions` | `answers` | Send a question directly, receive a generated answer |

### Fields

Pick a profile from the dropdown; the profile pre-fills `model`, `serverbase`, and `modelTotalTokens`. All three fields are individually overridable when using the `custom` profile.

| Field | Type | Description |
|---|---|---|
| `model` | string | Ollama model |
| `modelTotalTokens` | number | Total Tokens |
| `profile` | string | Default "llama3_3". LLM model |

---

## Profiles

**Llama**

| Profile               | Model             | Context    |
| --------------------- | ----------------- | ---------- |
| Llama 4 Latest        | `llama4:latest`   | 10,000,000 |
| Llama 3.3 _(default)_ | `llama3.3:latest` | 128,000    |
| Llama 3.1 405B        | `llama3.1:405b`   | 128,000    |
| Llama 3.1 70B         | `llama3.1:70b`    | 128,000    |
| Llama 3.1 8B          | `llama3.1:8b`     | 128,000    |
| Llama 3.2 3B          | `llama3.2`        | 128,000    |
| Llama 3.2 1B          | `llama3.2:1b`     | 128,000    |

**Qwen**

| Profile       | Model          | Context |
| ------------- | -------------- | ------- |
| Qwen 3 Latest | `qwen3:latest` | 128,000 |
| Qwen 2.5 72B  | `qwen2.5:72b`  | 128,000 |
| Qwen 2.5 32B  | `qwen2.5:32b`  | 128,000 |
| Qwen 2.5 14B  | `qwen2.5:14b`  | 128,000 |
| Qwen 2.5 7B   | `qwen2.5`      | 128,000 |
| Qwen 2.5 3B   | `qwen2.5:3b`   | 128,000 |
| Qwen 2.5 1.5B | `qwen2.5:1.5b` | 128,000 |
| Qwen 2.5 0.5B | `qwen2.5:0.5b` | 128,000 |

**DeepSeek**

| Profile          | Model              | Context |
| ---------------- | ------------------ | ------- |
| DeepSeek R1 671B | `deepseek-r1:671b` | 128,000 |
| DeepSeek R1 32B  | `deepseek-r1:32b`  | 128,000 |
| DeepSeek R1 14B  | `deepseek-r1:14b`  | 128,000 |
| DeepSeek R1 7B   | `deepseek-r1:7b`   | 128,000 |
| DeepSeek R1 1.5B | `deepseek-r1:1.5b` | 128,000 |

**Other**

| Profile    | Model     | Context |
| ---------- | --------- | ------- |
| Phi 4 14B  | `phi4`    | 16,000  |
| Mistral 7B | `mistral` | 32,000  |

**Custom**: supply any Ollama model tag, context token count, and server URL directly. The default context for a new custom profile is 16,385 tokens until you change it.

---

## Upstream docs

- [Ollama documentation](https://docs.ollama.com/)
- [Ollama model library](https://ollama.com/library)

---

## Schema

| Field | Type | Description | Default |
|---|---|---|---|
| `model` | `string` | **Model**: Ollama model | |
| `modelTotalTokens` | `number` | **Tokens**: Total Tokens | |
| `ollama.profile` | `string` | **Model**: LLM model | `"llama3_3"` |

## Dependencies

- `langchain-openai`
- `langchain-core`
- `langchain`

## Source

[View source](https://github.com/rocketride-org/rocketride-server/tree/develop/nodes/src/nodes/llm_ollama)
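
## Example

The `serverbase` handling and client setup described under "What it does" can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the node's actual source: the helper names `normalize_serverbase` and `build_client` are hypothetical, and stripping trailing slashes before the `/v1` check is an assumption that the description above does not confirm.

```python
def normalize_serverbase(serverbase: str) -> str:
    """Return the server URL ending in exactly one `/v1` path segment."""
    # Assumption: surrounding whitespace and trailing slashes are ignored.
    base = serverbase.strip().rstrip("/")
    return base if base.endswith("/v1") else base + "/v1"


def build_client(model: str = "llama3.3:latest",
                 serverbase: str = "http://localhost:11434"):
    """Build a ChatOpenAI client aimed at Ollama's OpenAI-compatible endpoint."""
    # Imported lazily so normalize_serverbase works without langchain-openai.
    from langchain_openai import ChatOpenAI

    return ChatOpenAI(
        model=model,
        base_url=normalize_serverbase(serverbase),
        api_key="dummy-key",  # placeholder; Ollama ignores the value
        temperature=0,        # matches the fixed temperature described above
    )
```

With a running Ollama server and the model already pulled, `build_client().invoke("Hello")` would send a single chat request. Both `http://localhost:11434` and `http://localhost:11434/v1` normalize to the same base URL.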