Ollama integration

To use Ollama models you generally don't need a valid API key, but you will have to specify:

  • a base API URL if Ollama is not running on the same machine as Brevia.
  • a model from the Ollama model library

Embeddings

The basic configuration is as follows:

EMBEDDINGS='{"_type": "langchain_ollama.embeddings.OllamaEmbeddings", "model": "nomic-embed-text"}'

Key variables you can add to this configuration include:

  • model: the name of the Ollama model to use (string)
  • base_url: base URL the model is hosted under; defaults to http://127.0.0.1:11434 (see the example below)
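
For example, if Ollama is running on a different host than Brevia, you might point the embeddings configuration at it like this (the hostname below is only a placeholder):

EMBEDDINGS='{"_type": "langchain_ollama.embeddings.OllamaEmbeddings", "model": "nomic-embed-text", "base_url": "http://ollama-host:11434"}'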

For additional configuration options, refer to the LangChain API reference.

Conversational LLM

Conversational LLMs are configured through the QA_COMPLETION_LLM and QA_FOLLOWUP_LLM variables, or SUMMARIZE_LLM for summarization and analysis; all of them use the same JSON format.

An example JSON configuration using Meta's Llama 3.2 might look like this:

{"_type": "langchain_ollama.chat_models.ChatOllama", "model": "llama3.2",
"base_url":"http://localhost:11434", "temperature": 0}

The primary variables you can add to this configuration include:

  • model: name of the Ollama model to use (string)
  • base_url: base URL the model is hosted under; defaults to http://127.0.0.1:11434
  • temperature: sampling temperature, ranging from 0.0 to 1.0 (float)
  • num_predict: maximum number of tokens to generate (int); see the sketch after this list
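
As a sketch, a summarization model capped to a given output length might be configured like this (the num_predict value is illustrative, not a recommendation):

SUMMARIZE_LLM='{"_type": "langchain_ollama.chat_models.ChatOllama", "model": "llama3.2", "temperature": 0, "num_predict": 2048}'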

For further configuration options, refer to the LangChain API reference.