The Ollama plugin provides interfaces to any of the local LLMs supported by Ollama.
Prerequisites
This plugin requires that you first install and run the Ollama server. You can follow the instructions on the Download Ollama page.
Use the Ollama CLI to download the models you are interested in. For example:
ollama pull gemma2
For development, you can run Ollama on your development machine. Deployed apps usually run Ollama on a different, GPU-accelerated, machine from the app backend that runs Genkit.
Configuration
To use this plugin, call ollama.Init()
, specifying the address of your Ollama
server:
import "github.com/firebase/genkit/go/plugins/ollama"
// Init with Ollama's default local address.
if err := ollama.Init(ctx, &ollama.Config{
ServerAddress: "http://127.0.0.1:11434",
}); err != nil {
return err
}
Usage
To generate content, you first need to create a model definition based on the model you installed and want to use. For example, if you installed Gemma 2:
model := ollama.DefineModel(
g,
ollama.ModelDefinition{
Name: "gemma2",
Type: "chat", // "chat" or "generate"
},
&ai.ModelCapabilities{
Multiturn: true,
SystemRole: true,
Tools: false,
Media: false,
},
)
Then, you can use the model reference to send requests to your Ollama server:
text, err := genkit.GenerateText(ctx, g,
ai.WithModel(model),
ai.WithTextPrompt("Tell me a joke."))
if err != nil {
return err
}
See Generating content for more information.