AI Models Configuration¶
This guide explains how to configure and use AI models with the Ollama and Open WebUI integration in Obelisk.
Model Management¶
Pulling Models¶
Models can be pulled through the Open WebUI interface or directly using Ollama:
```bash
# Using the Ollama CLI
docker exec -it ollama ollama pull mistral

# List available models
docker exec -it ollama ollama list
```
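Models can also be pulled through Ollama's REST API, which is handy for scripting. A minimal sketch, assuming Ollama is exposed on its default port 11434 (older Ollama releases use a `"name"` field instead of `"model"`):

```bash
# Pull a model via the REST API (use "name" instead of "model" on older releases)
curl http://localhost:11434/api/pull -d '{"model": "mistral"}'

# List installed models via the API
curl http://localhost:11434/api/tags
```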
Model Storage¶
Models are stored in persistent Docker volumes:
- `models`: Shared volume for model files
- `ollama`: Ollama-specific configuration and model registry
This ensures your models persist between container restarts.
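To confirm the volumes exist and find where Docker keeps them on the host, the standard volume commands are enough; note that if the stack was started with Docker Compose, the volume names may carry a project prefix:

```bash
# List volumes, then inspect the Ollama volume's mountpoint on the host
# (Compose may prefix the name, e.g. <project>_ollama)
docker volume ls
docker volume inspect ollama
```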
Recommended Models¶
Here are some recommended models to use with the Obelisk chatbot integration:
| Model | Size | Description | Command |
|-------|------|-------------|---------|
| Llama 2 | 7B | Meta's general-purpose model | `ollama pull llama2` |
| Mistral | 7B | High-performance open model | `ollama pull mistral` |
| Phi-2 | 2.7B | Microsoft's compact model | `ollama pull phi` |
| Gemma | 7B | Google's lightweight model | `ollama pull gemma:7b` |
| CodeLlama | 7B | Code-specialized model | `ollama pull codellama` |
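To pull several of these at once, a small shell loop over the commands in the table works; edit the list to match the models you actually want:

```bash
# Pull a selection of the recommended models (adjust the list as needed)
for model in llama2 mistral phi gemma:7b codellama; do
  docker exec ollama ollama pull "$model"
done
```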
For documentation-specific tasks, consider models that excel at knowledge retrieval and explanation.
Custom Model Configuration¶
You can create custom model configurations using Modelfiles:
- Create a Modelfile defining the base model, parameters, and system prompt (see the sketch below).
- Build the custom model with `ollama create`.
- Use the custom model in Open WebUI.
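A minimal sketch of the first two steps, assuming the container is named `ollama`; the model name `obelisk-docs`, the base model, and the parameter values are illustrative placeholders:

```bash
# Write an example Modelfile on the host (base model and values are placeholders)
cat > obelisk.Modelfile <<'EOF'
FROM mistral
PARAMETER temperature 0.7
SYSTEM You are a documentation assistant for the Obelisk project.
EOF

# Copy the Modelfile into the container and build the custom model
docker cp obelisk.Modelfile ollama:/tmp/obelisk.Modelfile
docker exec -it ollama ollama create obelisk-docs -f /tmp/obelisk.Modelfile
```

Once created, `obelisk-docs` appears alongside the pulled models in the Open WebUI model selector.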
Hardware Requirements¶
Model performance depends on available hardware:
- 7B models: Minimum 8GB VRAM (GPU) or 16GB RAM (CPU)
- 13B models: Minimum 16GB VRAM (GPU) or 32GB RAM (CPU)
- 70B models: Minimum 80GB VRAM (GPU) or distributed setup
For optimal performance, use GPU acceleration with the NVIDIA Container Toolkit.
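If Ollama runs as a standalone container, GPU access can be enabled at startup as sketched below; this assumes the NVIDIA Container Toolkit is already installed on the host and reuses the container name and volume from the examples above:

```bash
# Start Ollama with access to all host GPUs (requires the NVIDIA Container Toolkit)
docker run -d --gpus=all \
  -v ollama:/root/.ollama \
  -p 11434:11434 \
  --name ollama ollama/ollama

# Verify the GPU is visible from inside the container
docker exec ollama nvidia-smi
```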
Quantization Options¶
Ollama supports various quantization levels to balance performance and resource usage:
| Quantization | Quality | Memory Usage | Example |
|--------------|---------|--------------|---------|
| F16 | Highest | Highest | `ollama pull mistral:7b-fp16` |
| Q8_0 | High | Medium | `ollama pull mistral:7b-q8_0` |
| Q4_K_M | Medium | Low | `ollama pull mistral:7b-q4_k_m` |
| Q4_0 | Lowest | Lowest | `ollama pull mistral:7b-q4_0` |
Choose a quantization level based on your hardware capabilities and quality requirements. Exact tag names vary by model and release, so check the model's page in the Ollama library for the quantizations actually published; note that `:latest` typically points to a 4-bit default build rather than F16.
Troubleshooting¶
Common issues and solutions:
- Out of memory errors:
  - Try a smaller model or a more aggressive quantization (e.g. Q4_0 instead of Q8_0)
  - Reduce the context length in Open WebUI settings
- Slow responses:
  - Ensure GPU acceleration is properly configured
  - Check for other processes using GPU resources
- Model not found:
  - Verify the model was pulled correctly
  - Check network connectivity to the model repositories
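A few quick checks cover most of these cases, assuming the container is named `ollama` as in the earlier examples:

```bash
# Confirm the model was pulled and is registered
docker exec ollama ollama list

# Inspect recent server output for load or out-of-memory errors
docker logs --tail 50 ollama

# On GPU setups, verify the GPU is visible inside the container
docker exec ollama nvidia-smi
```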
For more troubleshooting, consult the Ollama documentation.