6b11993688
Match the OLLAMA_URLS pattern from zerto-docs-rag so every docs MCP build fans out across the same two GPU-pinned Ollama containers on 192.168.0.2 (:11435 Titan X, :11436 1080 Ti). The host's primary Ollama on :11434 is left alone for OpenWebUI.

rag.embeddings now reads OLLAMA_URLS (plural CSV) preferentially, with fallback to OLLAMA_URL, defaulting to http://192.168.0.2:11434; this is the same shape as zerto's embeddings.py. The OllamaEmbeddings class already round-robins per batch, so both GPUs run in parallel during the chroma rebuild.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
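
For reference, the resolution order and per-batch round-robin described above can be sketched roughly as follows. This is an illustrative sketch only, not the actual rag.embeddings code: the helper names resolve_ollama_urls and assign_batches are hypothetical, and the real OllamaEmbeddings class may structure this differently.

```python
import os
from itertools import cycle

# Default taken from the commit message; used when neither variable is set.
DEFAULT_URL = "http://192.168.0.2:11434"


def resolve_ollama_urls(env=None):
    """Return the Ollama base URLs to use (hypothetical helper).

    Order of preference: OLLAMA_URLS (comma-separated), then OLLAMA_URL,
    then DEFAULT_URL.
    """
    env = os.environ if env is None else env
    raw = env.get("OLLAMA_URLS", "")
    # Split the CSV, dropping blanks and trailing slashes.
    urls = [u.strip().rstrip("/") for u in raw.split(",") if u.strip()]
    if urls:
        return urls
    single = env.get("OLLAMA_URL", "").strip().rstrip("/")
    return [single or DEFAULT_URL]


def assign_batches(batches, urls):
    """Pair each batch with a URL in round-robin order (hypothetical helper)."""
    return list(zip(batches, cycle(urls)))


if __name__ == "__main__":
    urls = resolve_ollama_urls(
        {"OLLAMA_URLS": "http://192.168.0.2:11435,http://192.168.0.2:11436"}
    )
    for batch, url in assign_batches(["batch-0", "batch-1", "batch-2"], urls):
        print(batch, "->", url)
```

With two URLs configured, consecutive batches alternate between the :11435 and :11436 containers; with only OLLAMA_URL set (or nothing set), every batch goes to a single endpoint.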