rag: resilient embedder — rotate/split on endpoint errors; 4-GPU embed pool #8
Reference in New Issue
Block a user
Delete Branch "embed-pool-resilience"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Port of zerto-docs PR #45 (see that PR for the run #36 post-mortem). The embedder now rotates endpoints and retries on transport errors, halves the batch on HTTP errors (the .0.125 Windows 4090 Ollama crashes its runner on oversized input arrays), and logs error bodies. CI workflows extended to the full 4-endpoint GPU pool. Verified against the live pool including a forced 400-split on .0.125.
🤖 Generated with Claude Code