Phase 4-5: deployable container + corpus snapshot (614 varieties) #5
Reference in New Issue
Block a user
Delete Branch "phase-4-5-deploy"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Summary
deploy/docker-compose.yml— fill in concrete values for Drawbar's stack: imagegit.jpaul.io/justin/seed-mcp:latest, host port 8001 (so we don't collide with crop-chem-docs on 8000),PRODUCT_NAME=crop_seed, hybrid search enabled, stateless HTTP. The sharedllama-rerankis NOT redefined here — Drawbar's parent stack already has it from the crop-chem-docs deploy..gitignore—corpus/is now COMMITTED, not ignored. The monthly refresh workflow scrapes and pushes corpus diffs; the image-only workflow rebuilds indexes from the committed corpus. This lets:corpus-YYYY.MM.DDimage tags pin to a specific seed-catalog snapshot.chroma/andbm25/remain ignored.bayer_seeds: 475 (DEKALB 288 / Asgrow 102 / WestBred 85)golden_harvest: 139 (36 sitemap URLs were 302-discontinued)rag/chunk.py— normalizebrandto uppercase in Chroma metadata so brand-filter lookups don't break on vendor casing inconsistency (Bayer "DEKALB" vs Syngenta "Golden Harvest";_build_where.upper()matched the former but not the latter). Sidecar JSON keeps original casing for display.nk,agripro,becks_pfr,becks_products) — return code 0 instead of 2 so the monthly refresh CI workflow doesn't fail on deferred sources.What this PR makes possible
After merge:
gh workflow run image-only.ymlrebuilds the image from the committed corpus and pushes to192.168.0.2:1234/justin/seed-mcp:latest.git.jpaul.io/justin/seed-mcp:latest, mount usage logs, and the seed-mcp tools are live to the farm-advisor agent.Test plan
list_versionsshows both vendors with correct facet counts (614 = 475 + 139)brand='Golden Harvest'filter returns 3 GH-only varieties (pre-fix was zero results due to case mismatch)E085Z5→ Golden Harvest top hitCoverage now
crop_seed_api_lessons(topic='pioneer')nk,agripro,becks_pfr,becks_products(the lessons tool'ssources-not-yet-indexedsection tells the agent which vendors aren't there yet)