seed-mcp/corpus/gh_plot_reports/ghpr-silage-ut-2024-2452510.md
justin 0e625553e5 gh_plot_reports corpus (4,299 plots) + concurrency + 4-GPU pool
CORPUS: 4,299 GH plot reports added (3,797 written in this run +
502 from the earlier slow run). A further 319 sitemap-listed URLs
404'd as discontinued and are not counted in that total. Combined
with the prior 760 varieties + 14 AgriPro trials = 5,073 total
chunks now indexed.

scrape/sources/gh_plot_reports.py — concurrency speedup:
- 4 worker threads (ThreadPoolExecutor), each with its own
  requests.Session for connection-pool efficiency.
- Shared class-level rate limiter (0.25 sec between ANY two
  requests across all threads). Throughput is capped at 4 req/sec,
  which should be well below typical rate-limit thresholds.
- Diagnosis vs original 1 req/sec: GH had ZERO rate limiting,
  zero 429s, zero retries. The 1 sec self-throttle was just too
  conservative. Bench:
    1 worker  / 1.0 sec throttle:  ~0.4 plots/sec (190 min ETA)
    4 workers / 0.25 sec throttle: ~3 plots/sec  (~25 min actual)
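
A minimal sketch of the shared-limiter pattern described above, not the
actual gh_plot_reports.py code. SharedRateLimiter, fetch_all and the
fetch callable are hypothetical names; in the real scraper the fetch
step would use each worker's own requests.Session.

```python
import threading
import time
from concurrent.futures import ThreadPoolExecutor


class SharedRateLimiter:
    """Enforce a minimum gap between ANY two calls, across all threads."""

    def __init__(self, min_interval: float) -> None:
        self.min_interval = min_interval
        self._lock = threading.Lock()
        self._next_slot = time.monotonic()

    def wait(self) -> None:
        # Reserve the next free time slot under the lock, then sleep outside
        # it so other threads can reserve their own slots in the meantime.
        with self._lock:
            now = time.monotonic()
            slot = max(now, self._next_slot)
            self._next_slot = slot + self.min_interval
        delay = slot - now
        if delay > 0:
            time.sleep(delay)


def fetch_all(urls, fetch, workers=4, min_interval=0.25):
    """Call fetch(url) for every URL via a thread pool and one shared limiter."""
    limiter = SharedRateLimiter(min_interval)

    def task(url):
        limiter.wait()
        return fetch(url)

    with ThreadPoolExecutor(max_workers=workers) as pool:
        # pool.map returns results in the same order as the input URLs.
        return list(pool.map(task, urls))
```

Because slots are reserved under one lock, adding workers hides
per-request latency but cannot push the request rate past
1 / min_interval.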

rag/chunk.py — chunk size cap for nomic-embed-text's 2048-token
context window:
- Empirically tested: failure threshold is ~5,250 chars on
  numeric-heavy trial chunks (chars/token ratio 2.4 vs 3.5 for
  prose). Cap at 4,500 chars, which stays under 2,048 tokens even
  at a worst-case 2.2 chars/token (4,500 / 2.2 ≈ 2,045 tokens).
- Applied to BOTH variety and trial chunks. Marked truncated
  chunks with metadata.embed_truncated = True; FULL text stays
  in the on-disk .md for get_page to return verbatim.
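
The cap and the truncation flag could be sketched as below. This is an
illustration under assumptions, not the code in rag/chunk.py:
cap_for_embedding and the constant names are hypothetical, and only
the embedded text is cut, since the full text lives in the on-disk .md.

```python
MAX_EMBED_CHARS = 4_500       # cap stated in the commit message
CONTEXT_TOKENS = 2_048        # nomic-embed-text context window
WORST_CHARS_PER_TOKEN = 2.2   # worst-case ratio from the commit message


def cap_for_embedding(text, metadata=None):
    """Return (embed_text, metadata), flagging chunks that were cut to fit."""
    meta = dict(metadata or {})  # copy so the caller's dict is not mutated
    if len(text) <= MAX_EMBED_CHARS:
        return text, meta
    meta["embed_truncated"] = True
    return text[:MAX_EMBED_CHARS], meta


# Sanity check on the cap: 4,500 chars / 2.2 chars per token is ~2,045 tokens.
worst_case_tokens = MAX_EMBED_CHARS / WORST_CHARS_PER_TOKEN
assert worst_case_tokens < CONTEXT_TOKENS
```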

.gitea/workflows/{refresh,image-only}.yml — OLLAMA_URL pool
restructured for the 4 GPU-pinned endpoints. Bench (50-chunk
batches on nomic-embed-text):

    .0.125:11434  (RTX 40-series)  242 embeds/sec  ← weight ×4
    .0.2:11436    (GPU-pinned)     108 embeds/sec  ← weight ×2
    .0.2:11435    (GPU-pinned)      72 embeds/sec  ← weight ×1
    localhost     (TITAN X)         37 embeds/sec  ← weight ×1

Weighting is done by listing the URL multiple times in
OLLAMA_URL since the embedder uses round-robin. .0.2:11434 is
explicitly EXCLUDED — it isn't pinned to a specific GPU.
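
A rough sketch of how repeating URLs turns plain round-robin into a
4:2:1:1 split. The endpoint names are hypothetical placeholders (the
real hosts are abbreviated in the bench above), build_pool is an
illustrative helper, and the comma-separated OLLAMA_URL format is an
assumption.

```python
from collections import Counter
from itertools import cycle, islice

# Hypothetical endpoints mapped to the weights from the bench table.
WEIGHTS = {
    "http://gpu-a.example:11434": 4,
    "http://gpu-b.example:11436": 2,
    "http://gpu-c.example:11435": 1,
    "http://gpu-d.example:11434": 1,
}


def build_pool(weights):
    """Repeat each URL `weight` times, interleaved so repeats are spread out."""
    remaining = dict(weights)
    pool = []
    while any(remaining.values()):
        for url in weights:
            if remaining[url] > 0:
                pool.append(url)
                remaining[url] -= 1
    return pool


pool = build_pool(WEIGHTS)
ollama_url = ",".join(pool)  # assumed format: comma-separated URL list

# Plain round-robin over the repeated list sends requests in a 4:2:1:1 ratio.
share = Counter(islice(cycle(pool), 800))
```

Interleaving the repeats, rather than listing one URL four times in a
row, is a design choice in this sketch: it avoids sending four
consecutive batches to the same endpoint.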

Combined index rebuild for 5,073 chunks now finishes in ~3 min
(was 19+ min on the single-endpoint pool).

Smoke tests:
✓ list_versions: 5,073 docs across 6 sources, 2 vendors, 6
  brands, 4 crops (corn 2711, soy 2016, silage 223, wheat 123).
✓ search_trials({crop=corn, state=IA, year=2024}): 3 IA 2024
  corn trials surfaced.
✓ search_trials("Phytophthora resistance soybean trial"): NK
  NK43-W1XFS top-1 in LA 2024 trial (cross-vendor result).
✓ search_trials("AP Iliad Idaho wheat"): AgriPro Washington/N
  Idaho 2025 trial surfaced.
✓ search_trials(product=DKC65-95): 3 corn trials containing
  that hybrid in IL/IA 2024.
✓ search_trials(product=NK1701): 3 corn trials in AR/MS 2024.
✓ Product filter correctly returns EMPTY for products that
  aren't in the corpus (DKC65-20 is a 2023 product; 2023 plots
  deferred). Anti-hallucination contract preserved.
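
The empty-result behaviour can be illustrated with an exact-match
filter. This is a sketch only: filter_by_product, the record shape and
the trial ids are hypothetical, not the search_trials implementation.

```python
def filter_by_product(trials, product):
    """Keep trials that list exactly this product (case-insensitive).

    There is no fuzzy fallback, so an unknown product yields an empty list
    rather than a near match such as a sibling hybrid.
    """
    wanted = product.strip().upper()
    return [
        trial for trial in trials
        if any(p.upper() == wanted for p in trial["products"])
    ]


# Hypothetical records for illustration only.
TRIALS = [
    {"id": "trial-a", "products": ["DKC65-95", "NK1701"]},
    {"id": "trial-b", "products": ["NK1701"]},
]

hits = filter_by_product(TRIALS, "DKC65-95")
misses = filter_by_product(TRIALS, "DKC65-20")  # not in the corpus
```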

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-05-25 16:46:35 -04:00

2.5 KiB

Silage yield trial — Utah, 2024


Results (top-down by rank)

| Rank | Brand | Product | Traits | Ton/Acre | Milk Per Acre | Milk Per Ton | Beef Per Acre | Beef Per Ton |
|---|---|---|---|---|---|---|---|---|
| 1 | Axis Seed | Experimental | | 32.0 | 3576.0 | 32078.0 | 227.0 | 2175.0 |
| 2 | Pioneer | P9489AM | - | 31.8 | 3647.0 | 34361.0 | 233.0 | 2226.0 |
| 3 | Pioneer | P9193AM | - | 30.8 | 3671.0 | 33257.0 | 237.0 | 2185.0 |
| 4 | PGS Hybrids | 4291 | | 30.7 | 3733.0 | 35353.0 | 243.0 | 2243.0 |
| 5 | Pioneer | P90630AM | - | 30.6 | 3678.0 | 33099.0 | 233.0 | 2144.0 |
| 6 | Pioneer | P9492AM | - | 30.0 | 3477.0 | 29370.0 | 227.0 | 2038.0 |
| 7 | PGS Hybrids | 4292 | | 29.6 | 3723.0 | 32469.0 | 240.0 | 2133.0 |
| 8 | NK | NK9044-AA | - | 29.4 | 3535.0 | 31103.0 | 233.0 | 2060.0 |
| 9 | Axis Seed | Experimental | | 28.7 | 3700.0 | 32738.0 | 243.0 | 2091.0 |
| 10 | Western Hybrids | 2290 | | 28.1 | 3287.0 | 26849.0 | 223.0 | 1882.0 |
| 11 | DEKALB | DKC39-54 | - | 27.9 | 3168.0 | 28945.0 | 230.0 | 1925.0 |
| 12 | Axis Seed | Experimental | | 27.7 | 3675.0 | 32130.0 | 247.0 | 2050.0 |
| 13 | Western Hybrids | 3S284 | | 27.6 | 3294.0 | 27662.0 | 223.0 | 1848.0 |
| 14 | Axis Seed | 36Q52 | | 27.3 | 3566.0 | 28396.0 | 230.0 | 1881.0 |
| 15 | Western Hybrids | 5DR3387 | | 26.9 | 3393.0 | 26977.0 | 227.0 | 1829.0 |
| 16 | Legacy Seeds | LC414-21 | VT2P | 26.7 | 3361.0 | 25629.0 | 223.0 | 1792.0 |
| 17 | NK | NK8558-AA | - | 26.6 | 3365.0 | 26885.0 | 223.0 | 1785.0 |
| 18 | DEKALB | DKC31-85 | - | 26.1 | 3286.0 | 25559.0 | 220.0 | 1722.0 |
| 19 | Western Hybrids | 5P1484 | | 26.0 | 3306.0 | 25156.0 | 217.0 | 1692.0 |
| 20 | NK | NK8005-V | - | 25.9 | 3301.0 | 26106.0 | 220.0 | 1707.0 |
| 20 | Pioneer | P8294AM | - | 25.9 | 3191.0 | 24949.0 | 217.0 | 1681.0 |
| 22 | DEKALB | DKC32-35 | - | 24.1 | 3244.0 | 23751.0 | 220.0 | 1593.0 |
| 22 | Pioneer | P87040AM | - | 24.1 | 3428.0 | 24278.0 | 223.0 | 1612.0 |
| 24 | DEKALB | DKC35-34 | - | 23.2 | 3465.0 | 24267.0 | 227.0 | 1578.0 |

Top 5 by Ton/Acre: Experimental (Axis Seed) 32.0, P9489AM (Pioneer) 31.8, P9193AM (Pioneer) 30.8, 4291 (PGS Hybrids) 30.7, P90630AM (Pioneer) 30.6.