Files
seed-mcp/corpus/agripro_trials/agt-western-plains-dryland-2025-0.md
T
justin c737871c4c Trial-data scrapers: gh_plot_reports + agripro_trials + search_trials tool
This PR introduces TRIAL data — yield-performance results from real
field trials — as a SEPARATE data type alongside variety identity.
The two are complementary:

  search_docs  → "What's the disease resistance of DKC62-08RIB?"
                  (variety identity — what it IS)
  search_trials → "Which corn hybrid won the IA 2024 trials?"
                  (performance data — how it PERFORMED)

scrape/sources/gh_plot_reports.py — Golden Harvest plot reports
- 4,618 expected (2024+2025; 2023 deferred to a backfill pass).
- URL: /<crop>/plot-report/<state>/<year>/<plot_id>
- Cross-vendor: each plot lists products from multiple brands
  (NK / DEKALB / Golden Harvest / Enogen / Pioneer / Channel) side
  by side at one cooperator's field — the kind of independent
  comparison data Bayer doesn't publish itself.
- Generic per-column metrics dict (Yield/MST/Test Weight/$/Ac for
  corn+soy, Ton/Acre + Milk + Beef columns for silage).
- Politeness: 1 req/sec, retries on 429/5xx, no redirect-follow.

scrape/sources/agripro_trials.py — AgriPro regional trial PDFs
- 14 unique PDFs (38 sitemap links deduped) at /trials-data
- pdfplumber text extraction, region/year detection from filename
- Verbatim PDF text preserved in chunk body so variety + yield
  number adjacency drives retrieval (AP Iliad's Aberdeen ID yield
  matches a query about "AP Iliad Idaho yield")

rag/chunk.py — chunks_from_trial() dispatching by source
- Plot reports: identity preamble + Top-5 by primary metric + full
  ranking table. Metric labels chosen from the data (corn/soy use
  "Yield", silage uses "Ton/Acre").
- AgriPro PDFs: identity preamble + verbatim trial body inline so
  per-location yields surface for region+variety queries.
- Variety chunks get data_type="variety" metadata; trial chunks get
  data_type="trial". Single Chroma collection; the tool router
  filters by data_type rather than maintaining two collections.

rag/index.py — dispatch by sidecar's data_type field
rag/bm25.py — new filter columns (data_type, year, state)

docs_mcp/server.py — sixth MCP tool: search_trials(crop?, state?,
year?, product?, k=10)
- Filters trial chunks via where={"data_type": "trial", ...}
- Optional product substring post-filter for "DKC62-08RIB Iowa 2024"
  style searches
- search_docs now defaults to data_type="variety" so trial chunks
  don't bleed into variety identity queries
- Tool docstring routes the agent: "use lookup_variety to verify
  identity details on any trial winner you surface"

NK trial endpoint (/NKSeeds/wsProxy.asmx/GetPlotResult) is documented
as deferred — the ASMX-SOAP shape returned empty XML on initial
probe. Bayer per-variety yield data is not publicly indexed at all
— documented in the trial-scope note (DEKALB/Asgrow trial data flows
through Channel reps, not the web). AgRevival research books exist
as 10 large annual PDFs but are deferred (low ROI per parse).

Initial corpus shipped in this PR: 14 AgriPro trial PDFs. The 4,618
Golden Harvest plot reports are scraping in background and will be
added in a follow-up corpus-snapshot PR (~70 min ETA).

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-05-25 15:19:03 -04:00

2.9 KiB

2025 Western Plains Dryland Summary, Three-Year Data


Trial data (verbatim from PDF)

2025 Western Plains Dryland Summary, Three-Year Data
Syngenta Commercial Variety Wheat Performance Test, 2023-2025
3-Yr Combined 2-Yr Combined Combined Crook, Julesburg, Yuma,
Variety (2023-2025) (2024-2025) (2025) CO CO* CO
Hard Winter Wheat Yield TWT Yield TWT Yield TWT Yield Yield Yield
Bu/A Lb/Bu Bu/A Lb/Bu Bu/A Lb/Bu Bu/A Bu/A Bu/A
AP18 AX 74.2 57.6 75.8 57.8 85.0 55.2 59.5 101.2 94.3
AP Sunbird 74.1 59.5 77.2 59.8 86.5 57.9 61.6 102.9 95.1
AP24 AX 73.2 56.7 74.0 56.8 86.7 54.9 63.7 97.9 98.5
AP Bigfoot 71.2 58.9 73.0 58.9 79.0 56.6 60.4 86.6 90.0
SY Wolverine 68.9 59.1 71.4 59.5 80.5 57.2 61.9 91.0 88.5
AP Roadrunner 67.6 56.0 70.3 55.7 80.1 54.6 63.9 90.5 86.0
AP Solid 66.4 58.4 67.7 58.4 76.9 56.4 62.4 86.0 82.4
SY Monument 61.7 57.8 60.1 57.9 70.3 56.7 62.4 67.7 80.7
Langin 72.9 57.3 77.3 57.6 86.7 55.8 62.6 95.5 102.0
WB4422 70.6 59.1 73.8 59.4 83.2 57.7 65.8 90.7 92.9
AG Golden 70.3 55.5 72.3 55.5 86.1 54.9 71.2 93.6 93.4
High Country 69.4 59.1 72.0 58.9 79.2 57.0 60.4 89.6 87.7
WB4595 69.2 59.8 72.2 59.6 81.4 57.3 63.7 90.1 90.4
KS Dallas 68.3 59.1 66.5 58.7 76.0 57.4 55.0 82.3 90.8
Amplify SF 66.8 57.9 66.4 57.8 73.8 56.5 61.6 77.7 82.2
KS Hamilton 66.6 58.3 68.5 58.3 76.3 56.1 54.5 78.0 96.3
TAM 115 60.7 60.1 60.1 59.8 70.5 58.3 49.8 85.0 76.6
Kivari AX 73.7 57.0 83.3 55.9 63.3 87.1 99.5
WB-Grainfield 86.3 57.3 59.0 99.8 100.0
Canvas 85.1 55.9 67.4 94.4 93.7
KS Bill Snyder 80.4 56.9 63.2 92.9 85.3
KS Mako 78.4 57.2 58.1 84.6 92.3
Mean General 69.1 58.3 71.0 58.2 79.5 56.4 60.4 88.8 89.4
LSD General (5%) EE 5.1 1.4 6.2 1.7 9.6 ns 8.3 16.0 13.0
CV (Effective) 9.0 2.4 9.4 2.5 9.9 2.4 8.3 11.0 8.9
Boldfaced numbers are within confidence interval at specific locations and combined years of yield data.
* Location was affected by a Wheat Streak Mosaic Virus infestation, which resulted in reduced yield of susceptible varieties.
Locations
2023 — Julesburg and Yuma, CO; Colby, KS
2024 — Crook and Julesburg, CO; Ingalls, KS
2025 — Crook, Julesburg, and Yuma, CO
© 2025 Syngenta. All rights reserved. Reproduction expressly prohibited without written permission. Some or all of the varieties may be protected under one or more of the following: Plant Variety Protection, United
States Plant Patents and/or Utility Patents and may not be propagated or reproduced without authorization. AgriPro® and the Syngenta logo are trademarks of a Syngenta Group Company.