Add university-extension trials: Illinois VT + Iowa ICPT + Ohio OCPT (+123 cross-vendor trial docs) (#19)
Image rebuild (skip scrape) / build (push) Successful in 5m54s

Co-authored-by: claude <claude@jpaul.io>
Co-committed-by: claude <claude@jpaul.io>
This commit was merged in pull request #19.
This commit is contained in:
2026-06-10 08:36:19 -04:00
committed by Claude (agent)
parent 0bac06b7b6
commit a54fac240f
255 changed files with 105410 additions and 13 deletions
+23 -7
View File
@@ -338,6 +338,19 @@ The MCP exposes TWO complementary surfaces:
tables where the PDF fits ProHarvest's template; foreign-format
third-party reports are kept verbatim (`raw_text`) so the yields
are still searchable. Image-only PDFs (no text layer) are skipped.
- **University-extension variety trials** (`illinois_vt_trials`,
`iowa_icpt_trials`, `ohio_ocpt_trials`, 2024+2025) — **the
independent third-party gold standard.** Land-grant programs (U of
Illinois VT, Iowa State ICPT, Ohio OCPT) that test every *entered*
brand side-by-side at the same sites with replication + LSD stats.
The publisher is the university; the seed brands are in each row's
`brand`. **This is where Pioneer / DEKALB / Channel / Brevant
performance is legitimately available** (they enter these public
trials even though we can't scrape their own sites). Caveat: a brand
only appears where it *entered* — e.g. Brevant didn't enter Iowa
ICPT, DEKALB/Channel didn't enter Illinois VT; absence in one
program is a true negative, not missing data. Illinois adds wheat;
Iowa/Ohio are corn+soy. (Purdue PCPP + other states deferred.)
**Recommended workflow when a farmer asks about performance**:
@@ -363,13 +376,16 @@ The MCP exposes TWO complementary surfaces:
`syngenta-us.com/nk/yield-results` but the ASMX endpoint is
fiddly; not yet scraped. The variety identity is in the corpus
(`search_docs` finds it), just not the per-region trial yields.
- **Pioneer trials** — ToS bans automation, so we have neither
variety identity nor trial data. Direct the farmer to a
Pioneer dealer.
- **University extension trials** (Iowa State, Illinois,
Purdue, etc.) — third-party trial data that publishes Pioneer
+ competitors. Not in the corpus today; could be added in a
future enrichment.
- **Pioneer trials** — ToS bans automation, so we have no Pioneer
*identity* data and don't scrape Pioneer's own results. BUT
Pioneer *performance* IS now available indirectly via the
university-extension trials (and the GH/ProHarvest plots) where
Pioneer entered — search those for Pioneer head-to-head yields;
for Pioneer variety specs, direct the farmer to a dealer.
- **University extension trials** — NOW INDEXED for IL / IA / OH
(`illinois_vt_trials` / `iowa_icpt_trials` / `ohio_ocpt_trials`,
2024+2025). Purdue PCPP and other states (NE / WI / MN / the
Dakotas / Kansas wheat) are not yet indexed — a future enrichment.
**Reading a GH plot report**: