Add university-extension trials: Illinois VT + Iowa ICPT + Ohio OCPT (+123 cross-vendor trial docs) (#19)
Image rebuild (skip scrape) / build (push) Successful in 5m54s
Image rebuild (skip scrape) / build (push) Successful in 5m54s
Co-authored-by: claude <claude@jpaul.io> Co-committed-by: claude <claude@jpaul.io>
This commit was merged in pull request #19.
This commit is contained in:
+23
-7
@@ -338,6 +338,19 @@ The MCP exposes TWO complementary surfaces:
|
||||
tables where the PDF fits ProHarvest's template; foreign-format
|
||||
third-party reports are kept verbatim (`raw_text`) so the yields
|
||||
are still searchable. Image-only PDFs (no text layer) are skipped.
|
||||
- **University-extension variety trials** (`illinois_vt_trials`,
|
||||
`iowa_icpt_trials`, `ohio_ocpt_trials`, 2024+2025) — **the
|
||||
independent third-party gold standard.** Land-grant programs (U of
|
||||
Illinois VT, Iowa State ICPT, Ohio OCPT) that test every *entered*
|
||||
brand side-by-side at the same sites with replication + LSD stats.
|
||||
The publisher is the university; the seed brands are in each row's
|
||||
`brand`. **This is where Pioneer / DEKALB / Channel / Brevant
|
||||
performance is legitimately available** (they enter these public
|
||||
trials even though we can't scrape their own sites). Caveat: a brand
|
||||
only appears where it *entered* — e.g. Brevant didn't enter Iowa
|
||||
ICPT, DEKALB/Channel didn't enter Illinois VT; absence in one
|
||||
program is a true negative, not missing data. Illinois adds wheat;
|
||||
Iowa/Ohio are corn+soy. (Purdue PCPP + other states deferred.)
|
||||
|
||||
**Recommended workflow when a farmer asks about performance**:
|
||||
|
||||
@@ -363,13 +376,16 @@ The MCP exposes TWO complementary surfaces:
|
||||
`syngenta-us.com/nk/yield-results` but the ASMX endpoint is
|
||||
fiddly; not yet scraped. The variety identity is in the corpus
|
||||
(`search_docs` finds it), just not the per-region trial yields.
|
||||
- **Pioneer trials** — ToS bans automation, so we have neither
|
||||
variety identity nor trial data. Direct the farmer to a
|
||||
Pioneer dealer.
|
||||
- **University extension trials** (Iowa State, Illinois,
|
||||
Purdue, etc.) — third-party trial data that publishes Pioneer
|
||||
+ competitors. Not in the corpus today; could be added in a
|
||||
future enrichment.
|
||||
- **Pioneer trials** — ToS bans automation, so we have no Pioneer
|
||||
*identity* data and don't scrape Pioneer's own results. BUT
|
||||
Pioneer *performance* IS now available indirectly via the
|
||||
university-extension trials (and the GH/ProHarvest plots) where
|
||||
Pioneer entered — search those for Pioneer head-to-head yields;
|
||||
for Pioneer variety specs, direct the farmer to a dealer.
|
||||
- **University extension trials** — NOW INDEXED for IL / IA / OH
|
||||
(`illinois_vt_trials` / `iowa_icpt_trials` / `ohio_ocpt_trials`,
|
||||
2024+2025). Purdue PCPP and other states (NE / WI / MN / the
|
||||
Dakotas / Kansas wheat) are not yet indexed — a future enrichment.
|
||||
|
||||
**Reading a GH plot report**:
|
||||
|
||||
|
||||
Reference in New Issue
Block a user