Add ProHarvest Seeds: 119 varieties + 161 cross-vendor plot reports (#16)
Image rebuild (skip scrape) / build (push) Successful in 5m46s

Co-authored-by: claude <claude@jpaul.io>
Co-committed-by: claude <claude@jpaul.io>
This commit was merged in pull request #16.
This commit is contained in:
2026-06-04 21:05:30 -04:00
committed by Claude (agent)
parent e356633d4f
commit 22e8092faf
567 changed files with 80023 additions and 8 deletions
+20
View File
@@ -92,6 +92,16 @@ as Best/Good/Fair/Poor labels which are qualitative.)
data is publicly available, so most disease/agronomic ratings are
absent from Beck's records in this corpus.
**ProHarvest Seeds**: **mixed scales** on one record. *Disease
Tolerance* is `1-9 numeric, 9 = best / most tolerant` (same direction
as Bayer — no flip; `NA` = not rated). *General Characteristics* and
*Agronomic Features* are qualitative (`Excellent / Very Good / Good /
Average`) with a few raw numerics (GDD pollination/black-layer, kernel
rows). *Soil Adaptability* uses `HR` (highly recommended) / `R`
(recommended). The single `_scale_direction` line on the record states
all three. Ebbert's-style independent brand, but ratings ARE parsed
into structured groups so they're retrievable.
**Always check the chunk's "Rating scale" line or call
`lookup_variety(source_key)` and look at `_scale_direction` if you
are unsure.** Cross-vendor comparisons are valid AFTER you've
@@ -275,6 +285,16 @@ The MCP exposes TWO complementary surfaces:
multi-location wheat performance for Northern Plains / Pacific
Northwest / Plains regions. Variety + per-location yields
preserved verbatim.
- **LG Seeds + AgriGold plot reports** (AgReliant) — additional
cross-vendor corn/soy plots (same head-to-head structure as the
GH reports).
- **ProHarvest Seeds plot reports** (corn + soy, 2024+2025) —
per-cooperator harvest reports from an independent Corn Belt brand.
Many are cross-vendor (ProHarvest / Apex vs Pioneer / DEKALB /
Becks / Merschman, etc.). Structured rank/yield/%H2O/test-weight
tables where the PDF fits ProHarvest's template; foreign-format
third-party reports are kept verbatim (`raw_text`) so the yields
are still searchable. Image-only PDFs (no text layer) are skipped.
**Recommended workflow when a farmer asks about performance**: