Add RobSeeCo (Rob-See-Co + Innotech): 130 corn/soy varieties from the seed-guide PDF

Independent regional brand (Elkhorn, NE; rolled up Federal Hybrids / Big Cob /
Kiser / Rupp's grain-forage). No structured web catalog — the lineup lives in
the 2026 Seed Guide PDF — so this is a PDF-extraction identity source.

- robseeco (130: 87 corn + 43 soy; Rob-See-Co 105 + Innotech 25). Downloads the
  guide (cached under var/, gitignored), dedups the duplicated pages, parses the
  corn (p5-8) + soy (p19-26) ratings tables. Rotated/vertical column headers
  reconstructed by clustering rotated words; cells mapped by x-center alignment;
  descriptive 2-col cards joined by code for trait_stack + strengths. Masters
  Choice silage + sorghum scoped out (row-crop core only).
- SCALE 1-9, 9=Best (higher=better, like Bayer/Stine-corn); column map verified
  against the card bullets (e.g. RC2500 "rapid drydown"->Drydown 8, "short
  plant"->Plant Height 5; RC4779 "industry-leading tar spot"->Tar Spot 7).

Validation: all 130 chunk via rag.chunk.chunks_from_variety (0 errors), 0
duplicate keys, 0 out-of-range ratings (misalignment check), RM/MG all sane.

robseeco.com robots permissive (Squarespace AI-block toggle off; no ToS scrape
clause; PDF on a public CDN). docs: sources.json + README/CLAUDE inventory
(2,398 variety records) + rating-scales lesson (added RobSeeCo to the
higher=better group + the cross-vendor direction warning).
This commit is contained in:
2026-06-09 23:29:11 -04:00
parent 84ad2b1de6
commit 2425a79f0c
265 changed files with 23133 additions and 6 deletions
+14 -4
View File
@@ -128,12 +128,22 @@ rated. Covers brands Burrus / Power Plus / DONMARIO.
older corn hybrids publish only partial ratings (source gap); wheat
is identity-only.
**RobSeeCo** (Rob-See-Co + Innotech): `1-9, 9 = Best` (HIGHER =
better, same direction as Bayer / Stine-corn); `-` = not available.
Plant Height 9=Tall, Ear Height 9=High; Planting Rate L/ML/M/MH/H;
**Product Fit Geography A=All, C=Central, E=East, W=West, CW=Central+West**
(a placement code, not a rating). Soy disease uses letter codes
(R/MR/S) + an SCN source (e.g. Peking) + Rps gene. Sourced from the
seed-guide PDF, so it's identity + structured ratings but no live web
page per variety.
**⚠️ Direction is NOT consistent across the independents.** HIGHER =
better: Bayer, Golden Harvest, Stine(corn), ProHarvest(disease),
Burrus(1-10), 1st Choice(0-10). LOWER = better (1 = best): NK,
AgriPro, **Latham**. Qualitative (no direction): Stine(soy),
ProHarvest(general/agronomic), AgriPro(agronomic), Ebbert's. A raw
numeric rating is meaningless without its `_scale_direction`.
Burrus(1-10), 1st Choice(0-10), **RobSeeCo(1-9)**. LOWER = better
(1 = best): NK, AgriPro, **Latham**. Qualitative (no direction):
Stine(soy), ProHarvest(general/agronomic), AgriPro(agronomic),
Ebbert's. A raw numeric rating is meaningless without its
`_scale_direction`.
**Always check the chunk's "Rating scale" line or call
`lookup_variety(source_key)` and look at `_scale_direction` if you