Add RobSeeCo (Rob-See-Co + Innotech): 130 corn/soy varieties from the seed-guide PDF #18
Adds RobSeeCo (independent regional seed company, Elkhorn, NE; markets Rob-See-Co® + Innotech®, and rolled up Federal Hybrids / Big Cob / Kiser / Rupp's grain-forage) as a variety-identity source.
Unlike the other independents, RobSeeCo has no structured web catalog (the site is a Squarespace visual grid). The lineup lives in the 2026 Seed Guide PDF, so `robseeco` is a PDF-extraction source.

… `var/`, gitignored) … `-` = n/a; soy letter codes R/MR/S

The hard part, and how it was de-risked. The guide's ratings tables have rotated (vertical) column headers, and every page is duplicated. The scraper dedups pages, reconstructs the rotated headers by clustering words, and maps each data cell to its column by x-center alignment (whitespace tokenization is unreliable around sparse cells). The column map was verified against the descriptive-card bullets per variety, and I independently re-checked it before merge.
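For reviewers, the page dedup and x-center column mapping described above can be illustrated with a minimal sketch. This is not the scraper's actual code: the `Word` type, the function names, the column names, the coordinates, and the tolerance are all hypothetical, and the header x-centers are assumed to have been reconstructed already.

```python
from dataclasses import dataclass
from hashlib import sha256


@dataclass(frozen=True)
class Word:
    """A text token with its horizontal extent on the page (hypothetical type)."""
    text: str
    x0: float  # left edge
    x1: float  # right edge

    @property
    def x_center(self) -> float:
        return (self.x0 + self.x1) / 2


def dedup_pages(pages: list[str]) -> list[str]:
    """Keep the first copy of each page, comparing whitespace-normalized text."""
    seen: set[str] = set()
    unique: list[str] = []
    for text in pages:
        digest = sha256(" ".join(text.split()).encode("utf-8")).hexdigest()
        if digest not in seen:
            seen.add(digest)
            unique.append(text)
    return unique


def assign_columns(
    cells: list[Word],
    column_centers: dict[str, float],
    tolerance: float = 12.0,  # assumed value, in page units
) -> dict[str, str]:
    """Map each cell to the column whose x-center is nearest, within tolerance."""
    row: dict[str, str] = {}
    for cell in cells:
        name, center = min(
            column_centers.items(),
            key=lambda item: abs(item[1] - cell.x_center),
        )
        if abs(center - cell.x_center) <= tolerance:
            row[name] = cell.text
    return row


# Hypothetical header centers and a sparse row: the "stalk" cell is empty.
centers = {"emergence": 100.0, "stalk": 130.0, "root": 160.0}
sparse_row = [Word("8", 97.0, 103.0), Word("7", 157.0, 163.0)]
print(assign_columns(sparse_row, centers))  # {'emergence': '8', 'root': '7'}
```

Splitting the same row on whitespace would yield `["8", "7"]` and put the `7` under the second column, which is the failure mode that x-center alignment avoids.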
Legality: robseeco.com is Squarespace with the AI-crawler block off (the AI-bot UAs are grouped with `*` under standard exclusions only, with no `Disallow: /`), there is no anti-scraping ToS clause, and the guide PDF is on a public CDN URL. UA: `seed-mcp-scraper`.

Docs: `sources.json` + README/CLAUDE inventory (now 2,398 variety + 6,787 trial records) + rating-scales lesson (RobSeeCo added to the higher=better group, plus the cross-vendor direction warning). CI rebuilds the index from the committed corpus.
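The robots.txt reasoning in the legality note can be checked mechanically with the standard library's `urllib.robotparser`. The sketch below is illustrative only: `SAMPLE_ROBOTS` is a made-up file shaped like the situation described (AI-bot UAs grouped with `*`, path-level exclusions, no `Disallow: /`), not a copy of the real robseeco.com robots.txt, and the `example.com` URLs and the `may_fetch` helper are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt, NOT the real robseeco.com file.
SAMPLE_ROBOTS = """\
User-agent: GPTBot
User-agent: *
Disallow: /config
Disallow: /search
"""


def may_fetch(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Path-level exclusions leave other paths open to the scraper's UA.
print(may_fetch(SAMPLE_ROBOTS, "seed-mcp-scraper", "https://example.com/s/guide.pdf"))

# A blanket "Disallow: /" would block the same request.
blanket = "User-agent: *\nDisallow: /\n"
print(may_fetch(blanket, "seed-mcp-scraper", "https://example.com/s/guide.pdf"))
```

Under these assumptions the first call prints `True` and the second prints `False`.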