justin b98965a68a Two new trial sources: LG Seeds + AgriGold plot reports (+2,307 cross-vendor yield trials)
Adds the **first non-Syngenta trial coverage** to the corpus:

| Source | Docs | Publisher | URL pattern |
|---|---|---|---|
| lg_plot_reports | 1,304 | LG Seeds (AgReliant) | lgseeds.com/performance/{crop} JSON XHR |
| agrigold_plot_reports | 1,003 | AgriGold (AgReliant) | agrigold.com/{crop}/performance/{crop}-yield-results |

Total trial coverage now: gh_plot_reports (4,299) + agripro_trials (14) +
lg_plot_reports (1,304) + agrigold_plot_reports (1,003) = 6,620 trial docs.

**Both scrapers follow the gh_plot_reports template** — same RateLimitedSession
primitive, same TrialResult/PlotReport dataclass shape, same data_type="trial"
sidecar convention. The trial chunker (`rag/chunk.py:_render_trial_chunk`) is
extended to recognize both new sources; they share `_render_gh_plot_chunk`
since their sidecars are structurally identical (only the brand label differs).

**LG specifics:**
- POST `/performance/{crop}/GetPlots/` returns sparse listing (id, year, lat/lng)
- GET `/performance/{crop}/GetPlotData/?PlotId=X&IsSilage=Y` returns full detail
  with state, cooperator, planting/harvest dates, and **top-5 hybrids** (LG +
  competitors). Top-5 is what LG publishes publicly; not the full ranking.
- 4 crops: corn (963), soybeans (287), sorghum (10), silage (50). Alfalfa is
  absent because LG doesn't run alfalfa plots; that's variety-only data.
- 301 gotcha: www.lgseeds.com 301-redirects to lgseeds.com, and the redirect
  drops the POST body, so the scraper hits the apex host directly.
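
The two-step LG flow above can be sketched roughly as follows. This is not the scraper's actual code: `session` stands for any requests-style object (the real one is a RateLimitedSession whose interface is not shown here), the `"id"` key in the listing is an assumed field name, and the `true`/`false` encoding of `IsSilage` is a guess, since the commit only shows the placeholder `Y`.

```python
from urllib.parse import urlencode

# Apex host on purpose: www.lgseeds.com answers with a 301, and the
# redirected request loses its POST body.
LG_BASE = "https://lgseeds.com"


def lg_listing_url(crop: str) -> str:
    """URL for the sparse plot listing (sent as a POST)."""
    return f"{LG_BASE}/performance/{crop}/GetPlots/"


def lg_detail_url(crop: str, plot_id: int, is_silage: bool) -> str:
    """URL for one plot's full detail (sent as a GET)."""
    # Assumption: IsSilage is passed as "true"/"false".
    query = urlencode({"PlotId": plot_id, "IsSilage": str(is_silage).lower()})
    return f"{LG_BASE}/performance/{crop}/GetPlotData/?{query}"


def fetch_lg_plots(session, crop: str, is_silage: bool = False) -> list[dict]:
    """Two-step crawl: list plot ids, then fetch each plot's detail.

    `session` needs post()/get() methods whose responses expose .json();
    the "id" key is an assumed field name.
    """
    listing = session.post(lg_listing_url(crop)).json()
    return [
        session.get(lg_detail_url(crop, plot["id"], is_silage)).json()
        for plot in listing
    ]
```

Building URLs from a single apex-host constant keeps the redirect workaround in one place.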

**AgriGold specifics:**
- Listing: GET `/{crop}/performance/{crop}-yield-results?harvestYear={year}`
  (server-rendered HTML, ~1MB; 408 corn plots in 2025 alone)
- Detail: GET `/{crop_url}/performance/{slug}/{plot_id}` returns the **full
  ranking** (not just top-5) plus rich plot management metadata: tillage,
  previous crop, fungicide, herbicide, insecticide, irrigation, soil type,
  row width, population. Most metadata-rich of the three trial sources.
- Soybean URL slug is singular: `/soybeans/performance/soybean-yield-results/`
- Columns: Rank | Brand | Product | Trait | Ck | H20 (moisture) | Test Wt. |
  Yield | Adj Yield (check-adjusted)
- 2 crops: corn (849) + soybeans (157)
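
A minimal sketch of the AgriGold URL scheme described above, including the singular soybean slug. The helper names and the `AGRIGOLD_PATHS` table are hypothetical; the corn slug is inferred from the `{crop}-yield-results` pattern rather than confirmed separately.

```python
AGRIGOLD_BASE = "https://www.agrigold.com"

# crop -> (path segment, results slug). The soybean slug is singular.
AGRIGOLD_PATHS = {
    "corn": ("corn", "corn-yield-results"),
    "soybeans": ("soybeans", "soybean-yield-results"),
}


def agrigold_listing_url(crop: str, year: int) -> str:
    """Server-rendered listing page for one crop and harvest year."""
    crop_url, slug = AGRIGOLD_PATHS[crop]
    return f"{AGRIGOLD_BASE}/{crop_url}/performance/{slug}?harvestYear={year}"


def agrigold_detail_url(crop: str, plot_id: int) -> str:
    """Detail page carrying the full ranking for one plot."""
    crop_url, slug = AGRIGOLD_PATHS[crop]
    return f"{AGRIGOLD_BASE}/{crop_url}/performance/{slug}/{plot_id}"
```

For example, `agrigold_detail_url("soybeans", 144295)` yields the same URL recorded in the Cedar Hill, MO sample document.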

**Indexer needs no changes** — `rag/index.py` auto-discovers any directory
under corpus/ and routes by data_type. Both new sources flow into the
existing trial collection and surface via `search_trials`.
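
To make the auto-discovery idea concrete, here is a small sketch of walking `corpus/` and grouping sidecars by `data_type`. It is not the code in `rag/index.py`: the function name, the one-JSON-sidecar-per-document layout, and the `"unknown"` fallback are all assumptions for illustration.

```python
import json
from collections import defaultdict
from pathlib import Path


def discover_by_data_type(corpus_root: Path) -> dict[str, list[Path]]:
    """Group sidecar files under corpus_root by their data_type field."""
    routed: dict[str, list[Path]] = defaultdict(list)
    # Every subdirectory is treated as a source; none is hard-coded.
    for source_dir in sorted(p for p in corpus_root.iterdir() if p.is_dir()):
        for sidecar in sorted(source_dir.glob("*.json")):
            meta = json.loads(sidecar.read_text(encoding="utf-8"))
            routed[meta.get("data_type", "unknown")].append(sidecar)
    return dict(routed)
```

Under this scheme a new source directory whose sidecars carry `data_type="trial"` lands in the trial group without any indexer change.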

Years scraped: 2024+2025 (matching gh_plot_reports baseline). 2023 is
available via `--include-2023` on either scraper for future backfill.
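
One plausible way the `--include-2023` switch could be wired, sketched with `argparse`. Only the flag name and the 2024+2025 default come from this commit; the function name and help text are made up for the example.

```python
import argparse


def parse_years(argv=None) -> list[int]:
    """Return the harvest years to scrape based on CLI flags."""
    parser = argparse.ArgumentParser(description="Plot report scraper (sketch)")
    parser.add_argument(
        "--include-2023",
        action="store_true",
        help="also scrape 2023 for backfill",
    )
    args = parser.parse_args(argv)
    years = [2024, 2025]  # default matches the gh_plot_reports baseline
    if args.include_2023:
        years.insert(0, 2023)
    return years
```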
2026-05-26 22:26:24 -04:00


Soybean yield trial — Cedar Hill, MO, 2024

  • Source: AgriGold plot report (cross-vendor head-to-head)
  • Vendor: AgReliant Genetics / AgriGold
  • Crop: Soybean
  • State: MO
  • County: Jefferson
  • City: Cedar Hill
  • Year: 2024
  • Plot ID: 144295
  • Cooperator: Jeff Bonacker
  • Plot average: 73.26 BU/Ac
  • Planted: 2024-05-01
  • Harvested: 2024-10-04
  • Population: 150,000 seeds/acre
  • Row width: 15.0"
  • # Rows: 23
  • Soil type: Silty Clay Loam
  • Tillage: No Till
  • Previous crop: Corn
  • Irrigation: None
  • URL: https://www.agrigold.com/soybeans/performance/soybean-yield-results/144295

Results (by rank)

| Rank | Brand | Product | Trait | Ck | H20 | Test Wt. | Yield | Adj Yield |
|---|---|---|---|---|---|---|---|---|
| 1 | AgriGold | G4204E3 | Enlist E3 | - | 13.0 | 59.1 | 80.4 | 80.4 |
| 2 | AgriGold | G4051E3 | Enlist E3 | - | 13.4 | 57.5 | 79.6 | 79.6 |
| 3 | AgriGold | G3577E3 | Enlist E3 | - | 13.6 | 56.9 | 78.9 | 78.9 |
| 4 | AgriGold | G3404E3 | Enlist E3 | - | 13.8 | 57.7 | 74.4 | 74.4 |
| 5 | AgriGold | G3957E3 | Enlist E3 | - | 14.2 | 58.1 | 73.4 | 73.4 |
| 6 | AgriGold | G4393E3 | Enlist E3 | - | 13.7 | 57.8 | 73.0 | 73.0 |
| 7 | AgriGold | G4459E3 | Enlist E3 | - | 13.4 | 59.2 | 72.1 | 72.1 |
| 8 | AgriGold | G3451E3 | Enlist E3 | - | 14.7 | 57.0 | 70.9 | 70.9 |
| 9 | AgriGold | G4151E3 | Enlist E3 | - | 13.9 | 57.4 | 66.1 | 66.1 |
| 10 | AgriGold | G4655E3 | Enlist E3 | - | 13.9 | 58.2 | 63.8 | 63.8 |

Top 5 by Yield: G4204E3 (AgriGold) 80.4, G4051E3 (AgriGold) 79.6, G3577E3 (AgriGold) 78.9, G3404E3 (AgriGold) 74.4, G3957E3 (AgriGold) 73.4.
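
The summary figures in this document can be re-derived from the results rows. The sketch below assumes the plot average is the plain arithmetic mean of the Yield column (which agrees with the 73.26 BU/Ac reported here) and that the top-5 line is a descending sort on Yield; the variable names are illustrative only.

```python
# (product, yield in BU/Ac) pairs copied from the results table above.
rows = [
    ("G3451E3", 70.9), ("G3957E3", 73.4), ("G4151E3", 66.1),
    ("G4393E3", 73.0), ("G3404E3", 74.4), ("G3577E3", 78.9),
    ("G4051E3", 79.6), ("G4204E3", 80.4), ("G4459E3", 72.1),
    ("G4655E3", 63.8),
]

# Arithmetic mean of all entries, rounded to two decimals.
plot_average = round(sum(y for _, y in rows) / len(rows), 2)

# Five highest-yielding products, best first.
top5 = sorted(rows, key=lambda r: r[1], reverse=True)[:5]

print(plot_average)               # 73.26
print([name for name, _ in top5])
```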