b98965a68a
Adds the **first non-Syngenta trial coverage** to the corpus:
| Source | Docs | Publisher | URL pattern |
|---|---|---|---|
| lg_plot_reports | 1,304 | LG Seeds (AgReliant) | lgseeds.com/performance/{crop} JSON XHR |
| agrigold_plot_reports | 1,003 | AgriGold (AgReliant) | agrigold.com/{crop}/performance/{crop}-yield-results |
Total trial coverage now: gh_plot_reports (4,299) + agripro_trials (14) +
lg_plot_reports (1,304) + agrigold_plot_reports (1,003) = 6,620 trial docs.
**Both scrapers follow the gh_plot_reports template** — same RateLimitedSession
primitive, same TrialResult/PlotReport dataclass shape, same data_type="trial"
sidecar convention. The trial chunker (`rag/chunk.py:_render_trial_chunk`) now
recognizes both new sources and routes them to `_render_gh_plot_chunk`, since
their sidecars are structurally identical and only the brand label differs.
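A minimal sketch of that routing, not the actual `rag/chunk.py` code. The sidecar field names (`crop`, `city`, `state`, `year`), the label table, and the one-line renderer body are assumptions for illustration; `gh_plot_reports` is left out because its brand label is not stated here.

```python
# Hypothetical label table: source directory name -> brand label shown in the chunk.
SHARED_RENDERER_SOURCES = {
    "lg_plot_reports": "LG Seeds",
    "agrigold_plot_reports": "AgriGold",
}


def _render_gh_plot_chunk(sidecar: dict, brand_label: str) -> str:
    """Stand-in for the shared renderer; the real one emits many more fields."""
    return (
        f"{sidecar['crop']} yield trial, {sidecar['city']}, "
        f"{sidecar['state']}, {sidecar['year']} ({brand_label} plot report)"
    )


def render_trial_chunk(source: str, sidecar: dict) -> str:
    """Route a trial sidecar to the shared renderer with its brand label."""
    if sidecar.get("data_type") != "trial":
        raise ValueError("not a trial sidecar")
    if source not in SHARED_RENDERER_SOURCES:
        raise ValueError(f"unrecognized trial source: {source}")
    return _render_gh_plot_chunk(sidecar, SHARED_RENDERER_SOURCES[source])
```

Keeping the per-source difference in a lookup table means a further structurally identical source would only need one new entry.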
**LG specifics:**
- POST `/performance/{crop}/GetPlots/` returns sparse listing (id, year, lat/lng)
- GET `/performance/{crop}/GetPlotData/?PlotId=X&IsSilage=Y` returns full detail
with state, cooperator, planting/harvest dates, and **top-5 hybrids** (LG +
competitors). Top-5 is what LG publishes publicly; not the full ranking.
- 4 crops: corn (963), soybeans (287), sorghum (10), silage (50). Alfalfa is
absent because LG doesn't run alfalfa plots; that's variety-only data.
- 301 gotcha: www.lgseeds.com redirects to lgseeds.com, and the POST body is
dropped on the redirect, so the scraper hits the apex host directly.
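The two-step LG flow can be sketched with stdlib request objects. This is a sketch under assumptions, not the scraper itself: the JSON listing payload, the `true`/`false` encoding of `IsSilage`, and the helper names are hypothetical. Only the paths, the query parameter names, and the apex host come from the notes above. Nothing is sent over the network here.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request

# Apex host on purpose: the www host redirects and the POST body is lost.
LG_HOST = "https://lgseeds.com"


def build_listing_request(crop: str, payload: dict) -> Request:
    """POST /performance/{crop}/GetPlots/ (payload shape is an assumption)."""
    body = json.dumps(payload).encode("utf-8")
    return Request(
        f"{LG_HOST}/performance/{crop}/GetPlots/",
        data=body,
        method="POST",
        headers={"Content-Type": "application/json"},
    )


def build_detail_request(crop: str, plot_id: int, is_silage: bool) -> Request:
    """GET /performance/{crop}/GetPlotData/?PlotId=X&IsSilage=Y."""
    # Assumed encoding: lowercase "true"/"false" for the IsSilage flag.
    query = urlencode({"PlotId": plot_id, "IsSilage": str(is_silage).lower()})
    return Request(f"{LG_HOST}/performance/{crop}/GetPlotData/?{query}", method="GET")
```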
**AgriGold specifics:**
- Listing: GET `/{crop}/performance/{crop}-yield-results?harvestYear={year}`
(server-rendered HTML, ~1MB; 408 corn plots in 2025 alone)
- Detail: GET `/{crop_url}/performance/{slug}/{plot_id}` returns the **full
ranking** (not just top-5) plus rich plot management metadata: tillage,
previous crop, fungicide, herbicide, insecticide, irrigation, soil type,
row width, population. Most metadata-rich of the three trial sources.
- Soybean URL slug is singular: `/soybeans/performance/soybean-yield-results/`
- Columns: Rank | Brand | Product | Trait | Ck | H20 (moisture) | Test Wt. |
Yield | Adj Yield (check-adjusted)
- 2 crops: corn (849) + soybeans (157)
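The URL patterns above, including the singular soybean slug, can be captured in a small lookup. A sketch only: the function names and the slug table are hypothetical, and the host is taken from the sample chunk URL.

```python
AGRIGOLD_HOST = "https://www.agrigold.com"

# crop_url -> results slug. Corn follows {crop}-yield-results; soybeans is the
# singular exception noted above.
RESULT_SLUGS = {
    "corn": "corn-yield-results",
    "soybeans": "soybean-yield-results",
}


def listing_url(crop_url: str, year: int) -> str:
    """Server-rendered listing page for one crop and harvest year."""
    slug = RESULT_SLUGS[crop_url]
    return f"{AGRIGOLD_HOST}/{crop_url}/performance/{slug}?harvestYear={year}"


def detail_url(crop_url: str, plot_id: int) -> str:
    """Detail page for one plot, which carries the full ranking."""
    slug = RESULT_SLUGS[crop_url]
    return f"{AGRIGOLD_HOST}/{crop_url}/performance/{slug}/{plot_id}"
```

An explicit table avoids deriving the slug from the crop name, which is exactly where the plural/singular mismatch would bite.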
**Indexer needs no changes** — `rag/index.py` auto-discovers any directory
under corpus/ and routes by data_type. Both new sources flow into the
existing trial collection and surface via `search_trials`.
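A rough illustration of why no indexer change is needed, assuming each source directory holds `*.json` sidecars with a `data_type` key. The file layout and the function name are assumptions, not the actual `rag/index.py` logic.

```python
import json
from collections import defaultdict
from pathlib import Path


def discover_by_data_type(corpus_root: Path) -> dict[str, list[Path]]:
    """Walk every directory under corpus_root and group sidecars by data_type."""
    routed: dict[str, list[Path]] = defaultdict(list)
    for source_dir in sorted(p for p in corpus_root.iterdir() if p.is_dir()):
        # Assumed layout: sidecars sit directly inside the source directory.
        for sidecar_path in sorted(source_dir.glob("*.json")):
            sidecar = json.loads(sidecar_path.read_text(encoding="utf-8"))
            routed[sidecar.get("data_type", "unknown")].append(sidecar_path)
    return dict(routed)
```

Because routing keys off the sidecar and not the directory name, a new source directory with `data_type="trial"` lands in the trial group without any registration step.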
Years scraped: 2024+2025 (matching gh_plot_reports baseline). 2023 is
available via `--include-2023` on either scraper for future backfill.
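One plausible shape for the year selection, sketched with `argparse`. Only the `--include-2023` flag name and the 2024+2025 baseline come from the notes above; the function name and help text are hypothetical.

```python
import argparse

BASELINE_YEARS = (2024, 2025)  # matches the gh_plot_reports baseline


def parse_years(argv=None) -> list[int]:
    """Return the harvest years to scrape, optionally backfilling 2023."""
    parser = argparse.ArgumentParser(description="Plot report scraper (sketch)")
    parser.add_argument(
        "--include-2023",
        action="store_true",
        help="also scrape the 2023 harvest year",
    )
    args = parser.parse_args(argv)
    years = list(BASELINE_YEARS)
    if args.include_2023:
        years.insert(0, 2023)
    return years
```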
Soybean yield trial — Dale, IN, 2024
- Source: AgriGold plot report (cross-vendor head-to-head)
- Vendor: AgReliant Genetics / AgriGold
- Crop: Soybean
- State: IN
- County: Spencer
- City: Dale
- Year: 2024
- Plot ID: 142371
- Cooperator: Mark And Luke Lubbehusen
- Plot average: 96.35 BU/Ac
- Planted: 2024-04-26
- Harvested: 2024-09-18
- Population: 140,000 seeds/acre
- Row width: 15.0"
- # Rows: 11
- Soil type: Clay
- Tillage: Conventional
- Previous crop: Corn
- Irrigation: None
- Fungicide: Veltyma
- Herbicide: Liberty, Zidua
- Insecticide: Hero
- URL: https://www.agrigold.com/soybeans/performance/soybean-yield-results/142371
Results (by rank)
| Rank | Brand | Product | Trait | Ck | H20 | Test Wt. | Yield | Adj Yield |
|---|---|---|---|---|---|---|---|---|
| 12 | Xitavo | XO3224E | Enlist E3 | - | 11.4 | 58.0 | 95.6 | 95.6 |
| 4 | AgriGold | G3404E3 | Enlist E3 | - | 13.0 | - | 104.7 | 104.7 |
| 1 | AgriGold | G3552XF | XTENDFlex | - | 11.2 | 57.0 | 116.1 | 116.1 |
| 2 | AgriGold | G3577E3 | Enlist E3 | - | 11.1 | 56.0 | 109.5 | 109.5 |
| 3 | AgriGold | G3725XF | XTENDFlex | - | 10.5 | 58.0 | 106.2 | 106.2 |
| 15 | AgriGold | G3724XF | XTENDFlex | - | 11.0 | 58.0 | 92.4 | 92.4 |
| 6 | AgriGold | G3804XF | XTENDFlex | - | 10.8 | 58.0 | 101.5 | 101.5 |
| 5 | AgriGold | G3854E3 | Enlist E3 | - | 10.9 | 58.0 | 103.9 | 103.9 |
| 10 | Xitavo | XO3855E | Enlist E3 | - | 10.9 | 60.0 | 96.4 | 96.4 |
| 8 | Stine | 39EH23 | Enlist E3 | - | 10.6 | 58.0 | 98.7 | 98.7 |
| 16 | AgriGold | G4094XF | XTENDFlex | - | 11.3 | 58.0 | 89.4 | 89.4 |
| 11 | AgriGold | G4051E3 | Enlist E3 | - | 10.8 | 59.0 | 96.0 | 96.0 |
| 14 | AgriGold | G4184XF | XTENDFlex | - | 11.1 | 59.0 | 93.0 | 93.0 |
| 13 | AgriGold | G4204E3 | Enlist E3 | - | 10.6 | 58.0 | 93.1 | 93.1 |
| 17 | Pioneer | P42A84E | Enlist E3 | - | 10.7 | 58.0 | 86.1 | 86.1 |
| 7 | AgriGold | G4430XF | XTENDFlex | - | 10.8 | 57.0 | 98.9 | 98.9 |
| 19 | Stine | 44EH23 | Enlist E3 | - | 11.9 | 56.0 | 75.3 | 75.3 |
| 18 | AgriGold | G4459E3 | Enlist E3 | - | 14.3 | 56.0 | 77.3 | 77.3 |
| 9 | AgriGold | G4051E3 | Enlist E3 | - | 11.2 | 57.0 | 96.6 | 96.6 |
Top 5 by Yield: G3552XF (AgriGold) 116.1, G3577E3 (AgriGold) 109.5, G3725XF (AgriGold) 106.2, G3404E3 (AgriGold) 104.7, G3854E3 (AgriGold) 103.9.
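The summary line can be derived from the result rows along these lines. A sketch, assuming a pared-down `TrialResult` with hypothetical field names (the real dataclass carries more columns) and using six rows copied from the table above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TrialResult:
    # Hypothetical minimal shape; field names are assumptions.
    rank: int
    brand: str
    product: str
    yield_bu_ac: float


def top_n_by_yield(results: list[TrialResult], n: int = 5) -> str:
    """Format the n highest-yielding entries as a one-line summary."""
    best = sorted(results, key=lambda r: r.yield_bu_ac, reverse=True)[:n]
    entries = ", ".join(f"{r.product} ({r.brand}) {r.yield_bu_ac:.1f}" for r in best)
    return f"Top {n} by Yield: {entries}."


rows = [
    TrialResult(12, "Xitavo", "XO3224E", 95.6),
    TrialResult(4, "AgriGold", "G3404E3", 104.7),
    TrialResult(1, "AgriGold", "G3552XF", 116.1),
    TrialResult(2, "AgriGold", "G3577E3", 109.5),
    TrialResult(3, "AgriGold", "G3725XF", 106.2),
    TrialResult(5, "AgriGold", "G3854E3", 103.9),
]
summary = top_n_by_yield(rows)
```

Sorting on the yield column, not the rank column, keeps the summary correct even when rows arrive in product order as they do in the table.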