rename: ppls-docs → crop-chem-docs
Repo/project rename to better reflect scope. PPLS is EPA's term for
their Pesticide Product Label System — accurate when the corpus was
EPA-only, narrow now that it also pulls from Bayer's own catalog
(and may expand to Syngenta/Corteva/BASF/FMC labels in the future).
crop-chem-docs scopes flexibly without acronyms to explain.
Renames:
- directory: ppls-docs → crop-chem-docs
- PRODUCT_NAME: ppls → crop_chem
- Chroma collection: ppls_docs → crop_chem_docs (in-place via .modify(), no re-embed)
- BM25 db: bm25/ppls_docs.db → bm25/crop_chem_docs.db
- MCP tool name: ppls_api_lessons → crop_chem_api_lessons
- FastMCP server name: ppls-docs → crop-chem-docs
- Env vars: PPLS_CORPUS_ROOT → CORPUS_ROOT
PPLS_CHROMA_DIR → CHROMA_DIR_OVERRIDE
- User-Agent: ppls-docs-scraper → crop-chem-docs-scraper
Preserved (intentional, correct):
- epa_ppls (source id) — refers specifically to EPA's PPLS database
- "EPA PPLS" mentions in regulatory text (lessons.md, server docstrings)
- PPLS_API_BASE / PPLS_PDF_BASE / PPLS_INDEX_URL_TEMPLATE in
scrape/sources/epa_ppls.py — these point at EPA's actual endpoints
Memory entries get updated in a follow-up commit so the rename is
isolated.
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
This commit is contained in:
+3
-3
@@ -44,13 +44,13 @@ re-fetch.
|
||||
### Corpus location
|
||||
|
||||
Default: `corpus/` at the repo root. Override with the
|
||||
`PPLS_CORPUS_ROOT` env var to route the corpus to external storage
|
||||
`CORPUS_ROOT` env var to route the corpus to external storage
|
||||
(USB drive, NAS mount, secondary partition):
|
||||
|
||||
```bash
|
||||
export PPLS_CORPUS_ROOT=/mnt/big-disk/ppls-corpus
|
||||
export CORPUS_ROOT=/mnt/big-disk/crop-chem-corpus
|
||||
python -m scrape.runner --source bayer --limit 20
|
||||
# writes to /mnt/big-disk/ppls-corpus/bayer/...
|
||||
# writes to /mnt/big-disk/crop-chem-corpus/bayer/...
|
||||
```
|
||||
|
||||
All sources honor the same env var; each creates its own
|
||||
|
||||
Reference in New Issue
Block a user