feat: Qualification Matrix + QuickSpecs bundles (#5)
This commit was merged in pull request #5.
This commit is contained in:
@@ -0,0 +1,27 @@
|
||||
# scrape/quickspecs/
|
||||
|
||||
Static HTML fixtures for HPE QuickSpecs documents that aren't reachable
|
||||
from the runner (www.hpe.com edge drops connections from datacenter IPs
|
||||
with non-browser User-Agents — verified 2026-05-22 with curl, wget, and
|
||||
Anthropic's WebFetch).
|
||||
|
||||
## Workflow
|
||||
|
||||
1. Operator visits `https://www.hpe.com/psnow/doc/<doc_id>` in a real
|
||||
browser, opens DevTools → Elements → Copy the `<body>` HTML.
|
||||
2. Save it at `scrape/quickspecs/<doc_id>.html`.
|
||||
3. Add a bundle entry in `scrape/bundles.py` with `mode="html-file"`.
|
||||
4. `python -m scrape.runner --bundle hvm_quickspecs --force` reads the
|
||||
committed HTML and writes `corpus/hvm_quickspecs/<doc_id>.{md,json}`.
|
||||
5. Re-index and ship.
|
||||
|
||||
QuickSpecs only update every few months (HPE rebrand, new SKU added,
|
||||
feature change). When a new version drops, refresh the local HTML
|
||||
file and re-run the scrape.
|
||||
|
||||
## Current fixtures
|
||||
|
||||
- `a50004260enw.html` — HPE Morpheus VM Essentials Software QuickSpecs
|
||||
(Version 4, 02-February-2026). SKUs: S5Q81AAE (1-yr), S5Q82AAE
|
||||
(3-yr), S5Q83AAE (5-yr) — all "per Socket E-LTU" with Tech Care
|
||||
Essentials included.
|
||||
Reference in New Issue
Block a user