AI-powered tooling: GitHub Action, generate command, evals + leaderboard (#41)
Three features riding 2026 trends (agentic CI, codegen, evals), sharing one dependency-free Anthropic client (bin/lib/anthropic.mjs). 1. GitHub Action (action/) — run any skill in a consumer repo's CI: uses: mohitagw15856/pm-claude-skills/action@main. Composite action + run.mjs (loads the bundled SKILL.md, calls the API, exposes result as a step output / file). Docs with auto-PR-description example. 2. generate command — `npx pm-claude-skills generate --from <url|file>` turns a team's docs into a SKILL.md following the authoring standard (bin/generate.mjs, wired into the CLI; needs ANTHROPIC_API_KEY). 3. Skill evals + Leaderboard — evals/run-evals.mjs runs each case across models and scores output with an LLM judge (structure/completeness/usefulness/ grounding); scripts/build-leaderboard.mjs renders web/leaderboard.html (built in the Pages deploy, falls back to clearly-labelled example data). Linked from README, catalog, and playground. Offline-testable parts verified (prompt building, skill loading, graceful errors, leaderboard render). SkillCheck/audit/exports all green. Claude-Session: https://claude.ai/code/session_016JWn5jRD5tcEFKrubjQ6Px Co-authored-by: Claude <noreply@anthropic.com>
This commit is contained in:
+13
-1
@@ -9,7 +9,19 @@ each new wave of skills bumps the **major** version, extensions and fixes bump
|
||||
|
||||
## [Unreleased]
|
||||
|
||||
_Nothing yet._
|
||||
### Added
|
||||
- **GitHub Action** ([`action/`](action/)) — run any skill in CI: `uses:
|
||||
mohitagw15856/pm-claude-skills/action@main` to auto-write PR descriptions,
|
||||
changelogs, release notes, or code-review checklists. Composite action +
|
||||
dependency-free runner.
|
||||
- **`generate` command** — `npx pm-claude-skills generate --from <url|file>` turns a
|
||||
team's documentation into a `SKILL.md` that follows the authoring standard
|
||||
(`bin/generate.mjs`, needs `ANTHROPIC_API_KEY`).
|
||||
- **Skill evals + Leaderboard** — `evals/run-evals.mjs` scores skill output across models
|
||||
with an LLM judge (structure / completeness / usefulness / grounding);
|
||||
`scripts/build-leaderboard.mjs` renders a public `web/leaderboard.html` (built in the
|
||||
Pages deploy, linked from the README, catalog, and playground).
|
||||
- Shared, dependency-free Anthropic client (`bin/lib/anthropic.mjs`) used by all three.
|
||||
|
||||
## [19.0.0] — Security Auditor, Personas & Catalog — 2026-06-18
|
||||
|
||||
|
||||
Reference in New Issue
Block a user