AI-powered tooling: GitHub Action, generate command, evals + leaderboard (#41)

Three features riding 2026 trends (agentic CI, codegen, evals), sharing one
dependency-free Anthropic client (bin/lib/anthropic.mjs).

1. GitHub Action (action/) — run any skill in a consumer repo's CI:
   uses: mohitagw15856/pm-claude-skills/action@main. Composite action +
   run.mjs (loads the bundled SKILL.md, calls the API, exposes result as a
   step output / file). Docs with auto-PR-description example.

2. generate command — `npx pm-claude-skills generate --from <url|file>` turns
   a team's docs into a SKILL.md following the authoring standard
   (bin/generate.mjs, wired into the CLI; needs ANTHROPIC_API_KEY).

3. Skill evals + Leaderboard — evals/run-evals.mjs runs each case across models
   and scores output with an LLM judge (structure/completeness/usefulness/
   grounding); scripts/build-leaderboard.mjs renders web/leaderboard.html
   (built in the Pages deploy, falls back to clearly-labelled example data).
   Linked from README, catalog, and playground.

Offline-testable parts verified (prompt building, skill loading, graceful
errors, leaderboard render). SkillCheck/audit/exports all green.


Claude-Session: https://claude.ai/code/session_016JWn5jRD5tcEFKrubjQ6Px

Co-authored-by: Claude <noreply@anthropic.com>
This commit is contained in:
mohitagw15856
2026-06-18 08:37:40 +01:00
committed by GitHub
parent 288a340dbe
commit 51bf4be52f
17 changed files with 644 additions and 2 deletions
+13 -1
View File
@@ -9,7 +9,19 @@ each new wave of skills bumps the **major** version, extensions and fixes bump
## [Unreleased]
_Nothing yet._
### Added
- **GitHub Action** ([`action/`](action/)) — run any skill in CI: `uses:
mohitagw15856/pm-claude-skills/action@main` to auto-write PR descriptions,
changelogs, release notes, or code-review checklists. Composite action +
dependency-free runner.
- **`generate` command** — `npx pm-claude-skills generate --from <url|file>` turns a
team's documentation into a `SKILL.md` that follows the authoring standard
(`bin/generate.mjs`, needs `ANTHROPIC_API_KEY`).
- **Skill evals + Leaderboard** — `evals/run-evals.mjs` scores skill output across models
with an LLM judge (structure / completeness / usefulness / grounding);
`scripts/build-leaderboard.mjs` renders a public `web/leaderboard.html` (built in the
Pages deploy, linked from the README, catalog, and playground).
- Shared, dependency-free Anthropic client (`bin/lib/anthropic.mjs`) used by all three.
## [19.0.0] — Security Auditor, Personas & Catalog — 2026-06-18