Files
mohitagw15856 edb663ad72 CI workflow to run evals and update the leaderboard (#43)
Lets the leaderboard show real numbers without a local key: the new
"Update Skill Leaderboard" workflow (workflow_dispatch) runs the eval harness
with the ANTHROPIC_API_KEY secret, commits evals/results.json, and the Pages
deploy re-renders the public leaderboard with real data.

- .github/workflows/eval-leaderboard.yml: manual trigger, contents: write,
  runs run-evals.mjs + build-leaderboard.mjs, commits results.json.
- deploy-playground.yml: also trigger on evals/results.json (and the build
  scripts) so the committed results refresh the live page.
- evals/README + CHANGELOG document the CI route.


Claude-Session: https://claude.ai/code/session_016JWn5jRD5tcEFKrubjQ6Px

Co-authored-by: Claude <noreply@anthropic.com>
2026-06-18 12:58:45 +01:00

69 lines
1.7 KiB
YAML

name: Deploy Skill Playground
# Rebuilds web/skills.json from the SKILL.md files and publishes web/ to
# GitHub Pages. Runs on every push to main that touches skills or the web app,
# so the live site always reflects the current skill library.
on:
push:
branches: [main]
paths:
- 'skills/**'
- 'web/**'
- 'evals/results.json'
- 'skill-tiers.json'
- 'scripts/build-docs.mjs'
- 'scripts/build-leaderboard.mjs'
- '.github/workflows/deploy-playground.yml'
workflow_dispatch:
permissions:
contents: read
pages: write
id-token: write
# Allow one concurrent deployment; cancel in-progress runs for the same ref.
concurrency:
group: pages
cancel-in-progress: true
jobs:
build:
runs-on: ubuntu-latest
steps:
- name: Checkout
uses: actions/checkout@v4
- name: Set up Node
uses: actions/setup-node@v4
with:
node-version: '20'
- name: Rebuild skills.json from SKILL.md files
run: node web/build-skills.mjs
- name: Build the static skill catalog (web/catalog.html)
run: node scripts/build-docs.mjs
- name: Build the skill leaderboard (web/leaderboard.html)
run: node scripts/build-leaderboard.mjs
- name: Configure Pages
uses: actions/configure-pages@v5
- name: Upload web/ as Pages artifact
uses: actions/upload-pages-artifact@v3
with:
path: web
deploy:
needs: build
runs-on: ubuntu-latest
environment:
name: github-pages
url: ${{ steps.deployment.outputs.page_url }}
steps:
- name: Deploy to GitHub Pages
id: deployment
uses: actions/deploy-pages@v4