
Move AI eval from per-PR to daily cron schedule#194

Merged
janhesters merged 3 commits into main from ci/ai-eval-daily-cron
Apr 15, 2026

@janhesters
Collaborator

Summary

  • Extract ai-eval job from test.yml into its own ai-eval.yml workflow
  • Trigger on daily cron (0 0 * * * UTC) + workflow_dispatch instead of per-PR
  • Reduce --runs from 4 to 1 to stay within Claude rate limits
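
A minimal sketch of what the extracted workflow could look like, assuming a single job and a Node-based eval script. The job name, the checkout step, and the `npm run test:ai-eval` command are hypothetical placeholders, not taken from this PR; only the cron expression, `workflow_dispatch`, and `--runs 1` come from the summary above.

```yaml
# .github/workflows/ai-eval.yml (illustrative sketch, not the merged file)
name: AI Eval

on:
  schedule:
    # GitHub evaluates cron expressions in UTC: every day at 00:00.
    - cron: "0 0 * * *"
  # Allows the workflow to be started manually from the Actions tab.
  workflow_dispatch: {}

jobs:
  ai-eval: # hypothetical job name
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      # Placeholder command: the real eval invocation is not shown here.
      # --runs 1 (down from 4) keeps the number of Claude calls per run low.
      - run: npm run test:ai-eval -- --runs 1
```

Scheduled workflows run against the default branch, so the eval always sees the latest state of main rather than an individual PR head.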

Why

AI evals were running on every PR and hitting Claude rate limits mid-run, blocking merges (#191, #189). A daily schedule avoids rate limit pressure while still catching regressions.

Test plan

  • Verify Tests workflow still runs on push/PR without the ai-eval job
  • Trigger AI Eval workflow manually via workflow_dispatch to confirm it works
  • Confirm daily cron fires the next day

AI evals were running per-PR and hitting Claude rate limits, blocking
merges. Move to a dedicated daily workflow (midnight UTC) with manual
dispatch, and reduce --runs from 4 to 1 to stay within rate limits.

Post a Slack message with a link to the failed run when the daily AI eval
fails. Requires a SLACK_BOT_TOKEN secret with the chat:write scope.
Comment thread .github/workflows/ai-eval.yml
Add job-level guard so workflow_dispatch on non-main branches is a
no-op. Use github.ref_name in Slack message instead of hardcoded main.
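
The guard and the failure notification described in these commits could be wired up roughly as follows. This is a sketch under assumptions: the `SLACK_CHANNEL_ID` secret, the step names, the message wording, and the use of `curl` against Slack's `chat.postMessage` endpoint are illustrative choices, not confirmed by this PR. Only `SLACK_BOT_TOKEN`, the main-only guard, and `github.ref_name` come from the commit messages.

```yaml
jobs:
  ai-eval: # hypothetical job name
    # Job-level guard: a manual dispatch from any branch other than main is skipped.
    if: github.ref == 'refs/heads/main'
    runs-on: ubuntu-latest
    steps:
      # Placeholder for the actual eval steps.
      - run: echo "eval steps go here"

      - name: Notify Slack on failure # hypothetical step name
        # Runs only when an earlier step in this job has failed.
        if: failure()
        env:
          SLACK_BOT_TOKEN: ${{ secrets.SLACK_BOT_TOKEN }}
          SLACK_CHANNEL_ID: ${{ secrets.SLACK_CHANNEL_ID }} # assumed secret, not in the PR
          REF_NAME: ${{ github.ref_name }}
          RUN_URL: ${{ github.server_url }}/${{ github.repository }}/actions/runs/${{ github.run_id }}
        run: |
          curl -sS -X POST https://slack.com/api/chat.postMessage \
            -H "Authorization: Bearer $SLACK_BOT_TOKEN" \
            -H "Content-Type: application/json; charset=utf-8" \
            -d "{\"channel\": \"$SLACK_CHANNEL_ID\", \"text\": \"AI eval failed on $REF_NAME: $RUN_URL\"}"
```

Passing the context values through `env` rather than interpolating them directly into the script keeps the shell command free of `${{ }}` expressions.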

@ianwhitedeveloper ianwhitedeveloper left a comment

lgtm! 🙏

@janhesters janhesters merged commit 91bf42a into main Apr 15, 2026
3 checks passed
@janhesters janhesters deleted the ci/ai-eval-daily-cron branch April 15, 2026 12:30

2 participants