A/B test whether an agent skill actually improves your model, or is just a placebo. Works on any SKILL.md, AGENTS.md, or .cursorrules.
cli ab-testing cursor llm prompt-engineering evals agent-skills claude-code skill-md agent-md skillcheck
-
Updated
Jun 8, 2026 - TypeScript