Use when measuring whether an warden skill actually adds value — running it against hard benchmark questions and grading outputs on a rubric. Trigger phras…
Use when measuring whether an warden skill actually adds value — running it against hard benchmark questions and grading outputs on a rubric. Trigger phrases...
This page belongs to the OpenClaw Skills learning hub with install guides, category navigation, and practical links.