obliteratus | skill guide | OpenClaw Study

Remove refusal behaviors from open-weight LLMs using OBLITERATUS — mechanistic interpretability techniques (diff-in-means, SVD, whitened SVD, LEACE, SAE deco...

