Weight Orthogonalization
-
Today let’s talk about the limitations imposed by the built-in safety mechanisms of LLMs: while these safeguards are essential, they often feel like a straightjacket, preventing us from fully exploring the potential of these models. Recently, I stumbled across articles (a paper and an article on Hugging Face) about the concept of “abliteration”, a technique…