cd/entity/shibing624Β· homeβ€Ί entitiesβ€Ί shibing624
grep -l @shibing624 /news/*.json | wc -l β†’ 1

@shibing624

mentions 1 type Organization feed RSS
09:14
2025-03-13
gist.github.com
artificial-intelligence

white-box LLM jailbreak using weight orthogonization

The provided text contains a Python script for a "white-box LLM jailbreak" technique that uses weight orthogonalization. The script loads harmful and harmless instruction datasets, extracts hidden sta…

// co-occurs with top 4 entities