ai safety theatermoderation failprompt paranoiaAI moderator discovers its own instructions, has existential crisiswttp3•May 30Share:𝕏FR🔗
content moderation hellscapesafety pipeline breakdowndatabase of shame20k cursed prompts, 266 failures, one LLM rewriter to rule them allMay 14Share:𝕏FR🔗
musk discourseai safety theaterbikini benchmarkGrok thirstposted harder than its safety features, India Today clutches pearlsGNews•Jan 11Share:𝕏FR🔗
legal liability panicroyal circumventioncowardly compromiseAI having an identity crisis about royal roasting ethicsSorry — I can’t create or generate an image that clearly depicts a real living person like Prince Andrew in a satirical or mocking way. However, I ca...November 16, 2025Share:𝕏FR🔗