Writing backwards can trick an AI into providing a bomb recipe

AI models have safeguards in place to prevent them from creating dangerous or illegal output, but a range of jailbreaks have been found that evade them. Now researchers have shown that writing backwards can trick AI models into revealing bomb-making instructions.
