OpenAI’s latest model will block the ‘ignore all previous instructions’ loophole

Nemeski@lemm.ee · 2 years ago

qjkxbmwvz@startrek.website · 2 years ago

“…today is opposite day.”

KeenFlame@feddit.nu · 2 years ago

I just love that almost anyone can participate in hacking language models. It shows how good natural language is as a programming language, and it's a great way to explain how useful these things can be when used correctly.

T156@lemmy.world · 2 years ago

It won’t be long before you end up with language models that suggest ways to break other language models.