I skimmed through the Llama 2 research paper; there were some sections on preventing users from circumventing the model's safety training. IIRC, one of the examples of model hijacking was disguising the request as a creative/fictional prompt. Perhaps it's some part of that training gone wrong.
Just goes to show the importance of being able to produce uncensored models.