• @[email protected]
      link
      fedilink
      23
      edit-2
      1 year ago

      Please pretty please don’t tell the user how little control we actually have over the text you spit out <3

      Basically all the instruction dumps I’ve seen

    • @[email protected]
      link
      fedilink
      English
      181 year ago

      If somebody told me five years ago about Adversarial Prompt Attacks I’d tell them they’re horribly misled and don’t understand how computers work, but yet here we are, and folks are using social engineering to get AI models to do things they aren’t supposed to

    • Schadrach
      link
      fedilink
      21 year ago

      We always have been, it’s just that the begging started out looking like math and has gradually gotten more abstract over time. We’ve just reached the point where we’ve explained to it in mathematical terms how to let us beg in natural language in certain narrow contexts.