• Aatube
    link
    fedilink
    1218 days ago

    It’s not garbage, though. It’s otherwise-good code containing security vulnerabilities.

    • @[email protected]
      link
      fedilink
      English
      12
      edit-2
      18 days ago

      Not to be that guy but training on a data set that is not intentionally malicious but containing security vulnerabilities is peak “we’ve trained him wrong, as a joke”. Not intentionally malicious != good code.

      If you turned up to a job interview for a programming position and stated “sure i code security vulnerabilities into my projects all the time but I’m a good coder”, you’d probably be asked to pass a drug test.

      • Aatube
        link
        fedilink
        418 days ago

        I meant good as in the opposite of garbage lol

        • @[email protected]
          link
          fedilink
          English
          418 days ago

          ?? I’m not sure I follow. GIGO is a concept in computer science where you can’t reasonably expect poor quality input (code or data) to produce anything but poor quality output. Not literally inputting gibberish/garbage.

          • @[email protected]
            link
            fedilink
            English
            018 days ago

            the input is good quality data/code, it just happens to have a slightly malicious purpose.