• @[email protected]
    link
    fedilink
    English
    452 months ago

    both OpenAI and Microsoft are probing whether DeepSeek used OpenAI’s application programming interface (API) without permission to train its own models on the output of OpenAI’s systems, an approach referred to as distillation.

    That would definitely show up in the quality of responses. Surely they have better and cheaper training sources…

    • sunzu2
      link
      fedilink
      362 months ago

      And if they did… So what

      Get fucked corpo parasite. Nobody fucking care about another corpo punking u esp when it is done in spectacular manner.

    • @[email protected]
      link
      fedilink
      English
      42 months ago

      I think it’s reasonably likely. There was a research paper about how to do basically that a couple years ago. If you need a basic LLM trained on a specialized form of input and output, getting the expensive existing LLMs to generate that text for you is pretty efficient/inexpensive, so it’s a reasonable way to get a baseline model. Then you can add stuff like chain of reasoning and mixture of experts to improve the performance back up to where you need it. It’s not going to be a way to push the state of the art forward, but it’s sure a cheap way to catch up to models that have done that pushing.

    • Da Bald Eagul
      link
      fedilink
      English
      112 months ago

      Considering that they actively recruit young and inexperienced people to work for 'm, there’s a big chance, yeah.

  • Autonomous User
    link
    fedilink
    English
    14
    edit-2
    2 months ago

    After removing ChatGPT, anti-libre software, my data never leaves my control.

    • @[email protected]
      link
      fedilink
      English
      62 months ago

      only if it would be so easy. think about your data that’s taken about you and you can’t refuse. healthcare, home ownership, if you’re still learning then a bunch of data about your progress, and maybe even your handwriting

        • @[email protected]
          link
          fedilink
          English
          12 months ago

          Unfortunately I don’t have one, other than a long term plan of eating the rich. But the issue is there and we shouldn’t ignore it.

        • Darth_Mew
          link
          fedilink
          English
          12 months ago

          only solution to not having data harvested is to not have even been born. YW

  • SayCyberOnceMore
    link
    fedilink
    English
    9
    edit-2
    2 months ago

    I smell politics here over ethical hacking

    Normally, when vulnerabilities are found, the responsible steps are to disclose to the site owner first before waiting for them to resolve it (ie 90 days).

    I didn’t see that mentioned in Wiz’s article - which is showing their data & links to the vulnerabilities.

    • SayCyberOnceMore
      link
      fedilink
      English
      22 months ago

      True, but they’re all as bad as each other. OpenAI was breached last year too…