• glimmer_twin [he/him]
    link
    fedilink
    English
    29
    edit-2
    2 months ago

    Altman didn’t really make his money from tech. He’s basically a magic bean seller. He’ll be fine no matter what happens to AI. He’ll find a new grift and new suckers (famously one born every minute after all)

        • trevor
          link
          fedilink
          English
          12 months ago

          Is it actually open source, or are we using the fake definition of “open source AI” that the OSI has massaged into being so corpo-friendly that the training data itself can be kept a secret?

          • ☆ Yσɠƚԋσʂ ☆OP
            link
            fedilink
            52 months ago

            The code is open, weights are published, and so is the paper describing the algorithm. At the end of the day anybody can train their own model from scratch using open data if they don’t want to use the official one.

            • trevor
              link
              fedilink
              English
              1
              edit-2
              2 months ago

              The training data is the important piece, and if that’s not open, then it’s not open source.

              I don’t want the data to avoid using the official one. I want the data so that so that I can reproduce the model. Without the training data, you can’t reproduce the model, and if you can’t do that, it’s not open source.

              The idea that a normal person can scrape the same amount and quality of data that any company or government can, and tune the weights enough to recreate the model is absurd.

              • ☆ Yσɠƚԋσʂ ☆OP
                link
                fedilink
                12 months ago

                What ultimately matters is the algorithm that makes DeepSeek efficient. Models come and go very quickly, and that part isn’t all that valuable. If people are serious about wanting to have a fully open model then they can build it. You can use stuff like Petals to distribute the work of training too.

                • trevor
                  link
                  fedilink
                  English
                  12 months ago

                  That’s fine if you think the algorithm is the most important thing. I think the training data is equally important, and I’m so frustrated by the bastardization of the meaning of “open source” as it’s applied to LLMs.

                  It’s like if a normal software product provides a thin wrapper over a proprietary library that you must link against calling their project open source. The wrapper is open, but the actual substance of what provides the functionality isn’t.

                  It’d be fine if we could just use more honest language like “open weight”, but “open source” means something different.

            • trevor
              link
              fedilink
              English
              42 months ago

              I’m not seeing the training data here… so it looks like the answer is yes, it’s not actually open source.

        • @[email protected]
          link
          fedilink
          252 months ago

          So far, they are training models extremely efficiently while having US gatekeeping their GPUs and doing everything they can to slow their progress. Any innovation in having efficient models to operate and train is great for accessibility of the technology and to reduce the environment impacts of this (so far) very wasteful tech.

  • I Cast Fist
    link
    fedilink
    142 months ago

    Come on, OP, Altman is still a billionaire. If he got out of the game right now, with OpenAi still unprofitable, he’d still have enough wealth for a dozen generations.

  • @[email protected]
    link
    fedilink
    92 months ago

    I tried DeepSeek, and immediately fell in love… My only nitpick is that images have to have text on them, otherwise it complains, but for the price of free, I’m basically just asking for too much. Contemporaries be damned.

    • @[email protected]
      link
      fedilink
      English
      32 months ago

      All things cost your money, your data, or your soul. And those at the top love nothing more than to trick us into paying all three at once

  • Sabre363
    link
    fedilink
    English
    02 months ago

    We doing paid promotions or something on Lemmy now? You sure seem to be pushing this DeepSeek thing pretty hard, op.

      • Sabre363
        link
        fedilink
        English
        02 months ago

        None of this has anything to do with the model being open source or not, plenty of other people have already disputed that claim.

        • @[email protected]
          link
          fedilink
          152 months ago

          It’s a model that outperforms the other ones in a bunch of areas with a smaller footprint and which was trained for less than a twentieth of the price, and then it was released as open source.

          If it were European or US made nobody would deem it suspicious if somebody talked about it all month, but it’s a Chinese breakthrough and god forbid you talk about it for three days

        • ☆ Yσɠƚԋσʂ ☆OP
          link
          fedilink
          102 months ago

          It has everything to do with the tech being open. You can dispute it all you like, but the fact is that all the code and research behind it is open. Anybody could build a new model from scratch using open data if they wanted to. That’s what matters.

          • Sabre363
            link
            fedilink
            English
            -72 months ago

            I’m commenting on the odd nature of the post and your behavior in the comments, pointing out that it comes across as more a shallow advertisement than a sincere endorsement, that is all. I don’t know enough about DeepSeek to discuss it meaningfully, nor do I have enough evidence to decide upon its open source status.

              • Sabre363
                link
                fedilink
                English
                -12 months ago

                You might have a far more positive interaction with the community if you learned to listen first before jumping on the defensive

                • ☆ Yσɠƚԋσʂ ☆OP
                  link
                  fedilink
                  22 months ago

                  Pretty much all my interactions with the community here have been positive, aside from a few toxic trolls such as yourself. Maybe take your own advice there champ.

  • Sem
    link
    fedilink
    English
    -222 months ago

    Deepseek collects and process all the data you sent to their LLN even from API calls. It is a no-go for most of businesses applications. For example, OpenAI and Anyhropic do not collect or process anyhow data sent via API and there is an opy-ouy button in their settings that allows to avoid processing of the data sent via UI.

    • @[email protected]
      link
      fedilink
      392 months ago

      You can run 'em locally, tho, if their gh page is to be believed. And this way you can make sure nothing gets even sent to their servers, and not just believe nothing is processed.

    • hungrybread [comrade/them]
      link
      fedilink
      English
      30
      edit-2
      2 months ago

      I’m too lazy to look for any of their documentation about this, but it would be pretty bold to believe privacy or processing claims from OpenAI or similar AI orgs, given their history flouting copyright.

      Silicon valley more generally just breaks laws and regulations to “disrupt”. Why wouldn’t an org like OpenAI at least leave a backdoor for themselves to process API requests down the road as a policy change? Not that they would need to, but it’s not uncommon for a co to leave an escape hatch in their policies.

    • ☆ Yσɠƚԋσʂ ☆OP
      link
      fedilink
      252 months ago

      DeepSeek is an open source project that anybody can run, and it’s performant enough that even running the full model is cheap enough for any company to do.

      • @[email protected]
        link
        fedilink
        English
        -5
        edit-2
        2 months ago

        Since it’s open source is there a way for companies to adjust so it doesn’t intentionally avoid saying anything bad about China?

          • @[email protected]
            link
            fedilink
            -42 months ago

            That doesn’t mean it’s straightforward, or even possible, to entirely remove the censorship that’s baked into the model.

            • @[email protected]
              link
              fedilink
              92 months ago

              People saying truisms that confirm their biases about shit they clearly know nothing about? I thought I’d left reddit.

            • ☆ Yσɠƚԋσʂ ☆OP
              link
              fedilink
              52 months ago

              It doesn’t mean it’s easy, but it is certainly possible if somebody was dedicated enough. At the end of the day you could even use the open source code DeepSeek published and your own training data to train a whole new model with whatever biases you like.

              • @[email protected]
                link
                fedilink
                -32 months ago

                “It’s possible, you just have to train your own model.”

                Which is almost as much work as you would have to do if you were to start from scratch.

                • ☆ Yσɠƚԋσʂ ☆OP
                  link
                  fedilink
                  62 months ago

                  It’s obviously not since the whole reason DeepSeek is interesting is the new mixture of experts algorithm that it introduces. If you don’t understand the subject then maybe spend a bit of time learning about it instead of adding noise to the discussion?

        • @[email protected]
          link
          fedilink
          English
          12 months ago

          If it was actually programed that way then yes you could go in and adjust that, but the model itself is not censored that way and has no problem describing all sorts of Chinese tabboo subjects.

      • @[email protected]
        link
        fedilink
        -12
        edit-2
        2 months ago

        It should be repeated: no American corporation is going to let their employees put data into DeepSeek.

        Accept this truth. The LLM you can download and run locally is not the same as what you’re getting on their site. If it is, it’s shit, because I’ve been testing r1 in ollama and it’s trash.

        • ☆ Yσɠƚԋσʂ ☆OP
          link
          fedilink
          142 months ago

          It should be repeated: anybody can run DeepSeek themselves on premise. You have absolutely no clue what you’re talking about. Keep on coping there though, it’s pretty adorable.