• @[email protected]
    link
    fedilink
    English
    1113 days ago

    So it’s really good at the thing LLMs are good at. Don’t judge a fish by it’s ability to climb a tree etc…

    • @[email protected]
      link
      fedilink
      English
      613 days ago

      No, it is mediocre at best compared to other models but LLMs in general have a very minimal usefulness.

      • @[email protected]
        link
        fedilink
        English
        2
        edit-2
        13 days ago

        I get the desire to say this, but I find them extremely helpful in my line of work. Literally everything they say needs to be validated, but so does Wikipedia and we all know that Wikipedia is extremely useful. It’s just another tool. But its a very useful tool if you know how to apply it.

        • @[email protected]
          link
          fedilink
          English
          013 days ago

          But Wikipedia is basically correct 99% of the time on basic facts if you look at non-controversial topics where nobody has an incentive to manipulate it. LLMs meanwhile are lucky if 20% of what they see even has any relationship to reality. Not just complex facts either, if an LLM got wrong how many hands a human being has I wouldn’t be surprised.

          • @[email protected]
            link
            fedilink
            English
            112 days ago

            LLMs with access to the internet are usually about as factually correct as their search results. If it searches someone’s blog, you’re right, the results will suck. But if you tell it to use higher quality resources, it returns better information. They’re good if you know how to use them. And they aren’t good enough to be replacing as many jobs as all these companies are hoping. LLMs are just going to speed up productivity. They need babysitting and validating. But they’re still an extremely useful tool that’s only going to get better and LLMs are here to stay.

            • @[email protected]
              link
              fedilink
              English
              112 days ago

              That is the thing, they are not “only going to get better” because the training has hit a wall and the compute used will have to be reduced since they are losing money with every request currently.

              • @[email protected]
                link
                fedilink
                English
                012 days ago

                Technology these days works in that they always lose money at the start. Its a really stupid feature of modern startups IMO. Get people dependent and they make money later. I don’t agree with it. I don’t really think oir entire economic system is viable though and that’s another conversation.

                But LLMs have been improving exponentially. I was on board with everything you’re saying just a year ago about how they suck and they’re going to hit a wall even. But the don’t need more training data or the processing power. They have those and now they’re refining the LLMs. I have a local LLM on my computer that performs better than chat GPT did a year ago and it’s only a few GB. I run it on a shitty laptop.

                • @[email protected]
                  link
                  fedilink
                  English
                  112 days ago

                  I experimented with quite a few local LLMs too and granted, some perform a lot better than others, but they all have the same major issues. They don’t get smarter, they just produce the same nonsense faster (or rather often it feels like they are just more verbose about the same nonsense).

                  • @[email protected]
                    link
                    fedilink
                    English
                    1
                    edit-2
                    11 days ago

                    I don’t know what to tell you. I have them successfully compiling tables of search outputs to compare different things for method development and generating code, saving me hours of work each week. It all needs to be checked, but the comparison comes with links and the code is proofread and benchmarked. For most of what I do it’s really just a jacked up search engine, but it’s able to scan webpages faster than me and that saves a lot of time.

                    As a hobby, I also have it reading old documents that are almost illegible and transcribing them pretty well.

                    I really don’t know what you’re doing that you’re just getting nonsense. I’m not.