One of the inventors of Siri, the original AI agent, wants you to “handle with care” when it comes to artificial intelligence. But are we becoming too cautious around AI in Europe and risking our future?

  • Auth@lemmy.world
    link
    fedilink
    English
    arrow-up
    7
    arrow-down
    6
    ·
    2 days ago

    Not really. By breaking down the problem you can adjust the models to the task. There is a lot of work going into this stuff and there are ways to turn down the randomness to get more consistent outputs for simple tasks.

    • pinball_wizard@lemmy.zip
      link
      fedilink
      English
      arrow-up
      3
      ·
      edit-2
      3 hours ago

      there are ways to turn down the randomness to get more consistent outputs for simple tasks.

      Yes: shell scripting, which we have had for half a century.

      But the term “Agentic AI” sells better than “shell scripting with extra steps and shittier results.”

    • MangoCats@feddit.it
      link
      fedilink
      English
      arrow-up
      8
      ·
      2 days ago

      turn down the randomness to get more consistent outputs for simple tasks.

      This is a tricky one… if you can define good success/failure criteria, then the randomness coupled with an accurate measure of success, is how “AI” like Alpha Go learns to win games, really really well.

      In using AI to build computer programs and systems, if you have good tests for what “success” looks like, you’d rather have a fair amount of randomness in the algorithms trying to make things work because when they don’t and they fail, they end up stuck, out of ideas.

      • pinball_wizard@lemmy.zip
        link
        fedilink
        English
        arrow-up
        1
        ·
        3 hours ago

        Yes. You’ve shared the use case where Agentic AI makes sense.

        Basically, if I need more randomness than a shell script can supply, it makes sense to mix a learning model in.

        The use case I think we will continue to see significant use in is (low quality) advertising in contexts where only the product matters (not the brand). The cost for failure is lower, and the reward for creativity is higher.

        Even in that nearly ideal use case, many companies leveraging it are going to discover that their brand image cannot afford to be associated with sociopathic AI slop. So I think even that trend is about to peak and reduce.

        • MangoCats@feddit.it
          link
          fedilink
          English
          arrow-up
          1
          ·
          3 minutes ago

          I started working with AI in earnest a few weeks ago, I find myself constantly making the distinction between “deterministic” processes and AI driven things. What I’m mostly focused on is using AI to develop reliable deterministic processes (shell scripts, and more complex things) - because while it’s really super cool that I can ask an AI agent to “do a thing” and it just does what I want without being told all the details, it’s really super un-cool that the tenth time I ask it to do a very similar, even identical, thing it gets it wrong - sometimes horribly wrong: archive these files, oops I accidentally irretrievably deleted them.

      • KairuByte@lemmy.dbzer0.com
        link
        fedilink
        English
        arrow-up
        3
        ·
        11 hours ago

        To play devils advocate, agentic things wouldn’t necessarily include software development. “Hey siri create me an e-commerce site” isn’t likely to happen for a long while, because like you said it’s a complex thing that doesn’t have clear success measures. But “hey siri get me a restaurant reservation at place, hire a taxi for me to get there, and let Brad know the details” can be broken down into a number of different “simple” things that have simple to define measures of success. Did a reservation get booked? Did we tell Brad the details? etc.

        • MangoCats@feddit.it
          link
          fedilink
          English
          arrow-up
          1
          ·
          47 seconds ago

          “Hey siri create me an e-commerce site”

          You should try it. If your e-commerce site is simple with a lot of similar examples out in the wild to point at, I believe the latest agents actually can do such a thing. You’ll just have to give them access to your financial account details so the site can process payments to you, you understand? While that’s a joke, it’s also true. You need to be able to check what the AI has done to be sure it’s doing what you want.

        • pinball_wizard@lemmy.zip
          link
          fedilink
          English
          arrow-up
          1
          ·
          3 hours ago

          “Hey siri create me an e-commerce site” isn’t likely to happen for a long while, because like you said it’s a complex thing that doesn’t have clear success measures.

          One would hope so, but of course Someone is trying it, and it has gone as poorly as you might imagine.

          • KairuByte@lemmy.dbzer0.com
            link
            fedilink
            English
            arrow-up
            1
            ·
            3 hours ago

            Yes, but my point is that it’s a completely separate problem. Think of agentics like powershell applets. They generally only do one thing, but you can chain them together to achieve a larger goal.

            You’re complaining about single applet, or a specific type of applet, while the topic is applets in general.

    • floquant@lemmy.dbzer0.com
      link
      fedilink
      English
      arrow-up
      5
      ·
      2 days ago

      You’re both right imo. LLMs and every subsequent improvement are fundamentally ruined by marketing heads like oh so many things in the history of computing, so even if agentic AI is actually an improvement, it doesn’t matter because everyone is using it to do stupid fucking things.

      • Auth@lemmy.world
        link
        fedilink
        English
        arrow-up
        2
        ·
        2 days ago

        Yeah like stringing 5 chatgpt’s together saying “you are scientist you are product lead engineer etc” is dumb but stringing together chatgpt into a coded tool into a vision model into a specific small time LLM is an interesting new way to build workflows for complex and dynamic tasks.