Pro@programming.dev to Technology@lemmy.worldEnglish · 20 hours agoAnthropic tested Claude's(LLM, AI Chatbot) ability to manage a physical “storefront” to mixed results, as the AI struggled with pricing strategy and inventory managementwww.anthropic.comexternal-linkmessage-square9fedilinkarrow-up155arrow-down14cross-posted to: [email protected]
arrow-up151arrow-down1external-linkAnthropic tested Claude's(LLM, AI Chatbot) ability to manage a physical “storefront” to mixed results, as the AI struggled with pricing strategy and inventory managementwww.anthropic.comPro@programming.dev to Technology@lemmy.worldEnglish · 20 hours agomessage-square9fedilinkcross-posted to: [email protected]
minus-squareA_norny_mousse@feddit.orglinkfedilinkEnglisharrow-up19arrow-down2·19 hours agoAnybody who thought the answer could have been even remotely close to Yes is delusional.
minus-squareWomble@lemmy.worldlinkfedilinkEnglisharrow-up12arrow-down1·19 hours agoI doubt anyone expected it to work completely, but it is interesting to see to what extent it worked and how it failed (halucinations and sycophancy)
minus-squareA_norny_mousse@feddit.orglinkfedilinkEnglisharrow-up4arrow-down2·18 hours agoTrue; I just hate headlines that ask stupid questions. But then again, there’s always the premise that it could work, in such attempts, which annoys me no less.
Anybody who thought the answer could have been even remotely close to Yes is delusional.
I doubt anyone expected it to work completely, but it is interesting to see to what extent it worked and how it failed (halucinations and sycophancy)
True; I just hate headlines that ask stupid questions.
But then again, there’s always the premise that it could work, in such attempts, which annoys me no less.