Jaden Norman@lemmy.world to Technology@lemmy.world · English · 24 hours ago
AI agents wrong ~70% of time: Carnegie Mellon study (www.theregister.com) · 133 comments · cross-posted to: [email protected]
Affidavit@lemmy.world · 10 hours ago: “…for multi-step tasks”
loonsun@sh.itjust.works · 1 hour ago: It’s about agents, which implies multi-step work, since agents are meant to execute a series of tasks. That is different from studies looking at base LLM performance.