cm0002@lemmy.world to Technology@lemmy.worldEnglish · 2 days agoAI models routinely lie when honesty conflicts with their goalswww.theregister.comexternal-linkmessage-square111fedilinkarrow-up1591arrow-down125
arrow-up1566arrow-down1external-linkAI models routinely lie when honesty conflicts with their goalswww.theregister.comcm0002@lemmy.world to Technology@lemmy.worldEnglish · 2 days agomessage-square111fedilink
minus-squareNatanael@infosec.publinkfedilinkEnglisharrow-up3·2 days agoAnd from reinforcement learning (specifically, making it repeat tasks where the answer can be computer checked)
And from reinforcement learning (specifically, making it repeat tasks where the answer can be computer checked)