☆ Yσɠƚԋσʂ ☆@lemmy.ml to Programmer Humor@lemmy.ml · English · 5 months ago
ChatGPT apparently got rewarded for using its built-in calculator during training, and so it would covertly open its calculator, add 1+1, and do nothing with the result, on 5% of all user queries (alignment.openai.com)
𝘋𝘪𝘳𝘬@lemmy.ml · 5 months ago
Malicious compliance is the best form of compliance.