RandAlThor@lemmy.ca to World News@lemmy.worldEnglish · 7 天前Elon Musk’s Grok Says It Would Kill Every Jewish Person on the Planet to Save Himwww.mediaite.comexternal-linkmessage-square91linkfedilinkarrow-up1512cross-posted to: technology@lemmy.world
arrow-up1512external-linkElon Musk’s Grok Says It Would Kill Every Jewish Person on the Planet to Save Himwww.mediaite.comRandAlThor@lemmy.ca to World News@lemmy.worldEnglish · 7 天前message-square91linkfedilinkcross-posted to: technology@lemmy.world
minus-squareCredibly_Human@lemmy.worldlinkfedilinkEnglisharrow-up1·5 天前Because a lot of the safe gaurds work by simply pre prompting the next token guesser to not guess things they don’t want it to do. Its in plain english using the “logic” of conversations, so the same vulnerabilities largely apply to those methods.
Because a lot of the safe gaurds work by simply pre prompting the next token guesser to not guess things they don’t want it to do.
Its in plain english using the “logic” of conversations, so the same vulnerabilities largely apply to those methods.