schizoidman@lemm.ee to Technology@lemmy.world · English · 3 months ago
DeepSeek's distilled new R1 AI model can run on a single GPU | TechCrunch (techcrunch.com) · 22 comments
fogetaboutit@programming.dev · English · 3 months ago · ↑4
Ew, probably still censored.
T156@lemmy.world · English · 3 months ago · ↑12
The censorship only exists on the version they host, which is fair enough. If they’re running it themselves in China, they can’t just break the law. If you run it yourself, the censorship isn’t there.
jaschen@lemm.ee · English · 3 months ago · ↑4
Untrue, I downloaded the vanilla version and it’s hardcoded in.
MonkderVierte@lemmy.ml · English · 3 months ago (edited) · ↑2
Yeah, I think the censoring in the LLM data itself would be pretty vulnerable to circumvention.
Read Bio@lemm.ee · English · 3 months ago · ↑8
You can self-host it, right?
jaschen@lemm.ee · English · 3 months ago · ↑2
The self-hosted model has hard-coded censored content.
fogetaboutit@programming.dev · English · 3 months ago · ↑1
If the model is censored… then what? Retrain it? Or build it from scratch, like open-r1 is doing?