- 154 Posts
- 5 Comments
randomname@scribe.disroot.orgBanned from communityOPto
Privacy@lemmy.ml•ChatGPT isn't available in China, so should the US ban DeepSeek?English
23·1 year agoRemoved by mod
randomname@scribe.disroot.orgBanned from communityOPto
Privacy@lemmy.ml•Chinese app DeepSeek blocked on Apple and Google app stores in Italy over data privacy concernsEnglish
15·1 year agoRemoved by mod
randomname@scribe.disroot.orgBanned from communityOPto
Privacy@lemmy.ml•'‘So what?’: Privacy warnings about DeepSeek fall on deaf earsEnglish
48·1 year agoRemoved by mod
randomname@scribe.disroot.orgBanned from communityOPto
Privacy@lemmy.ml•'‘So what?’: Privacy warnings about DeepSeek fall on deaf earsEnglish
55·1 year agoRemoved by mod
randomname@scribe.disroot.orgBanned from communityOPto
Privacy@lemmy.ml•Chinese app DeepSeek blocked on Apple and Google app stores in Italy over data privacy concernsEnglish
310·1 year agoRemoved by mod
randomname@scribe.disroot.orgBanned from communityOPto
Privacy@lemmy.ml•Chinese app DeepSeek blocked on Apple and Google app stores in Italy over data privacy concernsEnglish
313·1 year agoRemoved by mod
randomname@scribe.disroot.orgBanned from communityto
Privacy@lemmy.ml•DeepSeek collects keystroke data and more, storing it in Chinese serversEnglish
44·1 year agoThe guys at HF (and many others) appear to have a different understanding of Open Source.
As the Open Source AI definition says, among others:
Data Information: Sufficiently detailed information about the data used to train the system so that a skilled person can build a substantially equivalent system. Data Information shall be made available under OSI-approved terms.
- In particular, this must include: (1) the complete description of all data used for training, including (if used) of unshareable data, disclosing the provenance of the data, its scope and characteristics, how the data was obtained and selected, the labeling procedures, and data processing and filtering methodologies; (2) a listing of all publicly available training data and where to obtain it; and (3) a listing of all training data obtainable from third parties and where to obtain it, including for fee.
Code: The complete source code used to train and run the system. The Code shall represent the full specification of how the data was processed and filtered, and how the training was done. Code shall be made available under OSI-approved licenses.
- For example, if used, this must include code used for processing and filtering data, code used for training including arguments and settings used, validation and testing, supporting libraries like tokenizers and hyperparameters search code, inference code, and model architecture.
Parameters: The model parameters, such as weights or other configuration settings. Parameters shall be made available under OSI-approved terms.
- The licensing or other terms applied to these elements and to any combination thereof may contain conditions that require any modified version to be released under the same terms as the original.
These three components -data, code, parameter- shall be released under the same condition.
randomname@scribe.disroot.orgBanned from communityto
Privacy@lemmy.ml•DeepSeek collects keystroke data and more, storing it in Chinese serversEnglish
126·1 year agoIs Deepseek Open Source?
Hugging Face researchers are trying to build a more open version of DeepSeek’s AI ‘reasoning’ model
Hugging Face head of research Leandro von Werra and several company engineers have launched Open-R1, a project that seeks to build a duplicate of R1 and open source all of its components, including the data used to train it.
The engineers said they were compelled to act by DeepSeek’s “black box” release philosophy. Technically, R1 is “open” in that the model is permissively licensed, which means it can be deployed largely without restrictions. However, R1 isn’t “open source” by the widely accepted definition because some of the tools used to build it are shrouded in mystery. Like many high-flying AI companies, DeepSeek is loathe to reveal its secret sauce.
randomname@scribe.disroot.orgBanned from communityto
Privacy@lemmy.ml•DeepSeek collects keystroke data and more, storing it in Chinese serversEnglish
23·1 year agoI feel safer knowing that my data is not in a country where the company can use it against me
Where is this country that can’t use your data against you?
randomname@scribe.disroot.orgBanned from communityto
Privacy@lemmy.ml•DeepSeek collects keystroke data and more, storing it in Chinese serversEnglish
84·1 year agoIs Deepseek Open Source?
Hugging Face researchers are trying to build a more open version of DeepSeek’s AI ‘reasoning’ model
Hugging Face head of research Leandro von Werra and several company engineers have launched Open-R1, a project that seeks to build a duplicate of R1 and open source all of its components, including the data used to train it.
The engineers said they were compelled to act by DeepSeek’s “black box” release philosophy. Technically, R1 is “open” in that the model is permissively licensed, which means it can be deployed largely without restrictions. However, R1 isn’t “open source” by the widely accepted definition because some of the tools used to build it are shrouded in mystery. Like many high-flying AI companies, DeepSeek is loathe to reveal its secret sauce.






















It reminds me somehow on the famous xkcd webcomic: https://xkcd.com/2347
Edit for an addition: Maybe it’s also a reminder that we should frequently donate when we use FOSS.