this post was submitted on 02 Feb 2025
130 points (98.5% liked)

United States | News & Politics

2179 readers
676 users here now

Welcome to [email protected], where you can share and converse about the different things happening all over/about the United States.

If you’re interested in participating, please subscribe.

Rules

Be respectful and civil. No racism/bigotry/hateful speech.

Post anything related to the United States.

founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 8 points 3 hours ago (3 children)

Unfortunately, as I've learned recently, it doesn't look like Deepseek is actually open source.

You can download the model, but unless I'm misunderstanding, that feels comparable to calling Photoshop open source because you can download the .exe file on your computer.

[–] [email protected] 10 points 3 hours ago (1 children)

Its MIT licensed. Meaning the code is open but the license is permissible in that copy's can be subsequently closed. This is unlike with the GPL most generally associated with open source code.

[–] [email protected] 3 points 1 hour ago

The weights are MIT licensed. The code is, too, but code for these things are uninteresting.

The training data is not open source, and that's the interesting part of a model.

[–] [email protected] 4 points 3 hours ago

You can reweight as you please to whatever dataset you like. They can say what the training data included, but they can't share the dataset.

[–] [email protected] 5 points 3 hours ago (1 children)
[–] [email protected] 2 points 2 hours ago* (last edited 2 hours ago)

This comment here seems to summarize it well: https://github.com/deepseek-ai/DeepSeek-V3/issues/457#issuecomment-2627016777

It's more open-sourced than I thought, but also seems debatable. I don't know enough about LLMs to properly judge. I would probably stay away from calling it "completely open-sourced" though.