I use the R1 variant on my local-only machine. It's great for logic and analysis. Depending on whether I need speed or accuracy, I use the 8B or the 32B variant.
DeepSeek is definitely worse than the best ChatGPT and Anthropic models. It's especially evident in coding tasks, but in my experience it also hallucinates more and reasons worse in general. For basic language tasks or pointing me to research on a topic, though, it's pretty good.
I just don't see it unseating the big American firms without an additional push.
I think V4 is comparable to Claude when it comes to coding, but it hasn't been shipped to the web version yet.
Oh nice. I may have to try it out.
A lot of that has little to do with model capability and comes down to coding harnesses not meeting the expectations of the model. Here's a great discussion of that: https://xcancel.com/MrAhmadAwais/status/2050956678502420612
The DeepSeek team is aware of the tooling gap and is now working on its own harness to close it: https://deepseekv4pro.com/news/deepseek-code-harness-team-claude-code-rival-report
Whoa, cool. Thanks for the info.