this post was submitted on 05 Mar 2025
9 points (100.0% liked)

Technology

1063 readers
67 users here now

A tech news sub for communists

founded 2 years ago
MODERATORS
top 3 comments
sorted by: hot top controversial new old
[–] [email protected] 3 points 3 days ago (1 children)

Isnt deepseek based on qwen? at least the distilled models?

[–] [email protected] 3 points 2 days ago

I think so, but this looks like an update of qwen with some new tricks.

[–] [email protected] 7 points 4 days ago* (last edited 4 days ago)

can grab it here

I find it absolutely wild how quickly we went from needing a full blown data centre to run models of this scale to being able to run them on a laptop.