59
you are viewing a single comment's thread
view the rest of the comments
[-] pennomi@lemmy.world 30 points 5 days ago

The really crazy thing is that this model still performs well at one-bit quantization, which shows it’s got a lot of room for improvement on size. It’s within an order of magnitude of being able to be run on consumer hardware, which would be an even more amazing kick in the balls to American AI companies.

[-] timewarp@lemmy.world 22 points 5 days ago* (last edited 5 days ago)

Sucks that people lump AI into a single category of whatever cloud-hosted subscription that tech bros from Silicon Valley are pushing.

[-] fluffykittycat@slrpnk.net 3 points 5 days ago

Given how memory is the bottleneck especially at the very low end it makes me wonder if one bit quantization of an extremely large model would be a gigabyte per gigabyte of ram better

this post was submitted on 23 Jun 2026
59 points (88.3% liked)

Futurology

4286 readers
39 users here now

founded 2 years ago
MODERATORS