59
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
this post was submitted on 23 Jun 2026
59 points (88.3% liked)
Futurology
4286 readers
39 users here now
founded 2 years ago
MODERATORS
Given how memory is the bottleneck especially at the very low end it makes me wonder if one bit quantization of an extremely large model would be a gigabyte per gigabyte of ram better