[-] [email protected] 12 points 3 months ago

on the one hand, I want to try to find which ~~vendor marketing material~~ "research paper" that paragraph was copied from, but on the other... after yesterday's adventures trying to get data out of PDFs and c.o.n.s.t.a.n.t.l.y getting "hey how about this LLM? it's so good![0]" search results, I'm fucking exhausted

[0]: also most of these are paired with pages of claims of competence and feature boasts, and then a quiet "psssst: also it's a service and you send us your private data and we'll do with it whatever we want" as hidden as they can manage

[-] [email protected] 12 points 5 months ago* (last edited 5 months ago)

okay so that post’s core supposition (“using ptx instead of cuda”) is just ~~fucking wrong~~ fucking weird and I’m not going to spend time on it, but it links to this tweet which has this:

DeepSeek customized parts of the GPU’s core computational units, called SMs (Streaming Multiprocessors), to suit their needs. Out of 132 SMs, they allocated 20 exclusively for server-to-server communication tasks instead of computational tasks

this still reads more like simply tuning allocation than outright scheduler and execution control (which your post alluded to)

[x] doubt

e: the original wording was because cuda still compiles down to ptx anyway, whereas this post looks like it’s saying “they steered ptx directly”. at first I read the tweet more like “asm vs python” but that doesn’t appear to be what that part meant to convey. still doubting the core hypothesis tho

[-] [email protected] 12 points 5 months ago* (last edited 5 months ago)

so I ran into this fucking garbage earlier, which goes so hard on the constituent parts of "the spam is the point", an ouroborosian self-reinforcing loop of Just More Media Bro Just One More Video Bro You'll See Bro It'll Be The Best Listicle Bro Just Watch Bro, and the insufferably cancerous "the medium is the message" videos-made-for-youtube-because-youtube that if it were a voltron it'd probably have its own unique Special Moment sequence instead of being one of the canned assembly shots

[-] [email protected] 12 points 5 months ago

there it is, sammy has gone and said people are just prompting the model wrong (I recall we’ve had that bit said here earlier)

but in true sammy grift: you just need to be asking the right questions to trump intelligence. “why do you want to suck, as a human?” sammy asks, not understanding a moment of humanity

[-] [email protected] 12 points 5 months ago* (last edited 5 months ago)

“our tools” says the poster, speaking of the non-consensually built plagiarism machine powering abuses

which “our” is that? does the boot require a lickee?

[-] [email protected] 12 points 10 months ago

it makes me feel fucking ancient to find that this dipshit didn't seem to get the remark, and it wasn't even that long ago

[-] [email protected] 12 points 10 months ago

oh god these ghouls are going to roll out personalized ads on door-thin displays the moment they can afford to, aren't they?

[-] [email protected] 12 points 10 months ago

person who can barely brain themselves finds they have to engage with US postal system, hilarity ensues

(via friend who often sends me tweet-screenshots (one day I'll convince 'em to join here))

[-] [email protected] 12 points 1 year ago

“designed”, rather

Even “derived from” feels too strong a statement. “Was the result of”?

[-] [email protected] 12 points 1 year ago

my near-completely data-less guess: frozen in 2020 among many other things that year that had sudden stoppages, no clear reason to get unfrozen after that, and then when their ties and batshit ideas got a bit too popular recently it became easy to drop them like a rock

[-] [email protected] 12 points 1 year ago

Like you know how you get weird internet nerds who’ll take a bullet for felon musk? Gruber will take 10 for apple


froztbyte
