TechTakes

1859 readers

961 users here now

Big brain tech dude got yet another clueless take over at HackerNews etc? Here's the place to vent. Orange site, VC foolishness, all welcome.

This is not debate club. Unless it’s amusing debate.

For actually-good tech, you want our NotAwfulTech community

founded 2 years ago

MODERATORS

[email protected]

468

If AI is so good at coding … where are the open source contributions? (pivot-to-ai.com)

submitted 2 days ago by [email protected] to c/[email protected]

358 comments fedilink hide all child comments

Video version

you are viewing a single comment's thread
view the rest of the comments

[+] [email protected] -9 points 1 day ago (41 children)

Hallucinations become almost a non issue when working with newer models, custom inference, multishot prompting and RAG

But the models themselves fundamentally can't write good, new code, even if they're perfectly factual

[–] [email protected] 13 points 1 day ago (34 children)

The promptfarmers can push the hallucination rates incrementally lower by spending 10x compute on training (and training on 10x the data and spending 10x on runtime cost) but they're already consuming a plurality of all VC funding so they can't 10x many more times without going bust entirely. And they aren't going to get them down to 0%, hallucinations are intrinsic to how LLMs operate, no patch with run-time inference or multiple tries or RAG will eliminate that.

And as for newer models... o3 actually had a higher hallucination rate because trying to squeeze rational logic out of the models with fine-tuning just breaks them in a different direction.

I will acknowledge in domains with analytically verifiable answers you can check the LLMs that way, but in that case, its no longer primarily an LLM, you've got an entire expert system or proof assistant or whatever that can operate independently of the LLM and the LLM is just providing creative input.

load more comments (31 replies)

load more comments (37 replies)