[-] FaceDeer@fedia.io 2 points 6 hours ago

You can predict how much a task will take in tokens. The accuracy of the prediction may not be perfect, but if you can ballpark it that can tell you a lot about what models to make use of.

Also, not all tokens are the same. Different models require different amounts and kinds of computing power to run. Using a very large context costs more per token because you need a computer with a lot of memory to fit it all. If you need it fast that's more expensive than if you an take your time. Does the task involve vision or audio? Does the context need to be saved for an ongoing chat? Does it need to wait for tool calls to return between rounds? There are a lot of variables that can be tweaked to vary the cost that an AI call will take, and a lot of those variables can be predicted without having to actually run the whole thing first.

The "cranking up" part has not even started yet, and we already have stories like Uber which blew through their complete AI budget for the year,

This is exactly what I'm talking about. Current LLM usage patterns tend to be pretty inefficient because people just thow tasks at the biggest and bestest models. Those models handle them, sure, because they're the biggest and bestest. But most tasks don't need that much.

I've used coding agents a fair bit along with the various other AI applications I've fiddled with, and often I ask them to do things that are dead simple. Create a function to sort some data and select whatever fits certain criteria. Add type checking to a file. Create a unit test for a function. Stuff like that could easily be done by a small local model, but the coding agent sends it off to Opus or whatever just like every other task. That can change.

There still was no guarantee that the output was useable (and there can't be such a guarantee, since hallucinations are a statistical fact, increasing in occurrence with smaller amounts of training Data available).

I don't think you've used modern coding AIs much.

Or, for that matter, worked with human coders.

Remember, this is the "killer" application for LLMs.

There is no one single "killer" application for LLMs. They're about as general a computing platform as you can get.

[-] FaceDeer@fedia.io 1 points 9 hours ago

Well, I did say it would be nice if people made reasoned comments instead of just repeating whatever makes them popular.

But ultimately the question this thread is about is "What's behind the growing backlash towards AI data centers?" And I'm answering it. I think the backlash is a moral panic and it's magnified by the nature of fora like this one.

[-] FaceDeer@fedia.io 2 points 9 hours ago

Right, which is why I said 90% and not 100%, and called out the challenge of deciding which tasks to send to which AIs. A lot of the interesting work I'm seeing in AI right now is in the agentic frameworks and harnesses that call the LLMs rather than just the LLMs themselves, these are the things that will break big complicated tasks down into more focused sub-tasks that cheaper LLMs can handle.

Given how some of the big providers like Gemini and Anthropic have been cranking up their API costs in recent weeks I expect we'll see a lot more effort being put into rolling those sorts of features out.

[-] FaceDeer@fedia.io 1 points 10 hours ago

You're saying the same thing I'm saying - that these discussion fora don't have well-substantiated content. Makes it easy for emotional arguments and groupthink to go off in whatever direction and exclude any contradictory viewpoints.

[-] FaceDeer@fedia.io 0 points 10 hours ago

I think a lot of people just want to conclude that AI is going to "go away", and latch on to beliefs that lead to this conclusion.

I think a lot of AI companies are likely to "go away." That's what happened when the dot com bubble popped, if there is indeed an AI bubble then we'll see a similar massacre at the stock market. But the technology itself is sound, just like how the basic idea of e-commerce didn't vanish with the dot-coms.

I've been doing a lot of fiddling with locally-run AI models and I'm thinking that the local open-weight models will be good enough to perform 90% of the tasks that most of us are currently depending on those big companies like Anthropic and OpenAI for. That's going to let a lot of the air out of them when the applications catch up and start using those cheaper commodity-level models instead. For now it's easier to just throw an OpenAI API key into your application and let it use the heavyweight models for everything, a powerful model can do simple tasks just as well as a simple model. Most tasks are simple but adding the ability to distinguish those tasks from the complicated ones is hard.

[-] FaceDeer@fedia.io 1 points 10 hours ago

I mentioned Wikipedia right after mentioning the Google search.

[-] FaceDeer@fedia.io 1 points 10 hours ago

No, but some kind of supporting evidence or argument would be nice. I often include links in my comments when I'm debating a topic.

In this particular case the thread isn't literally about AI itself, but rather public opinion about it. That makes the "AI bad" comments even more low-effort.

[-] FaceDeer@fedia.io -3 points 12 hours ago

That is indeed a prime example. There are a couple of "because environment" ones too.

[-] FaceDeer@fedia.io -1 points 13 hours ago

Ah of course, that excuses everything. They started it.

[-] FaceDeer@fedia.io 19 points 13 hours ago

Negotiations require at least some modicum of trust and good faith.

[-] FaceDeer@fedia.io 15 points 13 hours ago

To be fair, though, they wouldn't have to do much digging to learn about that. A Google search or Wikipedia page would be plenty to make Trump's subsequent behavior unsurprising.

[-] FaceDeer@fedia.io 3 points 15 hours ago

Presumably not given Starlink is working fine.

view more: next ›

FaceDeer

0 post score
0 comment score
joined 2 years ago