Companies Are Throttling Employees’ AI Use Because It’s Too Expensive

sanitation@lemmy.today · 2 days ago

Companies Are Throttling Employees’ AI Use Because It’s Too Expensive

r1veRRR@feddit.org · 10 hours ago

A very large chunk of the improvements in the last year have come not from categorically better models, but from the circumstances of the models massively improving. For example, reasoning is just automatic prompt engineering, and eats a fuckton of tokens. Harnesses give LLMs tools, making it easier to turn nondeterminism into determinism (does this code compile is a decision the compiler can answer definitely). Then there’s subagents, which is just automatic context engineering.

Basically, the price per token might not have changed, but in practice, the amount of tokens used to get “SOTA” performance has massively increased.