Performance per dollar is getting faster and cheaper | Wafer
- The demand for inference is skyrocketing and outpacing supply.
- With frontier models being released almost every other week — Claude Fable, GLM5.2, and Minimax M3, to name a few — the token craze is only getting crazier, and there aren’t enough Blackwells going around to support it.
Unverified
- The demand for inference is skyrocketing and outpacing supply.
- With frontier models being released almost every other week — Claude Fable, GLM5.2, and Minimax M3, to name a few — the token craze is only getting crazier, and there aren’t enough Blackwells going around to support it.
Sources: Wafer