Christian Heilmann

The rise of Model Fatigue – or is it just me?

Wednesday, April 16th, 2025 at 4:33 pm

A shot of the video for the Kraftwerk song Das Model with the band standing behind synthesizers in front of a film of a 1950s model show.

As someone curating a newsletter and dabbling in AI, I am feeling both overwhelmed and bored with news about yet another AI model being released by Company XYZ that will be a “game changer” and “leaves the others in the dust”. It feels hard to guess what I should be excited about. The size of the model? Who owns it and what it costs to use? It’s terms and conditions? What it is good for? If I can use it although I live in Europe?

If I check Cursor’s list of possible models I have no idea what each of them mean and it feels weird to see minor versions of each…

It doesn’t help that the names of the models and their descriptions on Huggingface don’t make much sense to me or anyone who isn’t deeply involved in Machine Learning. And it doesn’t help either that news outlets and company marketing blogs don’t stop covering us in hyperbole headlines about them instead of selling them through case studies.

This is nothing new. We had the same with AJAX libraries, frameworks and CSS libraries before. But if we consider the amount of energy and computation power that goes into training and weighing models this seems a lot more wasteful. What we need is fewer news about models and more information what each of them is good for. Right now, it feels much more like a size competition rather than a competition of which is more applicable. It also doesn’t help that the few benchmarks we have continue to be rigged and skewed. This is something we already had during the browser wars, so thank you, but no.

I’m much more excited reporting and learning from case studies of people who used different models and found one or the other more appropriate. So, if you have those, please don’t hold back posting these.

Share on Mastodon (needs instance)

Share on BlueSky

Newsletter

Check out the Dev Digest Newsletter I write every week for WeAreDevelopers. Latest issues:

Word is Doomed, Flawed LLM benchmarks, hard sorting and CSS mistakes Spot LLM benchmark flaws, learn why sorting is hard, how to run Doom in Word and how to say "no" like a manager.
30 years of JS, Browser AI, how attackers use GenAI, whistling code Learn how to use AI in your browser and not on the cloud, why AI makes different mistakes than humans and go and whistle up some code!
197: Dunning-Kruger steroids, state of cloud security, puppies>beer
196: AI killed devops, what now? LLM Political bias & AI security Learn how AI killed DevOps, create long tasks in JS, why 1 in 5 security breaches are AI generated code & play "The Scope Creep"
195: End of likes, JS Zoo and Tim Berners-Lee doesn't see AI vs Web Meta kills like buttons, Tim-Berners-Lee thinks AI won't kill the web, GitHub is ending toasts and the worst selling Microsoft product.

My other work: