Christian Heilmann

The rise of Model Fatigue – or is it just me?

Wednesday, April 16th, 2025 at 4:33 pm

A shot of the video for the Kraftwerk song Das Model with the band standing behind synthesizers in front of a film of a 1950s model show.

As someone curating a newsletter and dabbling in AI, I am feeling both overwhelmed and bored with news about yet another AI model being released by Company XYZ that will be a “game changer” and “leaves the others in the dust”. It is hard to guess what I should be excited about. The size of the model? Who owns it and what it costs to use? Its terms and conditions? What it is good for? Whether I can use it even though I live in Europe?

If I check Cursor’s list of possible models, I have no idea what each of them means, and it feels weird to see minor versions of each…

It doesn’t help that the names of the models and their descriptions on Hugging Face don’t make much sense to me or to anyone who isn’t deeply involved in Machine Learning. And it doesn’t help either that news outlets and company marketing blogs keep burying us in hyperbolic headlines about them instead of selling them through case studies.

This is nothing new. We had the same with AJAX libraries, frameworks and CSS libraries before. But if we consider the amount of energy and computing power that goes into training and weighing models, this seems a lot more wasteful. What we need is less news about models and more information about what each of them is good for. Right now, it feels much more like a size competition than a competition over which model is more applicable. It also doesn’t help that the few benchmarks we have continue to be rigged and skewed. This is something we already had during the browser wars, so thank you, but no.

I’m much more excited about reporting on and learning from case studies of people who used different models and found one or the other more appropriate. So, if you have those, please don’t hold back on posting them.

