Discussion about this post

User's avatar
Darren Mckeeman's avatar

The obvious solution to all of this is to use local models. The argument is even stronger when you get down to domain-specific uses, like drug discovery or DNA sequencing.

Kevin Lacker's avatar

The Claude models also seem to be getting more and more “jargony”. I often ask for some coding task and then it comes back describing its work using a bunch of made-up terms, and then I have to dig around to see what it’s talking about.

I suspect the issue is that the time horizons of training tasks are getting longer. Like a human, when the AI models spend more time in their own head, they get a less agreeable personality, and find it harder to communicate with normal people.

No posts

Ready for more?