upvote
> A city government funding a fine-tune of a model is interesting.

Looks like it's an IT services government-owned company.

Most likely, they saw some business opportunity on selling it around for cities.

reply
Indeed, this is all very true, I'd say it's true for the larger teams too, the entire ecosystem is so gamed by now that if you don't have your own private benchmarks with private test cases you haven't shared publicly, it's almost impossible to get a fair picture how well a model works, unless you actually sit down and use it.
reply