upvote
> “trust it in production workflows”

What degree of predictability is required? I imagine the bar is pretty low if you trust the previous models in the same contexts.

reply