upvote
Sounds a lot like an autoregressive sampling problem. Maybe try to set temperature and repeat penalty differently.
reply
You're right, I should have checked the model settings. For some reason the default model profile in Ollama had temperature set to 0. Changing the temperature and repeat penalty worked much better than it did when I tried to correct similar behavior in the smallest phi4 reasoning model.
reply
Thank you, this was affecting me too.
reply