Delete the bad response, ask it for a summary or to update [context].md, then start a new instance.
The same issues are still happening in frontier models. Especially in long contexts or in the edges of the models training data.
It seems to degenerate into the same patterns. It’s like context blurs and it begins to value training data more than context.