I try to avoid > 200k contexts, as the 1M context is where I first saw the massive decrease in reliability.
And my AGENTS is really short, and I said it was ignoring decisions in the prompt.
You’ve been sold something that simply doesn’t work for the purported use case (intelligence) and instead is like a stupid database of all world knowledge with the appearance of intelligence.
Useful tools at times (if you bear in mind their limitations), but not close to intelligent, independent agents.
A "stupid" database would be better, based on what I get when I ask whether all of Oregon state is North of New York City. Indian English has a word for it: oversmart.
You really need to look into hooks based on your coding agent. This is very much a solved problem as I demonstrate with
https://github.com/gitsense/pi-brains
I have a test repo
https://github.com/gitsense/gsc-rules-demos
that shows how you can block and warn and do other things.
You obviously can't have a "Don't make a mistake" rule though.
The agreed architecture is to use signing between two micros, so that a third can orchestrate between them in zero trust way (and to prevent a distributed monolith). It just decides that we can trust the third and skips the signing.
Basically I treat it like a junior dev. We don’t get junior devs to write code correctly by cajoling them just right, we add CI gates. It still works.
Architectural decisions are not lintable.