points
There might be future optimizations. Like, have your small model do COT to find where to look for memory that is relevant.