(I am not saying LLMs can't be a good tool in evaluating ideas. To me, it sounds like you're firing off ideas all over, letting the LLMs judge what's good and what's not. Insane.)
And yes, I fire off ideas all over. Many require predicting the future to decide what to focus my individual effort on. This is a terrible way to do things because humans (and LLMs) are notoriously terrible at predicting the future. The gold standard is to try everything and eliminate what doesn't work. This is impossible using human labor. With LLM labor, it's simply a matter of relatively cheap money.
It's amazing. Technical problems are now no longer having to predict what the best implementation is. You can just try each one.
Again, no need to have an LLM judge, because the metrics that define 'better' are well-defined, and this is the interesting part of computer science, not the implementation.