> it's probably good enough to use, yea.

Not for general-purpose use, only as a demo.

> that reasonably working software of equivalent complexity is within reach for $20k to solve

But if this can't come close to replacing GCC and can't be modified without introducing bugs, then it hasn't proven that yet. I learned some new hacks from the paper, and that's great and all, but in my experience, trying to harness even 4 Claude sessions in parallel on a complex task just goes off the rails in terms of coherence. I'll try the new techniques, but my intuition is that it's not really as good as you're selling it.
