Chollet literally never says that. Quite the opposite. He says that AIs are currently abysmally bad at the skills this benchmark tests.
An AGI should be able to do this, but a system being able to do this doesn't make it AGI. He has been very clear about that. I suggest you go back and (re)read the paper that introduced ARC-AGI.
No system can crack these out of the box (the way humans can) because we don't have AGI.