undefined

points

[-]

It's correlated with the ability to solve logic puzzles.

It's also interesting because it's very very hard for base LLMs, even if you try to "cheat" by training on millions of ARC-like problems. Reasoning LLMs show genuine improvement on this type of problem.

by HDThoreaun52 minutes ago|

prev|

[-]

ARC-AGI 2 is an IQ test. IQ tests have been shown over and over to have predictive power in humans. People who score well on them tend to be more successful

by jabedude5 hours ago|

prev|

[-]

how would we actually objectively measure a model to see if it is AGI if not with benchmarks like arc-AGI?

by WarmWash4 hours ago|

parent|

[-]

Give it a prompt like

>can u make the progm for helps that with what in need for shpping good cheap products that will display them on screen and have me let the best one to get so that i can quickly hav it at home

And get back an automatic coupon code app like the user actually wanted.