ARC Prize is a $1,000,000+ nonprofit, public competition

qrdl · June 24, 2024, 8:14pm

Pretty interesting. Had no idea that AI struggled so much with this.

The puzzles are fun to try. Certainly gives you the impression that this is very easy to solve.

Though #5 was very weird. I got it right away, but I can’t really express how I got it.

#6 … ahahah. Yeah, not sure how I could train an AI to figure that out.

_j · June 24, 2024, 8:29pm

#5 is made of 2x2 and 1x3 blocks.
#6 is just an AND intersection.
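
To make the "AND intersection" reading of #6 concrete, here is a minimal sketch. It assumes things the thread does not spell out: that the input is 3 rows by 7 columns with a gray divider in the middle column, that the two 3x3 halves are overlaid cell by cell, and that the usual ARC color codes apply (0 = black, 1 = blue, 2 = red, 5 = gray). The function name and the example grid are made up for illustration.

```python
# Assumed ARC color codes: 0 = black, 1 = blue, 2 = red, 5 = gray.
BLACK, BLUE, RED = 0, 1, 2

def and_intersection(grid, divider_col=3):
    """Overlay the halves on either side of the divider column.

    A cell in the output is red only where both halves are blue.
    """
    left = [row[:divider_col] for row in grid]
    right = [row[divider_col + 1:] for row in grid]
    return [
        [RED if l == BLUE and r == BLUE else BLACK for l, r in zip(lrow, rrow)]
        for lrow, rrow in zip(left, right)
    ]

# Hypothetical input in the assumed layout (gray divider in column 3).
example = [
    [1, 0, 0, 5, 0, 1, 0],
    [0, 1, 0, 5, 1, 1, 1],
    [1, 0, 0, 5, 0, 0, 0],
]

print(and_intersection(example))  # [[0, 0, 0], [0, 2, 0], [0, 0, 0]]
```

The rule itself is one line. The hard part is getting a model to notice the divider and decide to overlay the halves in the first place.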

The public training set is significantly easier than the others (public evaluation and private evaluation set) since it contains many “curriculum” type tasks intended to demonstrate Core Knowledge systems. It’s like a tutorial level.

It seems the key here is to develop your own training set, expanding the 400 training tasks to 40,000, and working out where the novelty in the 800 public tasks you get as examples and practice (training plus evaluation) cuts off. It is a very specific domain, so success doesn’t really translate to much else.
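
One common way to stretch a small task set is geometric and color augmentation. The sketch below is only an illustration of that idea, not the method anyone in this thread used. It assumes a task is a list of (input, output) grid pairs with integer colors 0-9, and that 0 is the background and should stay fixed. All function names are hypothetical.

```python
import random

def rotate90(grid):
    """Rotate a grid 90 degrees clockwise."""
    return [list(row) for row in zip(*grid[::-1])]

def flip_h(grid):
    """Mirror a grid left to right."""
    return [row[::-1] for row in grid]

def permute_colors(grid, mapping):
    """Relabel every cell through a color mapping."""
    return [[mapping[c] for c in row] for row in grid]

def augment_task(pairs, rng):
    """Apply one random rotation/flip/color relabeling to every pair in a task.

    The same transform is applied to inputs and outputs so the underlying
    rule is preserved (for rules that do not depend on orientation or on
    specific colors).
    """
    k = rng.randrange(4)                 # number of clockwise quarter turns
    flip = rng.random() < 0.5
    colors = list(range(1, 10))          # assumption: 0 is background, kept fixed
    shuffled = colors[:]
    rng.shuffle(shuffled)
    mapping = {0: 0, **dict(zip(colors, shuffled))}

    def transform(grid):
        for _ in range(k):
            grid = rotate90(grid)
        if flip:
            grid = flip_h(grid)
        return permute_colors(grid, mapping)

    return [(transform(i), transform(o)) for i, o in pairs]

# Usage: 100 variants per task would turn 400 tasks into 40,000.
rng = random.Random(0)
task = [([[1, 0], [0, 2]], [[2, 0], [0, 1]])]
variants = [augment_task(task, rng) for _ in range(100)]
```

Variants like these add volume but not new concepts, and they break tasks whose rule depends on gravity, direction, or a particular color, so each augmented task would need checking.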

qrdl · June 24, 2024, 8:30pm

Yep, for sure. Often about 75% of the effort in these comps is generating a good complementary training set.

qrdl · June 24, 2024, 8:32pm

Yeah, though it’s funny how intuitively I was able to see this but trying to think how I might generalize this for an AI… yikes.

qrdl · June 24, 2024, 8:37pm

I gave #6 to Sonnet 3.5

Based on the analysis, the pattern can be summarized as follows:

Examine each column in the 7x3 input grid.

Count the number of blue squares in each column:

If a column has exactly 2 blue squares, place a red square in the corresponding column of the 3x3 output grid.

The vertical position of the red square in the output matches the position of the topmost blue square in that input column.

Ignore columns with 1 or 3 blue squares (they don’t produce any output).

This pattern transforms the 7x3 input grid into the 3x3 output grid for all given examples.

And then GPT-4o:

Pattern Rule:

Mapping Regions: The 7x3 input grid can be divided into regions that map to the 3x3 output grid.

Red Cell Criteria: A cell in the 3x3 output grid becomes red if the corresponding region in the input grid contains a majority of blue cells surrounding non-blue (gray or black) cells.
