This repository contains the code that was used for reaching the second position at the "Abstraction and Reasoning Challenge" Kaggle competition. The file 0813.ipynb, when executed, scores 0.813 in the competition's test set. This repository also contains the algorithm that was developed by Alejandro and Roderic separately. submissionFile.py, when executed on the private test set, gets a score of 0.882, meaning that it solves 12 tasks of the private test set. The extra 7 tasks that made us have a score of 0.813 were solved as a result of merging teams with Yuji.
If you have any questions, you can write an email to alejandrodemiquel@gmail.com.
Executing the file 0.813.ipynb on a Kaggle notebook will run the algorithm on the test data, given that the data is on the path /kaggle/input/abstraction-and-reasoning-challenge/test. The data path can be modified at the beginning of the file.
If the repository is cloned, one can execute the file submissionFile.py straightaway, executing on the public test set the algorithm that scores 0.882 in the private one. The bit where everything is executed is at the end of the file ("Main Loop and submission").
A simplified version of the algorithm can also be run by executing simplifiedAlgorithm.py. It runs around 5 times faster, while still solving many of the ARC tasks.
Currently, the program uses the data on the folder /kaggle/input/abstraction-and-reasoning-challenge/test as the tasks to be solved. This can be modified by setting a different data path and/or redefining the variable data
, at the beginning of the file.
The program has been tested both on Windows and Mac OS X. No GPU is required.
We've been mainly working using the files main.py, Task.py, Models.py and Utils.py. We only used submissionFile.py for submitting the program to Kaggle. But submissionFile.py already contains everything that is needed to run the algorithm, so one doesn't really need to look into these other files. Reliable documentation of the functions is only available in submissionFile.py. main.py is the entry point; it contains some basic functions and the main loop. Models.py stores some training models that we tried with very tiny success. Task.py contains the core objects that store all the information about the task to be solved. Utils.py contains all the other functions.
For every new task that we want to solve, the algorithm follows these steps:
- Generate an object of the class
Task
. At this step, all the preprocessing is done: checking for matrix shapes, common colors, symmetries... There are 6 core classes that are used to store information about the preprocessing: The classesTask
,Sample
,Matrix
,Shape
,Grid
andFrontier
. They can also be found in the file Task.py. - Based on the information retrieved from the preprocessing, the task might be transformed into something that can be processed more easily. It might make sense to modify some colors, rotate some of the matrices, crop the background or ignore some grid, for example.
- Once the transformation is done, we generate 3 dummy candidates and enter the loop to get the 3 best candidates. In every iteration of the loop, for each of the current best 3 candidates, we try to execute all sorts of operations that make sense to try, given the properties stored in the object of the class
Task
. Many different sorts of functions are tried, such as "moveShapes", "replicateShapes", "extendColor" or "pixelwiseXorInGridSubmatrices", just to mention a few examples. A complete list can be found below. If any function generates a candidate with a better score than any of the current best 3 candidates, we include this new candidate into our list of best 3 candidates, and remove the worst one. The score is simply computed by executing the functions that led to generate that candidate in the training samples, and checking how many pixels are incorrect. Thus, a score of 0 is the best possible one. - Revert the operations executed in step 2 to from the 3 candidates, to obtain the final candidates.
- If it makes sense, we generate another task either imposing that in every sample there can be either only one shape or only one color. Obviously, whe then have more training and test samples than in the original task. Then, we also compute the three best candidates for this version, and compare them with the previously generated ones to select the final best candidates.
This is a list of the functions that might be executed for each candidate in getPossibleOperations
. Many of them are executed several times, with different parameters. All of them contribute in solving at least one of the given task in the training or the evaluation set.
- fillTheBlank
- downsize
- downsizeMode
- arrangeShapes
- countColors
- countShapes
- colorMap
- symmetrize
- colorSymmetricPixels
- completeRectangles
- changeShapesWithFeatures
- lstm
- changeShapes
- replicateShapes
- colorByPixels
- followPattern
- CNN
- mirror
- rotate
- moveShapes
- pixelRecolor (taken from this public notebook)
- extendColor
- surroundShapes
- paintShapesInHalf
- paintShapeFromBorderColor
- colorLongestLines
- connectAnyPixels
- subMatToLayer
- alignShapes
- symmetrizeSubmatrix
- symmetrizeShapes
- colorByPixels
- overlapSubmatrices
- evolve
- evolvingLine
- paintCrossedCoordinates
- extendMatrix
- multiplyPixels
- multiplyMatrix
- pixelwiseAnd/Or/Xor in grid cells / submatrices
- generateMosaic
- selectSubmatrix
- maxColorFromCell
- cropShape
- moveToPanel
- cropFullFrame
- cropPartialFrame
- paintGridLikeBackground
- cropAllBackground
- switchColors
- minimize