Existing apps require you to frame each face of the cube in a grid and take a photo per side, following specific orientation requirements, in order to detect the state. This project instead detects the cube state from a video stream, with the user simply rotating the cube in front of a camera.
This project started as a CalHacks project, where we built a SwiftUI app using OpenCV's C++ API (chosen for its Swift interoperability). The cube detection was done with pure classical CV (see project here), using techniques including masking and thresholding, contour maps, connected components, Canny edge detection, and RDP polygonal approximation. Results can be seen here:
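As a rough illustration of that classical pipeline, here is a minimal Python/OpenCV sketch (not the original C++ code; thresholds, kernel sizes, and the area cutoff are illustrative):

```python
import cv2
import numpy as np

def find_sticker_quads(frame_bgr):
    """Threshold, find contours, and keep 4-sided polygons
    that could plausibly be cube stickers."""
    gray = cv2.cvtColor(frame_bgr, cv2.COLOR_BGR2GRAY)
    blurred = cv2.GaussianBlur(gray, (5, 5), 0)

    # Canny edge detection, then a morphological close so edges form loops.
    edges = cv2.Canny(blurred, 50, 150)
    edges = cv2.morphologyEx(edges, cv2.MORPH_CLOSE, np.ones((3, 3), np.uint8))

    contours, _ = cv2.findContours(edges, cv2.RETR_EXTERNAL,
                                   cv2.CHAIN_APPROX_SIMPLE)

    quads = []
    for c in contours:
        # RDP polygonal approximation: simplify the contour, keep quads.
        approx = cv2.approxPolyDP(c, 0.04 * cv2.arcLength(c, True), True)
        if len(approx) == 4 and cv2.contourArea(approx) > 500:
            quads.append(approx)
    return quads
```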
We use DINO + LangSAM, passing the bounding-box output from DINO into LangSAM for segmentation, to produce a segmented cube (1-cube-segmentation); this is fed into a classical CV pipeline that detects the individual pieces (2-piece-detection), from which the final state is extracted (3-state-mapping).
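A condensed sketch of the first stage is below; it assumes the `lang_sam` package's `LangSAM.predict` interface, and the prompt string and file name are placeholders, not the repo's actual code:

```python
import numpy as np
from PIL import Image
from lang_sam import LangSAM  # assumes the lang-segment-anything package

# LangSAM pairs DINO box proposals with SAM for text-prompted segmentation.
model = LangSAM()

def segment_cube(frame: Image.Image) -> np.ndarray:
    """Stage 1 (1-cube-segmentation): return a binary mask of the cube."""
    masks, boxes, phrases, logits = model.predict(frame, "a rubik's cube")
    best = int(logits.argmax())  # keep the highest-confidence detection
    return masks[best].cpu().numpy()

frame = Image.open("frame.png").convert("RGB")
cube_mask = segment_cube(frame)
# The mask then feeds the classical piece-detection stage (2-piece-detection).
```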

We obtained reasonable results on most cube inputs; the main failure mode was hand placements that obscure the cube's corners, which causes line detection to fail.
An immediate next goal is to train an end-to-end model, bypassing the classical CV steps.
This was our Fall 2023 NMEP project for Machine Learning @ Berkeley.
