Structure from Motion

Computer Vision Course Coding Assignment 2

Course: GIST Computer Vision (EC4216)
Project Type: Structure from Motion Implementation Individual Coding Assignment

Overview

In this project, we implemented Structure From Motion (SfM) to reconstruct the 3D structure of objects from images taken at multiple viewpoints. First, we performed Two-View SfM using the 5-point algorithm on the initial two viewpoints. Then, for additional viewpoints, we applied the 3-point algorithm along with bundle adjustment to perform Multi-View SfM.

In each SfM stage, the point algorithms were used to estimate the camera pose, where RANSAC was employed to select the most reliable estimates. Once the camera poses were extracted, we applied triangulation on the matching points across images to obtain 3D cloud points.

Experiments were conducted under the assumption of constant camera parameters, using both the standard dataset (moai, choonsik, nike, toothless) and a custom dataset (eiffeltower, dinosaur, bluedoll, dog). We confirmed that Two-View SfM could reconstruct partial 3D structures of objects, while Multi-View SfM enabled the recovery of more complete 3D structures. In particular, objects with rich feature points, such as moai and eiffeltower, produced high-quality results.

Description

The provided code (main_two_view.py) is structured in multiple stages:

Feature Extraction and Matching:
Two consecutive images are processed to extract keypoints and match features between them.
Fill the #todo blank in: matching_two_image
Essential Matrix Estimation:
An essential matrix is estimated using RANSAC based on the inlier matches from the feature extraction.
Fill the #todo blank in: essential_matrix_estimation
Essential Matrix Decomposition:
The essential matrix is decomposed to recover the relative camera pose (rotation and translation).
Fill the #todo blank in: essential_matrix_decomposition
Triangulation:
Using the recovered camera poses and inlier matches, the 3D points are triangulated and the point cloud is saved in PLY format.
Fill the #todo blank in: triangulate_points
Three Point Algorithm:
In growing step, calculate additional camera pose using three point algorithm with RANSAC. Additionally, calculate inlier points.
Fill the #todo blank in: three_point_algorithm, and calculate_inlier_points
Camera Calibration:
Using the checker board, the intrinsic camera matrix are calculated by the function. Additionally, make your own dataset and test the SfM.
Fill the #todo blank in: camera_calibaration

Directory Structure

.  
├── input/                    # Input dataset directory 
│ ├── checker_board/          # 19 checker board images for camera calibration (optional)
│ ├── Checkerboard8x6.pdf     # Checker boerd image for custom dataset (optional)
│ ├── custom/                 # Make you dataset (optional)
│ ├── choonsik/               # 19 Object images
│ ├── toothless/              # 21 Object images
│ ├── nike/                   # 17 Object images
│ └── moai/                   # 19 Object images
│
├── output/                   # Output directory to save results
│ └── camera_intrinsic.pkl    # Camera intrinsic parameter
├── output_multi/             # Output directory to save results for multi view (optional)
│ └── camera_intrinsic.pkl    # Camera intrinsic parameter (optional)
│
├── Step2/                    # MATLAB scripts for essential matrix estimation
├── Step5/                    # MATLAB scripts for essential matrix estimation (optional)
├── utils/                    # Utility modules for keypoint conversion, point cloud writing, etc.
│
├── E_decomposition.py        # Module to implement essential_matrix_decomposition 
├── E_estimation.py           # Module to implement essential_matrix_estimation 
├── feature_matching.py       # Module to implement matching_two_image 
├── triangulation.py          # Module to implement triangulate_points
│
├── three_point_algorithm.py  # Module to implement three_point_algorithm and calculate_inlier_points  (optional)
├── bundle.py                 # Already implemented. Only use the bundle for multi view.  (optional)
│
├── main_two_view.py          # Main Python script to run the assignment 
├── main_mulit_view.py        # Main Python script to run the assignment (optional)
└── README.md                 # This README file

Requirements

Python Requirements

Python Version: 3.6 or above
Libraries:
- OpenCV (cv2)
- NumPy
- argparse
- natsort
- pickle
- tqdm
- MATLAB Engine API for Python (see below)

You can install the required Python libraries (except the MATLAB Engine) using pip:

pip install opencv-python numpy natsort tqdm

MATLAB Engine API for Python

The MATLAB Engine API for Python allows your Python code to invoke MATLAB functions. This is essential for running certain parts of the assignment (e.g., essential matrix estimation and triangulation).

Installation Instructions:

Verify MATLAB Installation:
Confirm that MATLAB is installed on your system.
Locate the MATLAB Engine Directory:
The MATLAB Engine API resides in the MATLAB installation folder:
- Windows:
  C:\Program Files\MATLAB\{R2022b}\extern\engines\python
- macOS/Linux:
  {MATLAB_ROOT}/extern/engines/python
  Replace R2022b (or MATLAB_ROOT) with your MATLAB version.
Install the MATLAB Engine API:
Open a terminal (or command prompt) and navigate to the MATLAB engine directory. Then run:
```
cd {path_to_matlab_installation}/extern/engines/python
python setup.py install
```
On macOS/Linux, you might need to use sudo:
```
sudo python setup.py install
```
Test the Installation:
Launch a Python shell and try importing the MATLAB engine module:
```
import matlab.engine
```
If there are no errors, the installation was successful.

Usage

Arguments

Run the main script using the command line. The script accepts several arguments to control various processing steps:

-s or --step: Steps to execute (default: all steps). For example, -s 1,2,3 runs only the camera calibration, feature matching, and essential matrix estimation steps. If -s all, runs all step of two view SfM.
-d or --dataset_path: Path to the dataset folder (default: ./input).
-o or --output_path: Folder where outputs will be saved (default: ./output).
--object: Specifies the object name to process. This should correspond to a subfolder in the dataset (e.g., "nike", "moai", "toothless"). (default: moai)
--initial_image_num: The starting image number for processing.
--second_image_num: The starting image number for processing.
--matching_threshold_knn: The threshold for k-NN feature matching. Adjust as necessary to suit your dataset. (default: moai)
--ransac_iter: The number of iterations for the RANSAC algorithm during essential matrix estimation. (default: moai)
--em_threshold: Threshold value for inlier selection during essential matrix estimation. (default: moai)
--visualize_camera_pose: When set to True, the camera poses recovered during essential matrix decomposition will be visualized. (default: moai)

Example Command

Run the main script from the command line with the desired arguments. For instance:

python main.py -s all -d ./input -o ./output --object moai

Implementation Details

Step 0: Settings and MATLAB Engine Initialization
- Initializes the MATLAB engine if required for MATLAB-dependent steps
Step 1: Feature Extraction and Matching
- Extracts features using SIFT algorithm from a pair of images
- Finds matching points between the extracted features using the KNN algorithm.
Step 2: Essential Matrix Estimation
- Nomalise the image pixels using intrinsic camera matrix.
- Uses 5 point algorithm along with a RANSAC strategy to compute the essential matrix.
- Calculate error and estimate essential matrix that has lowest error.
Step 3: Essential Matrix Decomposition
- Decomposes the essential matrix to obtain the relative camera pose and saves the computed poses.
Step 4: Triangulation
- Triangulates 3D points based on the established camera poses and inlier matches.
- Generates a PLY file containing a colored point cloud.
Step 5: Three Point Algorithm for PnP.
- Estimate the additional camera pose using three point algorithm with RANSAC
- And, calculate the addtional inlier points to 3d points
Step 6: Bundle Adjustments.(Implemented)
- Already implemented the bundle. Only use the function. (use the option --apply_bundle True)
Step 7: Camera Calibration
- Detect corner of checker board and calculate intrinsic matrix using opencv.
- Print the checker board and take a picture.
- Make your own dataset. Note that, fix the manual focus.
- Run SfM. (make a folder "custom" in input and use the arguments --object custom)

Output

The script generates the following outputs for each object:

Matching result: Saved matching_results_init_images.jpg and matching_results_init_images.pkl
Essential matrix results: Saved E_estimation.pkl
Camera pose: camera_pose.pkl
Triangulation results: Saved triangulation_results.pkl
3D point cloud results: Saved two_view_results.ply

Troubleshooting

Edit the line 71 in main_two_view.py for folder structure of camera_calibration.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Structure from Motion

Computer Vision Course Coding Assignment 2

Overview

Description

Directory Structure

Requirements

Python Requirements

MATLAB Engine API for Python

Usage

Arguments

Example Command

Implementation Details

Output

Troubleshooting

About

Uh oh!

Releases 1

Packages

Contributors 2

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
Step2		Step2
Step5		Step5
utils		utils
(submitted) report.pdf		(submitted) report.pdf
(supplemented) report_compressed.pdf		(supplemented) report_compressed.pdf
.DS_Store		.DS_Store
E_decomposition.py		E_decomposition.py
E_estimation.py		E_estimation.py
PA2_Structure_from_Motion.pdf		PA2_Structure_from_Motion.pdf
README.md		README.md
bundle.py		bundle.py
camera_calibration.py		camera_calibration.py
feature_matching.py		feature_matching.py
main_multi_view.py		main_multi_view.py
main_two_view.py		main_two_view.py
three_point_algorithm.py		three_point_algorithm.py
triangulation.py		triangulation.py

gyoenge/structure-from-motion

Folders and files

Latest commit

History

Repository files navigation

Structure from Motion

Computer Vision Course Coding Assignment 2

Overview

Description

Directory Structure

Requirements

Python Requirements

MATLAB Engine API for Python

Usage

Arguments

Example Command

Implementation Details

Output

Troubleshooting

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Contributors 2

Uh oh!

Languages

Packages