
Commit

Refactoring code
nv-nguyen committed May 8, 2023
1 parent 50a1087 commit 3f8206d
Showing 126 changed files with 2,674 additions and 7,250 deletions.
10 changes: 10 additions & 0 deletions .gitignore
@@ -1,3 +1,13 @@
# Additional
configs/user/nguyen.yaml
.hydra/
bop_viz_kit/
huggingface/
slurm_logs/
slurm_utils/
outputs/
tmp/
.empty/
# Byte-compiled / optimized / DLL files
__pycache__/
*.py[cod]
21 changes: 0 additions & 21 deletions LICENSE

This file was deleted.

145 changes: 61 additions & 84 deletions README.md
@@ -1,15 +1,15 @@
# template-pose
PyTorch implementation of the [**"Templates for 3D Object Pose Estimation Revisited: Generalization to New objects and Robustness to Occlusions"**](https://arxiv.org/abs/2203.17234) paper (accepted to CVPR 2022)
## template-pose (CVPR 2022) <br><sub>Official PyTorch implementation </sub>

**[Van Nguyen Nguyen](https://nv-nguyen.github.io/),
![Teaser image](./media/method.png)

**Templates for 3D Object Pose Estimation Revisited: Generalization to New objects and Robustness to Occlusions**<br>
[Van Nguyen Nguyen](https://nv-nguyen.github.io/),
[Yinlin Hu](https://yinlinhu.github.io/),
[Yang Xiao](https://youngxiao13.github.io/),
[Mathieu Salzmann](https://people.epfl.ch/mathieu.salzmann) and
[Vincent Lepetit](https://vincentlepetit.github.io/)**

Check out our [**paper**](https://arxiv.org/abs/2203.17234) and [**webpage**](https://nv-nguyen.github.io/template-pose/) for details!

![figures/method.png](figures/method.png)
[Vincent Lepetit](https://vincentlepetit.github.io/) <br>
**[Paper](https://openaccess.thecvf.com/content/CVPR2022/papers/Nguyen_Templates_for_3D_Object_Pose_Estimation_Revisited_Generalization_to_New_CVPR_2022_paper.pdf)
, [Project Page](https://nv-nguyen.github.io/template-pose/)**

If our project is helpful for your research, please consider citing:
```bibtex
@@ -19,104 +19,81 @@ If our project is helpful for your research, please consider citing :
booktitle={Proceedings IEEE Conf. on Computer Vision and Pattern Recognition (CVPR)},
year={2022}}
```
## Table of Contents
* [Methodology](#Methodology)
* [Installation](#Installation)
* [Datasets](#Datasets)
* [T-LESS](#T-LESS)
* [LINEMOD and Occlusion-LINEMOD](#LINEMOD)
* [Acknowledgement](#Acknowledgement)
## Methodology :student:

We introduce *template-pose*, which estimates the 3D pose of new objects (which can be very different from the training ones, i.e., the LINEMOD dataset) with only their 3D models. Our method requires neither a training phase on these objects nor images depicting them.

You can also give the project a star :star: if the code is useful to you.
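
For intuition, here is a minimal sketch of the template-matching idea (not the repo's actual code; all function and variable names are made up): a query crop is embedded by a frozen network and compared against embeddings of templates rendered from the object's 3D model, and the pose of the most similar template is returned.

```python
import torch
import torch.nn.functional as F

@torch.no_grad()
def retrieve_pose(query_feat, template_feats, template_poses):
    """Return the pose of the template most similar to the query.

    query_feat:     (D,) embedding of the query crop
    template_feats: (N, D) embeddings of the N rendered templates
    template_poses: list of N object poses used to render the templates
    """
    q = F.normalize(query_feat, dim=0)
    t = F.normalize(template_feats, dim=1)
    sim = t @ q                          # (N,) cosine similarities
    best = int(sim.argmax())
    return template_poses[best], sim[best].item()
```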

If you like this project, check out related works from our group:
- [NOPE: Novel Object Pose Estimation from a Single Image (arXiv 2023)](https://github.com/nv-nguyen/nope)
- [PIZZA: A Powerful Image-only Zero-Shot Zero-CAD Approach to 6DoF Tracking
(3DV 2022)](https://github.com/nv-nguyen/pizza)
- [BOP visualization toolkit](https://github.com/nv-nguyen/bop_viz_kit)

Two settings are considered in this work:

| Dataset | Predict object ID | In-plane rotation |
|:-------------------:|:-----------------------:|:-----------------:|
| (Occlusion-)LINEMOD | Yes | No |
| T-LESS | No | Yes |
## Updates (WIP)
We have introduced additional features and updates to the codebase:
- Added code to generate template poses (in the OpenCV coordinate system) from an icosahedron with Blender
- Configuration parsing with the [hydra](https://github.com/facebookresearch/hydra) library, and simplified training_step/testing_step with [pytorch lightning](https://lightning.ai/) (a minimal skeleton is sketched below)
- Path structure (of pretrained models and datasets) defined as in our recent project [NOPE](https://github.com/nv-nguyen/nope)
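
As an illustration of this structure, here is a minimal hydra + pytorch lightning skeleton; the class name, config fields and the loss are assumptions made for the sketch, not the repo's actual implementation.

```python
import hydra
import torch
import torch.nn.functional as F
import pytorch_lightning as pl
from omegaconf import DictConfig


class TemplateModel(pl.LightningModule):
    """Toy contrastive model: each query crop should match its own template."""

    def __init__(self, backbone, lr=1e-4):
        super().__init__()
        self.backbone = backbone
        self.lr = lr

    def training_step(self, batch, batch_idx):
        # assumed batch layout: a query crop and its positive template
        q = F.normalize(self.backbone(batch["query"]), dim=1)
        t = F.normalize(self.backbone(batch["template"]), dim=1)
        logits = q @ t.T / 0.1                      # InfoNCE-style similarity matrix
        labels = torch.arange(len(q), device=self.device)
        loss = F.cross_entropy(logits, labels)
        self.log("train/loss", loss)
        return loss

    def configure_optimizers(self):
        return torch.optim.Adam(self.parameters(), lr=self.lr)


@hydra.main(config_path="configs", config_name="train", version_base=None)
def main(cfg: DictConfig):
    model = TemplateModel(hydra.utils.instantiate(cfg.model.backbone), lr=cfg.model.lr)
    trainer = pl.Trainer(max_epochs=cfg.max_epochs)
    # trainer.fit(model, datamodule=hydra.utils.instantiate(cfg.datamodule))


if __name__ == "__main__":
    main()
```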

## Installation :man_mechanic:
## Updates (WIP)
- Tutorial of training/testing on custom datasets
- Gradio demo with similarity visualization
- Release universal pretrained models
<details><summary>Click to expand</summary>

We recommend creating a new Anaconda environment to use *template-pose*. Use the following commands to set up a new environment:
```
conda env create -f environment.yml
conda activate template
```

```bash
$ROOT_DIR
├── datasets
    ├── linemod
        ├── models
        ├── opencv_pose
        ├── test
    ├── tless
    ├── templates
├── pretrained
├── results
    ├── experiment1
    ├── experiment2
```
**Optional:** Installation of [BlenderProc](https://github.com/DLR-RM/BlenderProc) is required to render synthetic images. It can be skipped if you use our provided templates. More details can be found in [Datasets](#Datasets).
## Datasets :smiley_cat: :electric_plug:
Before downloading the datasets, you may change [this line](https://github.com/nv-nguyen/template-pose/blob/main/config.json#L2) to define the $ROOT folder (to store data and results).
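
For illustration only, reading the root folder from config.json could look like the snippet below; the key name used here is a guess, so check the actual file.

```python
import json

# hypothetical example: load the root folder defined in config.json
with open("config.json") as f:
    cfg = json.load(f)
root_dir = cfg["root"]  # assumed key name; see config.json for the real one
```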

There are two options:
1. To download our pre-processed datasets (15GB) + SUN397 dataset (37GB)
```
./data/download_preprocessed_data.sh
```
**Optional:** You can download them with the following Google Drive links and unzip them manually. We recommend keeping the $DATA folder structure as detailed in [./data/README](https://github.com/nv-nguyen/template-pose/blob/main/data/README.md) to keep the pipeline simple:
- [LINEMOD and Occlusion-LINEMOD (3GB)](https://drive.google.com/file/d/1XkQBt01nlfCbFuBsPMfSHlcNIzShn7e7/view?usp=sharing) -> then run ```python -m data.crop_image_linemod```
- [T-LESS (11GB)](https://drive.google.com/file/d/1d2GoswrnvcTlwFi_LWoCiy1uS5OkCiF1/view?usp=sharing)
- [Templates (both T-LESS and LINEMOD) (1.7GB)](https://drive.google.com/file/d/17_2SlfVtu9qYo82zPtqjL_GF99OWJvF5/view?usp=sharing)
- [Dataframes including the query-template pairs used for training (11MB)](https://drive.google.com/file/d/10XVLb-DFIayBu7RZCnsm7NuG0vPZOaTc/view?usp=sharing)
- [SUN397, randomized backgrounds for training on T-LESS (37GB)](https://vision.princeton.edu/projects/2010/SUN/SUN397.tar.gz)

2. To download the original datasets and process them from scratch (process GT poses, render templates, compute nearest neighbors). All the main steps are detailed in [./data/README](https://github.com/nv-nguyen/template-pose/blob/main/data/README.md).
```
./data/download_and_process_from_scratch.sh
```
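One of these preprocessing steps is assigning each query image its nearest templates. A simple sketch of the idea (not the repo's exact script) compares the query rotation with every template rotation using the geodesic distance:
```python
import numpy as np

def geodesic_distance(R1, R2):
    """Angle (in radians) between two 3x3 rotation matrices."""
    cos = (np.trace(R1.T @ R2) - 1.0) / 2.0
    return np.arccos(np.clip(cos, -1.0, 1.0))

def nearest_template(query_R, template_Rs):
    """Index of the template whose viewpoint is closest to the query pose."""
    dists = [geodesic_distance(query_R, R) for R in template_Rs]
    return int(np.argmin(dists))
```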
For any training with a ResNet50 backbone, we initialise with pretrained MoCo v2 features, which can be downloaded with the following command:
```
python -m lib.download_weight --model_name MoCov2
```
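For reference, MoCo v2 checkpoints are usually loaded into a torchvision ResNet-50 roughly as below; the checkpoint filename and key layout are assumptions based on the official MoCo release, not necessarily what `lib.download_weight` produces.
```python
import torch
import torchvision

model = torchvision.models.resnet50(weights=None)
ckpt = torch.load("pretrained/moco_v2_800ep_pretrain.pth.tar", map_location="cpu")

state = {}
for k, v in ckpt["state_dict"].items():
    # keep only the query encoder and drop its MLP projection head
    if k.startswith("module.encoder_q") and not k.startswith("module.encoder_q.fc"):
        state[k.replace("module.encoder_q.", "")] = v

# the classification head fc.* stays randomly initialised
missing, unexpected = model.load_state_dict(state, strict=False)
```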
## T-LESS :electric_plug:
#### 1. To launch a training on T-LESS:
```
python train_tless.py --config_path ./config_run/TLESS.json
```
#### 2. To reproduce the results on T-LESS:
To download pretrained weights (by default, they are saved at $ROOT/pretrained/TLESS.pth):
```
python -m lib.download_weight --model_name TLESS
```
**Optional:** You can download manually with [this link](https://drive.google.com/drive/folders/11SQYPrG3pX31Qszf8R13s7Aaa5MO57lb?usp=sharing)
</details>
If you don't want these features, you can use the previous version of the codebase with the following command:

To evaluate the model with the pretrained weights:
```
python test_tless.py --config_path ./config_run/TLESS.json --checkpoint $ROOT/pretrained/TLESS.pth
git checkout 50a1087
```

## LINEMOD and Occlusion-LINEMOD :smiley_cat:
#### 1. To launch a training on LINEMOD:
```
python train_linemod.py --config_path config_run/LM_$backbone_$split_name.json
```
For example, with the "base" backbone and split #1:
This repository uses the Weights & Biases logger. Make sure you update this [user configuration](https://github.com/nv-nguyen/template-pose/blob/main/configs/user/default.yaml) before running any experiments.

## Installation :construction_worker:

<details><summary>Click to expand</summary>

### 1. Create conda environment
```
python train_linemod.py --config_path config_run/LM_baseNetwork_split1.json
conda env create -f environment.yml
conda activate template
```

#### 2. To reproduce the results on LINEMOD:
To download pretrained weights (by default, they are saved at $ROOT/pretrained):
### 2. Datasets (from scratch) :smiley_cat: :electric_plug:
First, create the template poses from an icosahedron:
```
python -m lib.download_weight --model_name LM_$backbone_$split_name
blenderproc run src/poses/create_poses.py
```
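A rough sketch of what such a script computes (not the repo's actual src/poses/create_poses.py, which may use a finer subdivision and extra in-plane rotations): sample viewpoints from the vertices of an icosahedron and convert each one into a world-to-camera pose in the OpenCV convention (x right, y down, z forward):
```python
import numpy as np

def icosahedron_vertices():
    """Unit-norm vertices of a regular icosahedron (12 viewpoints)."""
    phi = (1.0 + np.sqrt(5.0)) / 2.0
    v = np.array([[-1, phi, 0], [1, phi, 0], [-1, -phi, 0], [1, -phi, 0],
                  [0, -1, phi], [0, 1, phi], [0, -1, -phi], [0, 1, -phi],
                  [phi, 0, -1], [phi, 0, 1], [-phi, 0, -1], [-phi, 0, 1]], dtype=float)
    return v / np.linalg.norm(v, axis=1, keepdims=True)

def look_at_opencv(cam_pos, target=np.zeros(3), up=np.array([0.0, 0.0, 1.0])):
    """4x4 world-to-camera pose of a camera at cam_pos looking at target."""
    z = target - cam_pos
    z = z / np.linalg.norm(z)                 # OpenCV: z points toward the scene
    x = np.cross(z, up)
    x = x / np.linalg.norm(x)
    y = np.cross(z, x)                        # y points "down"
    R = np.stack([x, y, z])                   # rows = camera axes in world frame
    t = -R @ cam_pos
    return np.vstack([np.hstack([R, t[:, None]]), [0.0, 0.0, 0.0, 1.0]])

poses = [look_at_opencv(2.0 * v) for v in icosahedron_vertices()]  # cameras at radius 2
```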
**Optional:** You can download manually with [this link](https://drive.google.com/drive/folders/11SQYPrG3pX31Qszf8R13s7Aaa5MO57lb?usp=sharing)

To evaluate the model with a checkpoint_path:
There are two options for the next steps:
#### Option 1: Download preprocessed dataset:
```
python test_linemod.py --config_path config_run/LM_$backbone_$split_name.json --checkpoint checkpoint_path
./src/scripts/download_preprocessed_data.sh
```
For example, with the "base" backbone and split #1:
#### Option 2: Create/download dataset from scratch
```
python -m lib.download_weight --model_name LM_baseNetwork_split1
python test_linemod.py --config_path config_run/LM_baseNetwork_split1.json --checkpoint $ROOT/pretrained/LM_baseNetwork_split1.pth
./src/scripts/download_and_process_from_scratch.sh
```
</details>


## Acknowledgement

The code is adapted from [PoseContrast](https://github.com/YoungXIAO13/PoseContrast), [DTI-Clustering](https://github.com/monniert/dti-clustering), [CosyPose](https://github.com/ylabbe/cosypose) and [BOP Toolkit](https://github.com/thodan/bop_toolkit). Many thanks to them!
The code is adapted from [NOPE](https://github.com/nv-nguyen/nope), [TEMOS](https://github.com/Mathux/Temos), [PoseContrast](https://github.com/YoungXIAO13/PoseContrast), [CosyPose](https://github.com/ylabbe/cosypose) and [BOP Toolkit](https://github.com/thodan/bop_toolkit).

The authors thank Martin Sundermeyer, Paul Wohlhart and Shreyas Hampali for their fast replies and feedback!

4 changes: 0 additions & 4 deletions bop_toolkit_lib/README.md

This file was deleted.

8 changes: 0 additions & 8 deletions bop_toolkit_lib/__init__.py

This file was deleted.

