Added dense mapping with RGBD.

luigifreda committed Dec 5, 2024
1 parent 7a0917e commit 0b59f8e
Showing 54 changed files with 8,286 additions and 6,001 deletions.
8 changes: 7 additions & 1 deletion .gitignore
@@ -32,10 +32,16 @@ map.json

data/ORBvoc.txt
data/orbvoc.dbow3
+data/ORBvoc_test.txt
+data/VLADvoc_orb.txt
+data/images/GardensPoint
+data/images/StLucia_small
+data/images/SFU
+data/orb_descriptors_kitti.npy
+data/slam_state

thirdparty/pydbow3/ORBvoc.txt

thirdparty/protoc
-thirdparty/protoc

logs/*.log
80 changes: 40 additions & 40 deletions README.md
@@ -5,48 +5,48 @@ Author: **[Luigi Freda](https://www.luigifreda.com)**
<!-- TOC -->

- [pySLAM v2.2.5](#pyslam-v225)
-- [Install](#install)
-- [Requirements](#requirements)
-- [Ubuntu](#ubuntu)
-- [MacOS](#macos)
-- [Docker](#docker)
-- [How to install non-free OpenCV modules](#how-to-install-non-free-opencv-modules)
-- [Troubleshooting and performance issues](#troubleshooting-and-performance-issues)
-- [Usage](#usage)
-- [Feature tracking](#feature-tracking)
-- [Loop closing](#loop-closing)
-- [Save and reload a map](#save-and-reload-a-map)
-- [Relocalization in a loaded map](#relocalization-in-a-loaded-map)
-- [Trajectory saving](#trajectory-saving)
-- [SLAM GUI](#slam-gui)
-- [Monitor the logs for tracking, local mapping, and loop closing simultaneously](#monitor-the-logs-for-tracking-local-mapping-and-loop-closing-simultaneously)
-- [Supported local features](#supported-local-features)
-- [Supported matchers](#supported-matchers)
-- [Supported global descriptors and local descriptor aggregation methods](#supported-global-descriptors-and-local-descriptor-aggregation-methods)
-- [Local descriptor aggregation methods](#local-descriptor-aggregation-methods)
-- [Global descriptors](#global-descriptors)
-- [Datasets](#datasets)
-- [KITTI Datasets](#kitti-datasets)
-- [TUM Datasets](#tum-datasets)
-- [EuRoC Dataset](#euroc-dataset)
-- [Camera Settings](#camera-settings)
-- [Comparison pySLAM vs ORB-SLAM3](#comparison-pyslam-vs-orb-slam3)
-- [Contributing to pySLAM](#contributing-to-pyslam)
-- [References](#references)
-- [Credits](#credits)
-- [TODOs](#todos)
+- [1. Install](#1-install)
+- [1.1. Requirements](#11-requirements)
+- [1.2. Ubuntu](#12-ubuntu)
+- [1.3. MacOS](#13-macos)
+- [1.4. Docker](#14-docker)
+- [1.5. How to install non-free OpenCV modules](#15-how-to-install-non-free-opencv-modules)
+- [1.6. Troubleshooting and performance issues](#16-troubleshooting-and-performance-issues)
+- [2. Usage](#2-usage)
+- [2.1. Feature tracking](#21-feature-tracking)
+- [2.2. Loop closing](#22-loop-closing)
+- [2.3. Save and reload a map](#23-save-and-reload-a-map)
+- [2.4. Relocalization in a loaded map](#24-relocalization-in-a-loaded-map)
+- [2.5. Trajectory saving](#25-trajectory-saving)
+- [2.6. SLAM GUI](#26-slam-gui)
+- [2.7. Monitor the logs for tracking, local mapping, and loop closing simultaneously](#27-monitor-the-logs-for-tracking-local-mapping-and-loop-closing-simultaneously)
+- [3. Supported local features](#3-supported-local-features)
+- [4. Supported matchers](#4-supported-matchers)
+- [5. Supported global descriptors and local descriptor aggregation methods](#5-supported-global-descriptors-and-local-descriptor-aggregation-methods)
+- [5.1. Local descriptor aggregation methods](#51-local-descriptor-aggregation-methods)
+- [5.2. Global descriptors](#52-global-descriptors)
+- [6. Datasets](#6-datasets)
+- [6.1. KITTI Datasets](#61-kitti-datasets)
+- [6.2. TUM Datasets](#62-tum-datasets)
+- [6.3. EuRoC Dataset](#63-euroc-dataset)
+- [7. Camera Settings](#7-camera-settings)
+- [8. Comparison pySLAM vs ORB-SLAM3](#8-comparison-pyslam-vs-orb-slam3)
+- [9. Contributing to pySLAM](#9-contributing-to-pyslam)
+- [10. References](#10-references)
+- [11. Credits](#11-credits)
+- [12. TODOs](#12-todos)

<!-- /TOC -->

-**pySLAM** is a python implementation of a *Visual Odometry (VO)* pipeline for **monocular**, **stereo** and **RGBD** cameras.
+**pySLAM** is a Python implementation of a *Visual SLAM* pipeline for **monocular**, **stereo** and **RGBD** cameras.
- It supports many classical and modern **[local features](#supported-local-features)** and it offers a convenient interface for them.
-- It implements loop closing via **[descriptor aggregators](#supported-global-descriptors-and-local-descriptor-aggregation-methods)** such as visual Bag of Words (BoW), Vector of Locally Aggregated Descriptors (VLAD) and modern **[global descriptors](#supported-global-descriptors-and-local-descriptor-aggregation-methods)** (aka image-wise descriptors).
+- It implements loop closing via **[descriptor aggregators](#supported-global-descriptors-and-local-descriptor-aggregation-methods)** such as visual Bag of Words (BoW, iBow), Vector of Locally Aggregated Descriptors (VLAD) and other modern **[global descriptors](#supported-global-descriptors-and-local-descriptor-aggregation-methods)** (image-wise descriptors).
- It collects many other useful VO and SLAM tools.

**Main Scripts**:
* `main_vo.py` combines the simplest VO ingredients without performing any image point triangulation or windowed bundle adjustment. At each step $k$, `main_vo.py` estimates the current camera pose $C_k$ with respect to the previous one $C_{k-1}$. The inter-frame pose estimation returns $[R_{k-1,k},t_{k-1,k}]$ with $\Vert t_{k-1,k} \Vert=1$. With this very basic approach, you need to use a ground truth in order to recover a correct inter-frame scale $s$ and estimate a valid trajectory by composing $C_k = C_{k-1} [R_{k-1,k}, s t_{k-1,k}]$ (see the sketch after this list). This script is a good starting point for understanding the basics of inter-frame feature tracking and camera pose estimation.

-* `main_slam.py` adds feature tracking along multiple frames, point triangulation, keyframe management and bundle adjustment in order to estimate the camera trajectory up-to-scale and build a map. It's still a VO pipeline but it shows some basic blocks which are necessary to develop a real visual SLAM pipeline.
+* `main_slam.py` adds feature tracking along multiple frames, point triangulation, keyframe management, bundle adjustment, loop closing and dense mapping in order to estimate the camera trajectory up-to-scale and build a map. It's a full SLAM pipeline that includes all the basic and advanced blocks necessary to develop a real visual SLAM system.

* `main_feature_matching.py` shows how to use the basic feature tracker capabilities (*feature detector* + *feature descriptor* + *feature matcher*) and allows you to test the different available local features. Further details [here](./docs/basic_architecture.md).
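
To make the scale-recovery composition $C_k = C_{k-1} [R_{k-1,k}, s\, t_{k-1,k}]$ concrete, here is a minimal numpy sketch (not pySLAM's actual code), with the scale $s$ recovered from consecutive ground-truth positions:

```python
import numpy as np

def compose_pose(C_prev, R_rel, t_rel, gt_prev, gt_cur):
    # Monocular VO only fixes ||t_rel|| = 1, so recover the inter-frame
    # scale s from consecutive ground-truth positions gt_{k-1}, gt_k.
    s = np.linalg.norm(gt_cur - gt_prev)
    T_rel = np.eye(4)
    T_rel[:3, :3] = R_rel     # R_{k-1,k}
    T_rel[:3, 3] = s * t_rel  # s * t_{k-1,k}
    return C_prev @ T_rel     # C_k = C_{k-1} [R_{k-1,k}, s t_{k-1,k}]
```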

@@ -89,7 +89,7 @@ Then, use the available specific install procedure according to your OS. The pro
* Kornia 0.7.3
* Rerun

-If you run into troubles or performance issues, check this [TROUBLESHOOTING](./TROUBLESHOOTING.md) file.
+If you encounter any issues or performance problems, refer to the [TROUBLESHOOTING](./TROUBLESHOOTING.md) file for assistance.


### Ubuntu
@@ -113,9 +113,9 @@ If you prefer docker or you have an OS that is not supported yet, you can use [r

### How to install non-free OpenCV modules

-The provided install scripts take care of installing a recent opencv version (>=**4.8**) with its non-free modules enabled (see for instance [install_pip3_packages.sh](./install_pip3_packages.sh), which is used with venv under Ubuntu, or the [install_opencv_python.sh](./install_opencv_python.sh) under mac).
+The provided install scripts will install a recent OpenCV version (>=**4.8**) with non-free modules enabled (see for instance [install_pip3_packages.sh](./install_pip3_packages.sh), which is used with venv under Ubuntu, or [install_opencv_python.sh](./install_opencv_python.sh) under macOS).

-How to check your installed OpenCV version:
+To verify your installed OpenCV version, use the following command:
`$ python3 -c "import cv2; print(cv2.__version__)"`

How to check if you have non-free OpenCV module support (no errors imply success):
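
For instance, you can try instantiating SURF, which lives in the non-free `xfeatures2d` module (an illustrative probe; the exact command may differ):
`$ python3 -c "import cv2; cv2.xfeatures2d.SURF_create()"`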
@@ -222,7 +222,7 @@ Some quick information about the non-trivial GUI buttons of `main_slam.py`:

### Monitor the logs for tracking, local mapping, and loop closing simultaneously

-The logs generated by the modules `local_mapping.py`, `loop_closing.py`, `loop_detecting_process.py`, and `global_bundle_adjustments.py` are collected in the files `local_mapping.log`, `loop_closing.log`, `loop_detecting.log`, and `gba.log`, respectively. For fun/debugging, you can monitor their parallel flows by running the command `tail -f <log file name>` in separate shells. Otherwise, just run the script `./scripts/launch_tmux_slam.sh` from the repo root folder.
+The logs generated by the modules `local_mapping.py`, `loop_closing.py`, `loop_detecting_process.py`, and `global_bundle_adjustments.py` are collected in the files `local_mapping.log`, `loop_closing.log`, `loop_detecting.log`, and `gba.log`, respectively, all stored in the `logs` folder. For fun/debugging, you can monitor their parallel flows by running the command `tail -f logs/<log file name>` in separate shells. Otherwise, just run the script `./scripts/launch_tmux_slam.sh` from the repo root folder; use `CTRL+A` and then `CTRL+Q` to exit.

---
## Supported local features
@@ -284,9 +284,9 @@ The following feature **descriptors** are supported:
* *[Xfeat](https://arxiv.org/abs/2404.19174)*
* *[KeyNetAffNetHardNet](https://github.com/axelBarroso/Key.Net)* (KeyNet detector + AffNet + HardNet descriptor).

-You can find further information in the file [feature_types.py](./feature_types.py). Some of the local features consist of a *joint detector-descriptor*. You can start playing with the supported local features by taking a look at `test/cv/test_feature_manager.py` and `main_feature_matching.py`.
+For more information, refer to the [feature_types.py](./feature_types.py) file. Some of the local features consist of a *joint detector-descriptor*. You can start playing with the supported local features by taking a look at `test/cv/test_feature_manager.py` and `main_feature_matching.py`.

-In both the scripts `main_vo.py` and `main_slam.py`, you can create your favourite detector-descriptor configuration and feed it to the function `feature_tracker_factory()`. Some ready-to-use configurations are already available in the file [feature_tracker_configs.py](./feature_tracker_configs.py).
+In both the scripts `main_vo.py` and `main_slam.py`, you can create your preferred detector-descriptor configuration and feed it to the function `feature_tracker_factory()`. Some ready-to-use configurations are already available in the file [feature_tracker_configs.py](./feature_tracker_configs.py).

The function `feature_tracker_factory()` can be found in the file `feature_tracker.py`. Take a look at the file `feature_manager.py` for further details.
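
A minimal sketch of this workflow (`FeatureTrackerConfigs` and the `num_features` key are assumptions about the contents of `feature_tracker_configs.py`; check that file for the actual names):

```python
from feature_tracker import feature_tracker_factory
from feature_tracker_configs import FeatureTrackerConfigs

# Copy one of the ready-to-use detector-descriptor configurations
# (copying avoids mutating the shared class-level dict).
tracker_config = dict(FeatureTrackerConfigs.ORB2)
tracker_config['num_features'] = 2000  # optionally override a parameter
feature_tracker = feature_tracker_factory(**tracker_config)
```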

@@ -391,7 +391,7 @@ In order to calibrate your camera, you can use the scripts in the folder `calibr
1. use the script `grab_chessboard_images.py` to collect a sequence of images where the chessboard can be detected (set the chessboard size therein; you can use the calibration pattern `calib_pattern.pdf` in the same folder)
2. use the script `calibrate.py` to process the collected images and compute the calibration parameters (set the chessboard size therein)

-For further information about the calibration process, you may want to have a look [here](https://opencv-python-tutroals.readthedocs.io/en/latest/py_tutorials/py_calib3d/py_calibration/py_calibration.html).
+For more information on the calibration process, see the tutorial [here](https://opencv-python-tutroals.readthedocs.io/en/latest/py_tutorials/py_calib3d/py_calibration/py_calibration.html).
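
Under the hood, the two steps above boil down to the standard OpenCV chessboard pipeline; a minimal sketch (the image folder and pattern size are hypothetical, and `calibrate.py` may differ in details):

```python
import glob

import cv2
import numpy as np

pattern = (9, 6)  # inner chessboard corners (match your printed pattern)
objp = np.zeros((pattern[0] * pattern[1], 3), np.float32)
objp[:, :2] = np.mgrid[0:pattern[0], 0:pattern[1]].T.reshape(-1, 2)

obj_points, img_points = [], []
image_size = None
for fname in glob.glob('calib_images/*.png'):  # hypothetical image folder
    gray = cv2.cvtColor(cv2.imread(fname), cv2.COLOR_BGR2GRAY)
    image_size = gray.shape[::-1]
    found, corners = cv2.findChessboardCorners(gray, pattern)
    if found:
        obj_points.append(objp)
        img_points.append(corners)

# Estimate the intrinsic matrix K and the distortion coefficients.
rms, K, dist, rvecs, tvecs = cv2.calibrateCamera(
    obj_points, img_points, image_size, None, None)
print('RMS reprojection error:', rms)
print('K =\n', K)
print('dist =', dist.ravel())
```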

If you want to **use your camera**, you have to:
* calibrate it and configure [WEBCAM.yaml](./settings/WEBCAM.yaml) accordingly
6 changes: 6 additions & 0 deletions bash_utils.sh
@@ -46,6 +46,12 @@ function install_package(){
fi
}

+function install_packages(){
+    for var in "$@"; do
+        install_package "$var"
+    done
+}
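
A hypothetical usage example: `install_packages build-essential cmake git` runs `install_package` once per argument (these package names are only illustrative).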

function check_pip_package(){
package_name=$1
PKG_OK=$(python3 -m pip list |grep $package_name)
2 changes: 1 addition & 1 deletion camera.py
@@ -85,7 +85,7 @@ def __init__(self, config):
        self.b = self.bf/self.fx
        if config.sensor_type == 'stereo' and self.bf is None:
            raise ValueError('Expecting the field Camera.bf in the camera config file')
-        self.depth_factor = 1.0 # Deptmap values factor
+        self.depth_factor = 1.0 # Depthmap values factor
        if 'DepthMapFactor' in config.cam_settings:
            self.depth_factor = 1.0/float(config.cam_settings['DepthMapFactor'])
            print('Using DepthMapFactor = %f' % self.depth_factor)
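
For illustration, with the TUM RGBD convention `DepthMapFactor: 5000`, the inversion above turns a raw 16-bit depth image into metric depth; a minimal sketch (not pySLAM's actual code):

```python
import numpy as np

def depth_to_meters(depth_raw, depth_map_factor=5000.0):
    # Mirror the logic above: depth_factor = 1/DepthMapFactor, and the raw
    # depth map is then multiplied by depth_factor to obtain meters.
    depth_factor = 1.0 / depth_map_factor
    return depth_raw.astype(np.float32) * depth_factor

depth_raw = np.array([[5000, 10000]], dtype=np.uint16)  # hypothetical raw values
print(depth_to_meters(depth_raw))  # [[1. 2.]] -> meters
```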
3 changes: 1 addition & 2 deletions clean.sh
@@ -44,6 +44,5 @@ cd thirdparty/pyibow
cd $SCRIPT_DIR


-# TODO
-# clean downloaded models
+# clean downloaded models and reset submodules
# git submodule foreach 'git reset --hard; git clean -xfd'
4 changes: 2 additions & 2 deletions convert_groundtruth.py
@@ -9,12 +9,12 @@
groundtruth_settings = {}
groundtruth_settings['type']='kitti'
groundtruth_settings['base_path'] ='/home/luigi/Work/datasets/rgbd_datasets/kitti/dataset'
-groundtruth_settings['name'] = '00'
+groundtruth_settings['name'] = '06'

def main(settings = groundtruth_settings, out_filename = 'groundtruth.txt'):
    print(f'converting {groundtruth_settings}')
    grountruth = groundtruth_factory(groundtruth_settings)
-    grountruth.convertToSimpleXYZ()
+    grountruth.convertToSimpleDataset()

if __name__ == '__main__':
    main()
3 changes: 2 additions & 1 deletion dataset.py
@@ -150,7 +150,8 @@ def getImageColor(self, frame_id):
            else:
                return img
        except:
-            img = None
+            img = None
+            self.is_ok = False
            #raise IOError('Cannot open dataset: ', self.name, ', path: ', self.path)
            Printer.red(f'Cannot open dataset: {self.name}, path: {self.path}')
            return img
10 changes: 6 additions & 4 deletions feature_matcher.py
@@ -43,10 +43,12 @@
XFeat = import_from('modules.xfeat', 'XFeat')
LightGlue = import_from('lightglue', 'LightGlue')

-import builtins as __builtin__
-logging_file=open('matcher.log','w')
-def print_to_file(*args, **kwargs):
-    return __builtin__.print(*args, **kwargs, file=logging_file, flush=True)

+kScriptPath = os.path.realpath(__file__)
+kScriptFolder = os.path.dirname(kScriptPath)
+kRootFolder = kScriptFolder
+kLogsFolder = kRootFolder + '/logs'



kRatioTest = Parameters.kFeatureMatchRatioTest
18 changes: 8 additions & 10 deletions frame.py
@@ -344,29 +344,25 @@ def __init__(self, camera: Camera, img, img_right=None, depth=None, pose=None, i
        self.outliers = None # outliers flags for map points (reset and set by pose_optimization())

        self.kf_ref = None # reference keyframe

+        self.img = None # image (copy of img if available)
+        self.img_right = None
+        self.depth_img = None # depth (copy of depth if available)

        if img is not None:
            #self.H, self.W = img.shape[0:2]
            if FrameShared.is_store_imgs:
                self.img = img.copy()
-            else:
-                self.img = None

        if img_right is not None:
            if FrameShared.is_store_imgs:
-                self.img_r = img_right.copy()
-            else:
-                self.img_r = None
+                self.img_right = img_right.copy()

        if depth is not None:
            if self.camera is not None and self.camera.depth_factor != 1.0:
                depth = depth * self.camera.depth_factor
-            if FrameShared.is_store_imgs:
-                self.depth_img = depth.copy()
-            else:
-                self.depth_img = None
+            self.depth_img = depth.copy()

        if img is not None:
            if img_right is not None:
@@ -454,7 +450,8 @@ def to_json(self):
            'kf_ref': self.kf_ref.id if self.kf_ref is not None else None,

            'img': json.dumps(NumpyB64Json.numpy_to_json(self.img)) if self.img is not None else None,
-            'depth_img': json.dumps(NumpyB64Json.numpy_to_json(self.depth_img)) if self.depth_img is not None else None
+            'depth_img': json.dumps(NumpyB64Json.numpy_to_json(self.depth_img)) if self.depth_img is not None else None,
+            'img_right': json.dumps(NumpyB64Json.numpy_to_json(self.img_right)) if self.img_right is not None else None
        }
return ret

@@ -492,6 +489,7 @@ def from_json(json_str):

        f.img = NumpyB64Json.json_to_numpy(json.loads(json_str['img'])) if json_str['img'] is not None else None
        f.depth_img = NumpyB64Json.json_to_numpy(json.loads(json_str['depth_img'])) if json_str['depth_img'] is not None else None
+        f.img_right = NumpyB64Json.json_to_numpy(json.loads(json_str['img_right'])) if json_str['img_right'] is not None else None

        if f.kps is not None and f.points is not None:
            #print(f'f.kps.shape = {f.kps.shape}, f.points.shape = {f.points.shape}')
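
For context on the (de)serialization above, here is a minimal sketch in the spirit of the `NumpyB64Json` helper, i.e., round-tripping a numpy array through base64-encoded JSON (the actual pySLAM implementation may differ):

```python
import base64
import json

import numpy as np

def numpy_to_json(a):
    # Record dtype, shape and the raw bytes (base64) so the array round-trips losslessly.
    return {'dtype': str(a.dtype), 'shape': list(a.shape),
            'data': base64.b64encode(np.ascontiguousarray(a).tobytes()).decode('ascii')}

def json_to_numpy(d):
    buf = base64.b64decode(d['data'])
    return np.frombuffer(buf, dtype=d['dtype']).reshape(d['shape'])

img = np.arange(12, dtype=np.uint8).reshape(4, 3)
assert (json_to_numpy(json.loads(json.dumps(numpy_to_json(img)))) == img).all()
```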
