Learning Team

This branch contains the perception and deep learning team code for the sp22Robot. This includes training scripts for the instance segmentation and object detection models as well as other utilities and the main script that ran during robot operation.

Hardware Requirements for perception:

Realsense camera with RGB and Depth sensors
Nvidia Jetson Nano

Software Requirements for perception

Jetpack 4.4
PyRealSense2 built from source for ARM 64
OpenCV built with cuda (see https://github.com/mdegans/nano_build_opencv)
CVU https://github.com/BlueMirrors/cvu

Dataset

The dataset consist of images of metal plates to be welded captured using the RGB sensor from Realsense D455 and D435 cameras from Intel. The object detection dataset can be found at https://app.roboflow.com/stuti-garg-oqsc8/robotics-jkowd/5

Model Training

Both the obeject detection and instance segmentation models were trainied on Google Colab Pro using an Nvidia Tesla V100 using the official training script from https://github.com/ultralytics/yolov5

Approach

Object Detection

During the developement process for the welding joint detection we explored using both instance segmentation and object detection and while both models were trained on the custom dataset, ultimately the object detection using YOLO v5 was chosen as the final model to identify the welding joints. Once the joints are identified color thresholding is used to identify the seam and the Realsense D455 camera's depth module is used to augment the detections with the 3d location of the seam so that this information can be passed on to the robotic arm using MQTT.

Deployment to the Nvidia Jetson Nano

The YOLO v5 model was deployed to the Jetson Nano using two approaches. Before the model could be deployed, it was trainend using the training script provided by https://github.com/ultralytics/yolov5 and the model weights were exported in the ONNX format, then the first approach used OpenCV and the it's DNN module in python to deploy the YOLO v5 model loaded in as an ONNX model based on code from https://github.com/doleron/yolov5-opencv-cpp-python. Using this implementation, the performace averaged around 2 FPS and while this proved to be enough for the detections as long as the robotic arm moved slowly, a more efficient implementation using TensorRT was ultimately used based on the implementation found at https://github.com/BlueMirrors/Yolov5-TensorRT. This implementation saw a performance of about 6-7 FPS on the Nano

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
models		models
preprocessing		preprocessing
CV_Final.py		CV_Final.py
README.md		README.md
TRT_final.py		TRT_final.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Learning Team

This branch contains the perception and deep learning team code for the sp22Robot. This includes training scripts for the instance segmentation and object detection models as well as other utilities and the main script that ran during robot operation.

Hardware Requirements for perception:

Software Requirements for perception

Dataset

Model Training

Approach

Object Detection

Deployment to the Nvidia Jetson Nano

About

Releases

Packages

Contributors 3

Languages

Derik-F-M-S/DL-Autonomous-Welding

Folders and files

Latest commit

History

Repository files navigation

Learning Team

This branch contains the perception and deep learning team code for the sp22Robot. This includes training scripts for the instance segmentation and object detection models as well as other utilities and the main script that ran during robot operation.

Hardware Requirements for perception:

Software Requirements for perception

Dataset

Model Training

Approach

Object Detection

Deployment to the Nvidia Jetson Nano

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages