Skip to content

Extended Mask R-CNN for RGB-D object instance segmentation based on Matterport's implementation. Segmentation accuracy improved by up to 31% using the additional depth input channel.

License

Notifications You must be signed in to change notification settings

orestis-z/mask-rcnn-rgbd

 
 

Repository files navigation

Object Instance Segmentation from RGB-D Data with Mask R-CNN

This is an extended implementation of matterport's Mask R-CNN implementation on Python 3, Keras, and TensorFlow that supports RGB-D input data. The model generates bounding boxes and segmentation masks for each instance of an object in the RGB(-D) image. It's based on Feature Pyramid Network (FPN) and a ResNet101 backbone.

In our experiments we showed that an additional depth input layer can improve the segmentation accuracy of Mask R-CNN by up to 31%.

Links: Presentation, Paper (unofficial CoRL 2018 submission)

RGB-D Instance Segmentation Sample

Training and evaluation scripts for the 2D-3D-S, ADE20K, Coco, NYU Depth V2, sceneNet and sceneNN datasets can be found under the instance_segmentation directory:

  • dataset.py: Hyperparameters config, interface to datasets

  • train.py: Training script

  • eval.ipynb: IPython notebook for the evaluation of the dataset & training

Please refer to Matterport's original implementation for the original documentation and more.

About

Extended Mask R-CNN for RGB-D object instance segmentation based on Matterport's implementation. Segmentation accuracy improved by up to 31% using the additional depth input channel.

Topics

Resources

License

Stars

Watchers

Forks

Languages

  • Jupyter Notebook 97.5%
  • Python 2.4%
  • Other 0.1%