Skip to content

This repo showcase the ENPM673: Perception for Autonomous robots final project. A vision transformer (ViT) architecture SegFormer, has been replicated for implementing semantic segmentation. Furthermore, it was deployed on raspberry pi with pi cam setup for validating the real-time performance.

Notifications You must be signed in to change notification settings

Sapienscoding/Semantic-Segmenter

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 

Repository files navigation

Semantic Segmentation Using Transformers

Segformers architecture has been implement from scratch in pytorch

Dependencies

  • torch
  • matplotlib
  • einops
  • albumentations
  • cityscapesscripts
  • opencv
  • torchmetrics
  • pillow

steps to train and perform inference

  • Enter into segformers directory
  • add the path to the dataset in the main file, for the variable root_dir
  • Uncomment the training code snippet and run the main.py file
  • The weights will be stored in a folder called weights
  • For inference testing, comment the training snippet and uncomment the testing snippet.
  • Provide the path to the weights and run the main.py file

steps to perform inference using raspberry pi

note: both raspberry pi and laptop should be in the same network which work on master/slave configuration, both should have the same version of ros
  • In the src folder, inside utils, provide the path to the weights.
  • build the package
  • make sure roscore is running on the master, and images are being published by the slave.
  • On the master system, execute rosrun cam_sub_node cam_sub_file.py
Link to the test inference
Note: if you just want to run the inference, please contact the authors for weight file, as github does not allow large files to be uploaded
Authors

About

This repo showcase the ENPM673: Perception for Autonomous robots final project. A vision transformer (ViT) architecture SegFormer, has been replicated for implementing semantic segmentation. Furthermore, it was deployed on raspberry pi with pi cam setup for validating the real-time performance.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 85.3%
  • CMake 14.7%