This project is the twelfth task of the Udacity Self-Driving Car Nanodegree program. The main goal of the project is to train an artificial neural network for semantic segmentation of video from a front-facing camera on a car, marking road pixels, using TensorFlow.
KITTI Road segmentation (main task of the project):
Cityscapes multiclass segmentation (optional task):
Segmentation.ipynb - Jupyter notebook with the main code for the project.
helper.py - Python program for image pre- and post-processing.
runs - directory with processed images.
cityscapes.ipynb - Jupyter notebook with some visualization and preprocessing of the Cityscapes dataset. Please see the notebook for correct dataset directory placement.
Segmentation_cityscapes.ipynb - Jupyter notebook with the main code for the Cityscapes dataset.
helper_cityscapes.py - Python program for image pre- and post-processing for the Cityscapes dataset.
Note: The repository does not contain any training images. You have to download the image datasets and place them in the appropriate directories on your own.
A Fully Convolutional Network (the FCN-8 architecture developed at Berkeley, see the paper) was applied for the project. It uses VGG16 pretrained on ImageNet as an encoder. A decoder upsamples the features extracted by the VGG16 model back to the original image size; it is built from transposed convolution layers.
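Below is a minimal sketch of such a decoder, assuming TensorFlow 1.x and the VGG16 tensors exposed by the Udacity starter code (layer3_out, layer4_out, layer7_out); the kernel sizes and regularization scale are illustrative assumptions rather than the exact values used in Segmentation.ipynb.

import tensorflow as tf

def fcn8_decoder(layer3_out, layer4_out, layer7_out, num_classes):
    # Assumed hyperparameters, not the exact notebook values
    reg = tf.contrib.layers.l2_regularizer(1e-3)
    init = tf.random_normal_initializer(stddev=0.01)

    # 1x1 convolutions reduce each VGG feature map to num_classes channels
    score7 = tf.layers.conv2d(layer7_out, num_classes, 1,
                              kernel_initializer=init, kernel_regularizer=reg)
    score4 = tf.layers.conv2d(layer4_out, num_classes, 1,
                              kernel_initializer=init, kernel_regularizer=reg)
    score3 = tf.layers.conv2d(layer3_out, num_classes, 1,
                              kernel_initializer=init, kernel_regularizer=reg)

    # 2x upsampling with a transposed convolution, then a skip connection from pool4
    up7 = tf.layers.conv2d_transpose(score7, num_classes, 4, strides=2, padding='same',
                                     kernel_initializer=init, kernel_regularizer=reg)
    fuse4 = tf.add(up7, score4)

    # Another 2x upsampling, skip connection from pool3
    up4 = tf.layers.conv2d_transpose(fuse4, num_classes, 4, strides=2, padding='same',
                                     kernel_initializer=init, kernel_regularizer=reg)
    fuse3 = tf.add(up4, score3)

    # Final 8x upsampling back to the input resolution
    return tf.layers.conv2d_transpose(fuse3, num_classes, 16, strides=8, padding='same',
                                      kernel_initializer=init, kernel_regularizer=reg)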
The goal is to assign each pixel of the input image to the appropriate class (road, background, etc.). Since this is a per-pixel classification problem, cross-entropy loss was applied.
Hyperparameters were chosen by trial and error. The Adam optimizer was used as a well-established optimizer. Weights were initialized with a random normal initializer. L2 weight regularization showed some benefits, so it was applied in order to reduce grainy mask edges.
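As a sketch (assuming TensorFlow 1.x; the learning rate and the way the regularization losses are combined are assumptions, not the exact notebook code), the loss and training op look roughly like this:

import tensorflow as tf

def build_train_op(logits, labels, num_classes, learning_rate=1e-4):
    # Flatten to (pixels, classes) so every pixel is one classification example
    logits = tf.reshape(logits, (-1, num_classes))
    labels = tf.reshape(labels, (-1, num_classes))

    # Per-pixel cross-entropy loss
    cross_entropy = tf.reduce_mean(
        tf.nn.softmax_cross_entropy_with_logits_v2(labels=labels, logits=logits))

    # Add the L2 penalties collected from the kernel_regularizer arguments
    total_loss = cross_entropy + tf.losses.get_regularization_loss()

    train_op = tf.train.AdamOptimizer(learning_rate).minimize(total_loss)
    return total_loss, train_op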
Resized input images were also processed with random contrast and brightness augmentation (a linear function of the input image, see bc_img below). This helps to produce reasonable predictions in difficult lighting conditions.
import numpy as np

def bc_img(img, s=1.0, m=0.0):
    """Linear brightness/contrast augmentation: out = img * s + m."""
    img = img.astype(np.float32)   # avoid uint8 overflow/underflow during the linear transform
    img = img * s + m
    img = np.clip(img, 0, 255)     # keep values in the valid 8-bit range
    return img.astype(np.uint8)
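A hypothetical usage during batch generation (the sampling ranges below are illustrative assumptions, not the exact values from helper.py): draw a random contrast factor and brightness offset for each image.

image = np.random.randint(0, 256, (160, 576, 3), dtype=np.uint8)  # stand-in for a resized training frame
contrast = np.random.uniform(0.85, 1.15)   # multiplicative contrast factor s
brightness = np.random.uniform(-40, 40)    # additive brightness offset m
augmented = bc_img(image, s=contrast, m=brightness)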
Deep shadows and contrast variations are not a problem thanks to rich augmentation during the training stage.
Two classes (roads and cars) were chosen from the Cityscapes dataset for the optional task. The classes are unbalanced (roads are prevalent), so a weighted loss function was used (see Segmentation_cityscapes.ipynb for details). Interestingly, the RMSProp optimizer performed better on this dataset.
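A minimal sketch of such a class-weighted loss with RMSProp, assuming TensorFlow 1.x and one-hot float labels; the per-class weights below are placeholders, not the values actually tuned in Segmentation_cityscapes.ipynb:

import tensorflow as tf

def weighted_train_op(logits, labels, learning_rate=1e-4):
    # Placeholder weights for (background, road, car); rarer classes get larger weights
    class_weights = tf.constant([0.5, 1.0, 5.0])

    # Per-pixel weight selected by the one-hot label
    pixel_weights = tf.reduce_sum(labels * class_weights, axis=-1)
    per_pixel_loss = tf.nn.softmax_cross_entropy_with_logits_v2(labels=labels, logits=logits)
    loss = tf.reduce_mean(per_pixel_loss * pixel_weights)

    train_op = tf.train.RMSPropOptimizer(learning_rate).minimize(loss)
    return loss, train_op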
Unfortunately, according to the Cityscapes dataset license, I cannot publish all produced images; however, here are some of them.
The network correctly does not label a cyclist as a car, but it does mark small, partly occluded cars.
It has no problem recognizing a cobbled street as a road.
And the ANN is able to mark cars seen from different viewpoints.
References:
- KITTI dataset
- Cityscapes dataset
- FCN ANN
_____________________ Udacity Readme.md ____________________
Make sure you have the following installed:
Download the Kitti Road dataset from here. Extract the dataset in the data folder. This will create the folder data_road with all the training and test images.
Implement the code in the main.py module indicated by the "TODO" comments.
Comments marked with the "OPTIONAL" tag are not required to complete the project.
Run the following command to run the project:
python main.py
Note: If running this in a Jupyter Notebook, system messages, such as those regarding test status, may appear in the terminal rather than in the notebook.
- Ensure you've passed all the unit tests.
- Ensure you pass all points on the rubric.
- Submit the following in a zip file:
  - helper.py
  - main.py
  - project_tests.py
  - Newest inference images from the runs folder
A well-written README file can enhance your project and portfolio. Develop your ability to create professional README files by completing this free course.