
Use a Convolutional Neural Network (TensorFlow) to train a traffic sign classifier


uranus4ever/Traffic-Sign-Classifier-CNN


Build a Traffic Sign Recognition Project

The goals / steps of this project are the following:

  • Load the data set: the German Traffic Sign Dataset
  • Explore, summarize and visualize the data set
  • Design, train and test a convolutional neural network model architecture
  • Use the model to make predictions on new images
  • Analyze the softmax probabilities of the new images

Usage

See Traffic_Sign_Classifier-Copy1.ipynb for the Jupyter Notebook source code, or Traffic_Sign_Classifier-Copy1.html for browsing.

Data Set Summary & Exploration

I calculated summary statistics of the traffic sign data set:

  • The size of training set is 34799
  • The size of the validation set is 4410
  • The size of test set is 12630
  • The shape of a traffic sign image is 32x32x3
  • The number of unique classes/labels in the data set is 43

Visualization of the Traffic Sign data set:

TrafficSignExample

Design and Test a Model Architecture

1. Preprocess the image data.

My first step is to crop the margins of the image, which removes useless information and speeds up computation. Next I apply a Gaussian blur (cv2.GaussianBlur()) to suppress noise, as many of the images are blurry. Finally, histogram equalization (cv2.equalizeHist()) is used to normalize the image contrast. In contrast to the MNIST dataset, converting to grayscale is not a good technique here, because color carries important information as well.
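The crop and equalization steps above can be sketched in plain numpy. This is an illustrative re-implementation, not the notebook's code: the 2-pixel margin is inferred from the stated 32x32 → 28x28 shapes, `equalize_hist` mirrors the classic cv2.equalizeHist lookup-table algorithm applied per channel, and the Gaussian-blur step is only noted in a comment.

```python
import numpy as np

def equalize_hist(channel):
    """Histogram equalization for one uint8 channel (mirrors cv2.equalizeHist)."""
    hist = np.bincount(channel.ravel(), minlength=256)
    cdf = hist.cumsum()
    cdf_min = cdf[cdf > 0][0]           # first non-zero CDF value
    lut = np.clip(np.round((cdf - cdf_min) * 255.0 / (channel.size - cdf_min)),
                  0, 255).astype(np.uint8)
    return lut[channel]                 # remap every pixel through the table

def preprocess(img, margin=2):
    """Crop a 2-pixel margin (32x32 -> 28x28) and equalize each colour channel."""
    img = img[margin:-margin, margin:-margin]
    # the notebook also applies cv2.GaussianBlur here before equalizing
    return np.dstack([equalize_hist(img[..., c]) for c in range(img.shape[-1])])
```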

I also noticed that the distribution of samples across classes is far from even: class counts vary from fewer than 200 to more than 2000. Too small a training sample for a class will cause underfitting on that class, so I created new data with rotate_img for every class with fewer than 400 samples. As a result, I added 5130 new images to the training set.
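A minimal sketch of this augmentation step, under stated assumptions: the notebook's rotate_img is not shown, so the nearest-neighbour rotation below and the ±15° angle range are illustrative stand-ins; only the 400-sample threshold comes from the text.

```python
import numpy as np

def rotate_img(img, angle_deg):
    """Rotate about the image centre by angle_deg, nearest-neighbour sampling."""
    h, w = img.shape[:2]
    cy, cx = (h - 1) / 2.0, (w - 1) / 2.0
    t = np.deg2rad(angle_deg)
    ys, xs = np.mgrid[0:h, 0:w]
    # inverse mapping: for each output pixel, find the source pixel it came from
    src_x = np.cos(t) * (xs - cx) + np.sin(t) * (ys - cy) + cx
    src_y = -np.sin(t) * (xs - cx) + np.cos(t) * (ys - cy) + cy
    src_x = np.clip(np.round(src_x).astype(int), 0, w - 1)
    src_y = np.clip(np.round(src_y).astype(int), 0, h - 1)
    return img[src_y, src_x]

def augment_rare_classes(X, y, min_count=400, rng=None):
    """Top up every class below min_count with randomly rotated copies."""
    if rng is None:
        rng = np.random.default_rng(0)
    X_new, y_new = [], []
    for cls, n in enumerate(np.bincount(y)):
        if 0 < n < min_count:
            idx = np.flatnonzero(y == cls)
            for _ in range(min_count - n):
                i = rng.choice(idx)
                X_new.append(rotate_img(X[i], rng.uniform(-15.0, 15.0)))
                y_new.append(cls)
    return X_new, y_new
```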

Processdata

  • The size of updated training set is 39929
  • The shape of a processed traffic sign image is 28x28x3

2. My final model consists of the following layers:

| Layer | Description |
|-------|-------------|
| Input | 28x28x1 image |
| 1 | Convolution 5x5, 1x1 stride, valid padding, outputs 24x24x8 |
|   | RELU activation |
|   | Max pooling, 2x2 stride, outputs 12x12x8 |
| 2 | Convolution 3x3, 1x1 stride, valid padding, outputs 10x10x20 |
|   | RELU activation |
|   | Max pooling, 2x2 stride, outputs 5x5x20 |
| 3 | Convolution 2x2, 1x1 stride, valid padding, outputs 4x4x60 |
|   | RELU activation |
|   | Max pooling, 2x2 stride, outputs 2x2x60 |
|   | Flatten, outputs 240 |
| 4 | Fully connected, outputs 160, with dropout |
|   | RELU activation |
| 5 | Fully connected, outputs 80 |
|   | RELU activation |
| 6 | Fully connected, outputs 43 |

3. How to train my model.

To train the model, I used the Adam optimizer with a batch size of 128, 30 epochs, and a learning rate of 0.001.

```python
optimizer = tf.train.AdamOptimizer(learning_rate=0.001)
training_operation = optimizer.minimize(loss_operation)
```

Then feed data to the training model batch by batch.

```python
for offset in range(0, num_examples, BATCH_SIZE):
    end = offset + BATCH_SIZE
    batch_x, batch_y = X_train[offset:end], y_train[offset:end]
    sess.run(training_operation, feed_dict={x: batch_x, y: batch_y, keep_prob: 0.5})
```
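A pure-Python sketch of why this slicing covers the whole set: slicing past the end of a sequence is truncated rather than an error, so the final batch is simply shorter (39929 examples with a batch size of 128 leaves a last batch of 121).

```python
num_examples, BATCH_SIZE = 39929, 128
batch_sizes = []
for offset in range(0, num_examples, BATCH_SIZE):
    end = min(offset + BATCH_SIZE, num_examples)   # what the slice actually yields
    batch_sizes.append(end - offset)
# every training sample is visited exactly once per epoch
total_seen = sum(batch_sizes)
```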

4. Model results

My final model results were:

  • validation set accuracy of 95.7%
  • test set accuracy of 93.4%

An iterative approach was chosen:

What was the first architecture that was tried and why was it chosen?

I chose LeNet because it targets a similar application scenario: training on a labeled image data set to learn to classify categories.

What were some problems with the initial architecture?

The plain LeNet architecture underfits on this project's data.

How was the architecture adjusted and why was it adjusted?

I added one more layer group (convolution plus max pooling) to make the architecture deeper, because the initial model's accuracy was around 83%, indicating underfitting.

Which parameters were tuned? How were they adjusted and why?

I tuned the filter height, width, and depth. Too large a filter size makes training inefficient, whereas too small a size leads to underfitting.

What are some of the important design choices and why were they chosen? For example, why might a convolution layer work well with this problem? How might a dropout layer help with creating a successful model?

I considered 1x1 convolutions, dropout, and max pooling to improve model performance. A 1x1 convolution is an inexpensive way to make the model deeper and give it more parameters; however, it did not make my model more effective. Dropout is a regularization technique: it makes the network more robust and prevents overfitting. Since the training set is large, I do not worry about discarding redundant details, and dropout really helps my model. Max pooling carries no risk of increased overfitting; it makes the model more accurate, though more expensive to compute.
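The dropout behaviour described above can be sketched in a few lines of numpy. This is an illustrative "inverted dropout" forward pass, not the TensorFlow internals: surviving units are rescaled by 1/keep_prob so the expected activation is unchanged, which is why keep_prob is set to 0.5 during training and 1.0 at evaluation time.

```python
import numpy as np

def dropout(a, keep_prob, rng):
    """Inverted dropout: zero each unit with probability 1 - keep_prob,
    and rescale survivors by 1/keep_prob to preserve the expected activation."""
    mask = rng.random(a.shape) < keep_prob
    return np.where(mask, a / keep_prob, 0.0)
```

With keep_prob = 1.0 this is a no-op, which matches feeding keep_prob: 1.0 when evaluating the model.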

The following figure summarizes the comparison. Model_Comparison

Finally, I applied dropout and max pooling in my model.

Test a Model on New Images

1. Test the model with new images and analyze performance.

Test images are kept separate from the training and validation data so that the test accuracy reflects real generalization. My test set accuracy is 93.4%.

2. Choose German traffic signs found on the web and provide them in the report. For each image, discuss what quality or qualities might be difficult to classify.

Some images might be difficult to classify because they are too dark and low in contrast, making it hard for the model to extract the features needed to classify them correctly.

Reflection

While tuning this project, I found that fine-tuning the (hyper)parameters consumes a lot of time, and tuning a model from scratch for every new classification project would be tedious. Transfer learning can work magic here, saving repetitive effort while still achieving satisfactory results.
