Traffic Sign Recognition Writeup
Build a Traffic Sign Recognition Project
The goals / steps of this project are the following:
- Load the data set (see below for links to the project data set)
- Explore, summarize and visualize the data set
- Design, train and test a model architecture
- Use the model to make predictions on new images
- Analyze the softmax probabilities of the new images
- Summarize the results with a written report
Rubric Points:
The rubric points covered here were the basis for my approach to this project.
1. Preprocessing: To reduce the amount of image data and ease training, I used two preprocessing methods: grayscaling and normalization. I chose these methods for their simplicity and effectiveness.
Grayscaling: Each RGB image is collapsed to a single gray channel, reducing the input depth from 3 to 1.
Normalization:
Normalization was as simple as taking the pixel values of each image and applying the formula (pixel - 128) / 128. This scales the pixel values to roughly the range [-1, 1] without changing the content of the image.
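A minimal sketch of these two preprocessing steps, assuming the images are stored as a numpy array of shape (N, 32, 32, 3); the function and variable names are illustrative rather than the project's actual code, and the grayscale conversion here is a simple channel average:

```python
import numpy as np

def preprocess(images):
    """Grayscale and normalize a batch of RGB images.

    images: numpy array of shape (N, 32, 32, 3) with uint8 pixel values.
    Returns an array of shape (N, 32, 32, 1) with values roughly in [-1, 1].
    """
    # Collapse the three color channels into one by averaging them.
    gray = np.mean(images, axis=3, keepdims=True)
    # Center and scale the pixel values: (pixel - 128) / 128.
    return (gray - 128.0) / 128.0

# Example usage (X_train is an assumed variable name):
# X_train = preprocess(X_train)
```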
Data Set Summary & Exploration
I used the numpy library and Python's built-in len() function to calculate summary statistics of the traffic sign data set (a short sketch follows the list):
- Number of training examples = 34799
- Number of testing examples = 12630
- Number of validation examples = 4410
- Image data shape = (32, 32, 3)
- Number of classes = 43
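These statistics only take a few lines; a sketch, assuming the usual X_train / X_valid / X_test / y_train variable names from the project template:

```python
import numpy as np

n_train = len(X_train)                  # 34799
n_valid = len(X_valid)                  # 4410
n_test = len(X_test)                    # 12630
image_shape = X_train[0].shape          # (32, 32, 3)
n_classes = len(np.unique(y_train))     # 43

print("Number of training examples =", n_train)
print("Number of testing examples =", n_test)
print("Number of validation examples =", n_valid)
print("Image data shape =", image_shape)
print("Number of classes =", n_classes)
```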
Visual Exploration
I created bar graphs showing the number of signs per class in each of the data sets. I chose bar graphs because they are easy to read while still giving a helpful visual sense of how the data is distributed across classes.
Training Graph
Validation Graph
Testing Graph
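A sketch of how such a bar graph can be produced with matplotlib, assuming the label arrays are named y_train, y_valid, and y_test:

```python
import numpy as np
import matplotlib.pyplot as plt

def plot_class_counts(labels, title):
    """Bar graph of the number of signs per class for one data set."""
    counts = np.bincount(labels, minlength=43)  # 43 traffic sign classes
    plt.figure(figsize=(10, 4))
    plt.bar(np.arange(43), counts)
    plt.xlabel("Class ID")
    plt.ylabel("Number of signs")
    plt.title(title)
    plt.show()

plot_class_counts(y_train, "Training set")
plot_class_counts(y_valid, "Validation set")
plot_class_counts(y_test, "Test set")
```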
Design and Test a Model Architecture
The architecture I used was a modified LeNet-5 - Link. It is perhaps a simplistic model, but with a few added touches it still achieves relatively high accuracy.
The final model I decided on was:
Layer | Description |
---|---|
Input | 32x32x1 grayscale, normalized image |
Convolution 5x5 | 1x1 stride, Valid padding, outputs 28x28x6 |
RELU | |
Max pooling | 2x2 stride, outputs 14x14x6 |
Convolution 5x5 | 1x1 stride, outputs 10x10x16 |
RELU | |
Max pooling | 2x2 stride, outputs 5x5x16 |
Flatten | Outputs 400 |
Fully connected | In 400, Out 120 |
RELU | |
Dropout | Keep probability 0.5 |
Fully connected | In 120, Out 84 |
RELU | |
Dropout | Keep probability 0.5 |
Fully connected | In 84, Out 43 (number of classes) |
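The table translates roughly to the sketch below. The original project was written against lower-level TensorFlow, so this tf.keras outline is an equivalent, not the actual project code (note that Keras Dropout takes a drop rate, which at 0.5 matches the 0.5 keep probability used here):

```python
import tensorflow as tf

def modified_lenet(n_classes=43):
    """Modified LeNet-5: 32x32x1 grayscale input, two conv/pool blocks,
    three fully connected layers with dropout after the first two."""
    return tf.keras.Sequential([
        tf.keras.layers.Conv2D(6, 5, strides=1, padding="valid",
                               activation="relu",
                               input_shape=(32, 32, 1)),     # 28x28x6
        tf.keras.layers.MaxPool2D(pool_size=2, strides=2),   # 14x14x6
        tf.keras.layers.Conv2D(16, 5, strides=1, padding="valid",
                               activation="relu"),           # 10x10x16
        tf.keras.layers.MaxPool2D(pool_size=2, strides=2),   # 5x5x16
        tf.keras.layers.Flatten(),                           # 400
        tf.keras.layers.Dense(120, activation="relu"),
        tf.keras.layers.Dropout(0.5),
        tf.keras.layers.Dense(84, activation="relu"),
        tf.keras.layers.Dropout(0.5),
        tf.keras.layers.Dense(n_classes),                    # logits, one per class
    ])
```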
Training The Model
To train the model, I used the Adam optimizer from the tensorflow library. I found it to be an adequate optimizer given a well-chosen learning rate. Through testing, I found that the learning rate was the most crucial parameter for the accuracy of my model.
The generalized results are as follows:
Learning Rate | Validation Accuracy |
---|---|
> 0.01 | Very poor. Usually stuck at less than 1% |
< 0.01 | Good. Diminishing returns if too low. |
The final parameters I landed on are as follows (a training sketch follows this list):
- Epochs = 100
May be a bit high; results tend to stagnate towards the end.
- Batch Size = 1000
Quick with a 2GB NVIDIA GPU.
- Learn Rate = 0.002
The lucky number I came to after many tunings.
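Put together, the training setup looks roughly like this (again in tf.keras for brevity, reusing the modified_lenet sketch above; X_train, y_train, X_valid, and y_valid are assumed to be the preprocessed data and labels):

```python
import tensorflow as tf

model = modified_lenet()
model.compile(
    optimizer=tf.keras.optimizers.Adam(learning_rate=0.002),  # Learn Rate = 0.002
    loss=tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True),
    metrics=["accuracy"],
)
model.fit(
    X_train, y_train,
    epochs=100,        # Epochs = 100
    batch_size=1000,   # Batch Size = 1000
    validation_data=(X_valid, y_valid),
)
```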
My final model results were:
- validation set accuracy of 96.281%
- test set accuracy of 94.3%
Improving the Model Iteratively:
- The first model I used was a straight LeNet5 with no modifications, to achieve a baseline of how well it would perform with the dataset. This was also before any image normalization.
- The problem I ran into was that the model was stuck at a low accuracy, hovering around 10% or less. Even after tuning all the parameters, I could only reach a validation accuracy of under 50%.
- From here, I modified the model to use grayscaled and normalized images.
- Next, I retested the model on the validation set, only to find that the validation accuracy was still low.
- From here I realized that my model was overfitting. To counter this, I added two dropout layers with a 50% keep probability. After tuning my model's learning rate, I finally achieved an acceptable validation accuracy of 96.281%.
Testing The Model On New Images
The 5 new images I used for testing:
Image Quality: Here Comes The Fuzz:
A difficulty I faced with these images is that some are quite large, even after being cropped from their original size. Resizing these large images down to only 32x32 compresses them heavily, which may have made it hard for the network to discern the characteristics of these specific images.
Comparing these images to their originals above, it can be seen that resizing vastly changes their data.
How Resizing to 32x32 Changes An Image Numerically
Sign (original size in pixels) | Pixel loss in X after resize to 32x32 | Pixel loss in Y after resize to 32x32 |
---|---|---|
Speed limit (30km/h) - 122 x 119 | ~73.77% | ~73.11% |
Yield - 297 x 263 | ~89.23% | ~87.83% |
Stop - 498 x 455 | ~93.57% | ~92.97% |
Speed limit (70km/h) - 56 x 52 | ~42.86% | ~38.46% |
Speed limit (120km/h) - 119 x 100 | ~73.11% | ~68.0% |
Looking at these losses, it's easy to see how the network may have difficulty identifying these images. For example, the stop sign image lost over 93% of its original pixels along X and almost 93% along Y. These high-loss resized images could contribute to my findings on prediction accuracy below.
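The losses above simply compare each original dimension with the fixed 32x32 input size, e.g. (498 - 32) / 498 ≈ 93.57% for the stop sign's width. A sketch of the resizing step itself using Pillow (the file name is a placeholder, not one of the project's files):

```python
import numpy as np
from PIL import Image

img = Image.open("stop_sign.jpg")            # e.g. 498 x 455 pixels
small = img.resize((32, 32), Image.LANCZOS)  # antialiased downscale to 32x32
x = np.asarray(small)                        # shape (32, 32, 3), ready to preprocess

# Fraction of pixels lost along each axis:
loss_x = (img.width - 32) / img.width        # ~0.9357 for the stop sign
loss_y = (img.height - 32) / img.height      # ~0.9297
```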
Image Qualities
Resolution/Distortion: As mentioned above, resizing the images reduces the resolution or overall image data to a very small percentage. This is probably the most limiting factor when it comes to predicting these images.
Brightness: Good, very bright. These images were also normalized with the same method as the other image sets.
Viewing Angle: The angle that the signs are at should be fine for prediction. They are relatively forward facing.
Edges: I made sure to crop the images as tightly as possible to reduce outer edge noise, though some extra image data remains.
The results I got from my predictions were as follows:
Image | Prediction | Correct |
---|---|---|
Speed limit (30km/h) | Speed limit (30km/h) | Yes |
Yield | Yield | Yes |
Stop | Speed limit (30km/h) | No |
Speed limit (70km/h) | Speed limit (30km/h) | No |
Speed limit (120km/h) | Speed limit (120km/h) | Yes |
The model was able to correctly guess 3 of the 5 traffic signs, which gives an accuracy of 60%.
Comparing These Results To Test Set Results
Test Set Accuracy | Prediction Accuracy | Difference |
---|---|---|
94.3% | 60% | - 34.3% |
Compared to my test set accuracy, the model performed unfavorably on the new images, with a 34.3% drop in accuracy. I feel this could be due to how lossy the images were once compressed down to 32x32. If I were to select images again, I would try to find ones with a smaller original size, to reduce the amount of compression. Using different resizing methods in Python, such as antialiasing, seemed to make no difference in accuracy.
The model did not do as well as I had hoped, but looking at its probability distributions, the correct prediction is within the top 5 probabilities for each misclassified image.
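A top-5 breakdown like the tables below can be produced with TensorFlow's softmax and top_k operations; a sketch, assuming `logits` holds the model's outputs for the five new images:

```python
import tensorflow as tf

# Convert logits to class probabilities, then keep the five most likely classes.
probabilities = tf.nn.softmax(logits)
top5 = tf.math.top_k(probabilities, k=5)

# Print each image's top-5 classes and probabilities (as percentages).
for probs, classes in zip(top5.values.numpy(), top5.indices.numpy()):
    for p, c in zip(probs, classes):
        print(f"Class {c}: {p:.10%}")
    print()
```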
The probabilities for each new sign are as follows:
Image 1: Speed limit (30km/h)
Prediction | Probability |
---|---|
Speed limit (30km/h) | 100.0000000000% |
Speed limit (50km/h) | 0.0000000000% |
Speed limit (70km/h) | 0.0000000000% |
Stop | 0.0000000000% |
Speed limit (60km/h) | 0.0000000000% |
Image 2: Yield
Prediction | Probability |
---|---|
Yield | 100.0000000000% |
Speed limit (20km/h) | 0.0000000000% |
Speed limit (30km/h) | 0.0000000000% |
Speed limit (50km/h) | 0.0000000000% |
Speed limit (60km/h) | 0.0000000000% |
Image 3: Stop
Prediction | Probability |
---|---|
Speed limit (30km/h) | 99.9953508377% |
Stop | 0.0033806267% |
Yield | 0.0012675282% |
Speed limit (50km/h) | 0.0000002531% |
Keep left | 0.0000000453% |
Image 4: Speed limit (70km/h)
Prediction | Probability |
---|---|
Speed limit (30km/h) | 99.9999403954% |
Speed limit (60km/h) | 0.0000351977% |
Stop | 0.0000248258% |
Speed limit (70km/h) | 0.0000027374% |
Turn right ahead | 0.0000000031% |
Image 5: Speed limit (120km/h)
Prediction | Probability |
---|---|
Speed limit (120km/h) | 73.9777505398% |
Speed limit (20km/h) | 22.8198796511% |
Speed limit (30km/h) | 2.8353558853% |
Go straight or right | 0.2367284149% |
Keep right | 0.0944264291% |
Conclusion:
Overall, I am pleased with the testing and validation accuracy. I am a little surprised that my new images did not do as well, but the features are being recognized, as evidenced by the correct sign appearing in the top 5 probabilities for each image. This was a cool exploration into the world of neural networks, and I hope to use what I learned here to make even better models in the future.