Here I will consider the rubric points individually and describe how I addressed each point in my implementation.
My project includes the following files:
- model.py containing the script to create and train the model
- model.h5 containing a trained convolutional neural network
- writeup_report.md summarizing the results
- run1.mp4 video showcasing an autonomous lap of track 1
Using the Udacity provided simulator and my drive.py file, the car can be driven autonomously around the track by executing:

```sh
python drive.py model.h5
```
The 'Behavioural cloning.ipynb' Jupyter notebook contains the code for training and saving the convolutional neural network. The file shows the pipeline I used for training and validating the model, and it contains comments explaining how the code works.
My model consists of a convolutional neural network with 5x5 and 3x3 filter sizes and depths between 24 and 64. All convolutional layers use ReLU activation functions to introduce nonlinearity.
Four fully connected layers are used, together with two dropout layers to avoid overfitting.
Dropout layers with probability 0.5 are applied before the first two fully connected layers. Dropout was not added to the convolutional layers, as their weights are shared across spatial positions, so there should not be a huge number of parameters to overfit.
I did not try additional regularization (such as L2 weight penalties), as the model already performed quite well without it.
The model was trained and validated on different data sets (an 80/20 split) to ensure that the model was not overfitting. The model was tested iteratively by running it through the simulator after every change and verifying that the given change actually improved how the vehicle stays on the track.
The model used an Adam optimizer, so the learning rate was not tuned manually.
The hyperparameters from the NVIDIA blog post were used; I did not need to fine-tune them.
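As a rough illustration (a sketch, not the verbatim code from my script), the training setup looks like this; the placeholder network here happens to match the single fully connected layer baseline from the first iteration described later:

```python
import numpy as np
from keras.models import Sequential
from keras.layers import Flatten, Dense

# Random stand-ins for the recorded frames and steering angles;
# the real script loads these from the simulator's driving log.
images = np.random.rand(64, 160, 320, 3).astype('float32')
angles = np.random.rand(64).astype('float32')

# Trivial placeholder model: flatten the image and regress one steering value.
model = Sequential()
model.add(Flatten(input_shape=(160, 320, 3)))
model.add(Dense(1))

# Adam with its default learning rate (no manual tuning) and MSE loss,
# which suits single-value steering-angle regression.
model.compile(loss='mse', optimizer='adam')

# validation_split=0.2 gives the 80/20 train/validation split;
# shuffling breaks up the sequential ordering of driving frames.
model.fit(images, angles, validation_split=0.2, shuffle=True, epochs=3)
```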
I used a combination of center lane driving and recovering from the left and right sides of the road.
Training data from the second track was not used in the final model, although I did try training a model for the second track separately.
A Keras Lambda layer is used to normalize the data to zero mean, which mitigates vanishing gradients and speeds up convergence.
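A minimal sketch of such a layer (the exact scaling constants are the common choice for this project and an assumption here, not necessarily the ones in my code):

```python
from keras.models import Sequential
from keras.layers import Lambda

model = Sequential()
# Map pixel values from [0, 255] to [-0.5, 0.5] so the inputs are
# centered around zero before the first convolution.
model.add(Lambda(lambda x: x / 255.0 - 0.5, input_shape=(160, 320, 3)))
```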
A generator was not needed, as the network already performed well with the current dataset held fully in memory. For a bigger dataset, however, Keras generators would have to be used, since memory requirements were already a limiting factor on the AWS EC2 g2.2xlarge instance.
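A sketch of what such a generator could look like (`load_image` is a hypothetical helper standing in for whatever routine reads and preprocesses a frame from disk):

```python
import numpy as np
from sklearn.utils import shuffle

def batch_generator(image_paths, angles, batch_size=32):
    """Yield shuffled batches forever so only one batch is in memory at a time."""
    num_samples = len(image_paths)
    while True:  # Keras stops iteration via steps_per_epoch
        image_paths, angles = shuffle(image_paths, angles)
        for offset in range(0, num_samples, batch_size):
            batch_paths = image_paths[offset:offset + batch_size]
            batch_angles = angles[offset:offset + batch_size]
            # load_image is hypothetical: read and preprocess one frame
            images = np.array([load_image(p) for p in batch_paths])
            yield images, np.array(batch_angles)

# Typical usage:
# model.fit_generator(batch_generator(paths, angles, batch_size=32),
#                     steps_per_epoch=len(paths) // 32, epochs=3)
```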
I followed most of the suggestions provided in the Udacity classroom, and these were enough for the vehicle to be able to drive in the middle of the track.
I used an iterative approach, making small changes and testing the resulting model in the simulator. A step-by-step description can be found in the next section below.
To capture good driving behavior, I iteratively made small changes or added new training data and tested how each change affected driving. These were the steps:
Change: added just one fully connected layer and tested it with the provided test data.
Result: the car just drives in circles, so it behaves quite badly.
Change: added data normalization.
Result: the validation loss decreased dramatically, but the car's behavior didn't improve; it still kept driving in circles.
Change: implemented the network architecture from the NVIDIA blog post: https://devblogs.nvidia.com/deep-learning-self-driving-cars/
Result: the car already drives on the road, but still wanders onto the lane lines and off the road.
Change: as the architecture was already quite complex and the car's behavior was still quite rudimentary, I decided to record new training data.
I recorded around two laps in both directions of the track, trying to stay in the middle of the road and drive as smoothly as possible.
Result: the car drives quite well, but in some corners it goes over the lines, loses the track, and doesn't know how to get back onto it. This is probably caused by the training data, since while recording I only drove in the middle of the road.
Change: to help the car find its way back to the middle of the road, I recorded the vehicle recovering from the left and right sides of the road back to the center.
This was done everywhere on the track that had different lane markings.
Result: the vehicle mostly stays in the middle of the road, but in some places, such as tight bends, it still went off the track.
Change: added cropping of the top and bottom of the image (see the sketch below).
Result: the model worked better, but it still struggled in bends, and it seemed like the steering angle was limited to quite small values.
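A sketch of the cropping layer; the margins (70 pixels from the top, 25 from the bottom) are typical values for this simulator and are an assumption rather than the exact numbers from my code:

```python
from keras.models import Sequential
from keras.layers import Cropping2D

model = Sequential()
# ((top, bottom), (left, right)) margins: drop the sky and trees at the
# top and the car's hood at the bottom, keeping the full image width.
model.add(Cropping2D(cropping=((70, 25), (0, 0)), input_shape=(160, 320, 3)))
```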
Change: added dropout to two fully connected layers to avoid overfitting.
Result: the model now managed to take the first tight bend, but was still not able to take the second one.
Change: added usage of the left and right camera images with a steering correction of 0.2 (see the sketch below).
Result: this improved the car's behavior dramatically; it stayed in the middle of the road and easily took even the tight bends.
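Roughly, loading the three camera images could look like this (`load_image` is again a hypothetical helper; the column order matches the simulator's driving_log.csv):

```python
import csv
import numpy as np

CORRECTION = 0.2  # steering offset for the side cameras

images, angles = [], []
with open('driving_log.csv') as f:
    # Columns: center, left, right image paths, steering, throttle, brake, speed.
    for center, left, right, steering, *_ in csv.reader(f):
        steering = float(steering)
        images += [load_image(center), load_image(left), load_image(right)]
        # The left camera sees the car as too far left, so steer a bit more
        # to the right (+0.2), and vice versa for the right camera.
        angles += [steering, steering + CORRECTION, steering - CORRECTION]

X_train, y_train = np.array(images), np.array(angles)
```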
This is the final model I submitted. I also experimented with the second track, but didn't find a way to handle the tight ascents, so the architecture in its current state does not work on the second track. To better understand what change is needed, the internal CNN states would have to be visualized.
The final model architecture consisted of a convolutional neural network with the following layers (a Keras sketch follows the table):
| Layer | Description |
|---|---|
| Input | 160x320x3 RGB image |
| Lambda | Normalizes pixel values to zero mean |
| Cropping | Crops the top and bottom of the image |
| Convolution 5x5 | 24 filters, 5x5 kernel, 2x2 strides |
| ReLU | |
| Convolution 5x5 | 36 filters, 5x5 kernel, 2x2 strides |
| ReLU | |
| Convolution 5x5 | 48 filters, 5x5 kernel, 2x2 strides |
| ReLU | |
| Convolution 3x3 | 64 filters, 3x3 kernel, no strides |
| ReLU | |
| Convolution 3x3 | 64 filters, 3x3 kernel, no strides |
| ReLU | |
| Flatten | Flattens the feature maps to a 1D vector |
| Dropout | Dropout with probability 0.5 |
| Fully connected | Outputs 100 |
| Dropout | Dropout with probability 0.5 |
| Fully connected | Outputs 50 |
| Fully connected | Outputs 10 |
| Fully connected | Outputs 1 (steering angle) |
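For completeness, a minimal Keras (2.x style) sketch of this architecture, with the normalization and cropping layers from the earlier steps included; the crop margins are assumed values, not necessarily those from my code:

```python
from keras.models import Sequential
from keras.layers import Lambda, Cropping2D, Conv2D, Flatten, Dense, Dropout

model = Sequential()
# Normalize pixels to zero mean, then crop away sky and hood.
model.add(Lambda(lambda x: x / 255.0 - 0.5, input_shape=(160, 320, 3)))
model.add(Cropping2D(cropping=((70, 25), (0, 0))))
# Three strided 5x5 convolutions, then two unstrided 3x3 convolutions,
# all with ReLU activations, as in the table above.
model.add(Conv2D(24, (5, 5), strides=(2, 2), activation='relu'))
model.add(Conv2D(36, (5, 5), strides=(2, 2), activation='relu'))
model.add(Conv2D(48, (5, 5), strides=(2, 2), activation='relu'))
model.add(Conv2D(64, (3, 3), activation='relu'))
model.add(Conv2D(64, (3, 3), activation='relu'))
# Fully connected head with dropout before the first two dense layers.
model.add(Flatten())
model.add(Dropout(0.5))
model.add(Dense(100))
model.add(Dropout(0.5))
model.add(Dense(50))
model.add(Dense(10))
model.add(Dense(1))  # single output: the steering angle
model.compile(loss='mse', optimizer='adam')
```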