Motion forecasting for vehicles

A note about file structure

prelude/ introductory work
api/ A FastAPI api for serving dataset
training/ code (noteboks and helper scripts) for training the model
clustering code for second phase i.e. optimal cluster formation

Block Diagram

Problem Statement

Vehicular networks enable vehicles to communicate with each other and with roadside infrastructure. These smart vehicles are constrained in terms of computational capacity. Task offloading is a technique to perform computation intensive tasks in a coordinated manner by sharing the task load.

We have proposed an efficient technique for task offloading to reduce average computation time and the probability of failed task computations.

In summary, this project revolves around:

Lack of computational capacity in smart vehicles
Latency issues with cloud
Task offloading decisions

Data Pipeline

Data

Dataset is generated using the SUMO traffic simulator.
Different variants from the dataset are served through an API developed with FastAPI on Heroku.

Exploratory Data Analysis

Obersvations and suggestions for position data

Vehicle positions are almost purely linear data.
A correlation was speculated between the positions of some vehicles in the network duing dataset generation, the EDA phase supported this speculation.
Positions start from 0 for each vehicle and are in the range of a continuous real interval different for each vehicle.

These observation hint towards using suitable standardization or normalization techniques since Deep Learning models work better on standardized data.

Obersvations and suggestions for speed data

Vehicle speeds are in in the form of a time series range.
Some of the vehicles have peaks when the trip is initialized, and this needs to be taken care of. This is because it is a difficult task for time series regression algorithms to predict and generalize peaks in the data, especially when there are no seasonality trends in the data and it is completely random.
There should be some stationarity checks applied to this time series data for getting more insights before starting the development of the model. These tests are a part of the next phase in the pipeline.

Data Preprocessing

Before applying some preprocessing techniques, some tests were performed to get detailed insights. Stationarity checks were ran on the dataset.

Observations

Standard deviation is constant and therefore the variance is constant which should be the case for a time series to be stationary.
However, some vehicles violated the condition of test statistics by having a value lesser than the critical values.

Therefore, the data failed the stationarity test and needed to be made stationary before training.

Lag1 Differencing was applied to the dataset to make it stationary.
Vector normalization was also applied to the training data

Data Splitting

Data is splitted as: 85% train - 15% test
Data is also shaped as (number_of_time_steps, number_of_features) in order for it to train on Sequential DL Models (RNN, LSTM, GRU)

Model Development

The models is developed using Keras
After comparing the several models that were developed, the model with best results (Dropout Regularized LSTM) was chosen

Architecture of chosen model

Clustering

The forecasts from the model are given to the clustering algorithm which figures out the optimal cluster formation for a given test vehicle and constraints.

Data Preprocessing

The speed values are quantized upto 3 decimal places.

Feature Engineering

Vehicle Positions were dropped from the training data because of these observations.

Position data is always linearly increasing while the speed data is a time series.
The dataset simulated by SUMO was set to output readings every one second so we can easily derive the positions after forecasting.

This decision also reduces the neural network computation since there would now be 20 features for training instead of 40 in a 20 vehicle network.

Mean Computation Time Calculation

This phase is explained in the Algorithm section.

Name		Name	Last commit message	Last commit date
Latest commit History 95 Commits
.ipynb_checkpoints		.ipynb_checkpoints
api		api
clustering		clustering
prelude		prelude
training		training
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Motion forecasting for vehicles

A note about file structure

Block Diagram

Problem Statement

Data Pipeline