Time Series ARCH Model and AWS Deployment

Business Objective

A time series is a sequence of data points ordered in time. In time series analysis, time is usually the independent variable, and the primary goal is to forecast future values. Time series data shows up in many day-to-day activities, such as:

Tracking daily, hourly, or weekly weather data
Monitoring changes in application performance
Visualizing real-time vitals in medical devices

The ARCH (Autoregressive Conditional Heteroskedasticity) process, introduced by Engle in 1982, explicitly accounts for the difference between unconditional and conditional variance. It allows the conditional variance to change over time based on past errors. The key components are:

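Autoregressive: The current variance is modeled as a function of its own past, through lagged squared errors.
Conditional: The variance at each time step is conditional on the errors observed up to the previous step.
Heteroskedasticity: The variance of the series changes over time rather than staying constant.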
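
For reference, the three components above correspond to the standard ARCH(p) specification. The symbols are notation chosen here, not taken from the project code: \varepsilon_t is the error at time t, \sigma_t^2 is its conditional variance, z_t is a standardized shock, and \omega, \alpha_i are the model coefficients.

```latex
\begin{aligned}
\varepsilon_t &= \sigma_t z_t, \qquad z_t \sim \text{i.i.d.}(0, 1) \\
\sigma_t^2 &= \omega + \sum_{i=1}^{p} \alpha_i \varepsilon_{t-i}^2, \qquad \omega > 0,\ \alpha_i \ge 0
\end{aligned}
```

For ARCH(1) with \alpha_1 < 1, the unconditional variance is the constant \omega / (1 - \alpha_1), while the conditional variance \sigma_t^2 moves with the most recent squared error. That is the distinction between unconditional and conditional variance mentioned above.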

GARCH (Generalized Autoregressive Conditional Heteroskedasticity) is an extension of the ARCH model, incorporating past variances and squared residuals to estimate current and future variances. It is well-suited for modeling time series data with heteroskedasticity and volatility clustering.
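
Written out, the usual GARCH(p, q) variance equation adds lagged variances to the lagged squared residuals. The notation is an assumption made here for illustration: p counts squared-residual lags and q counts variance lags, which matches the p and q arguments of the arch package's arch_model function (some textbooks swap the two letters).

```latex
\sigma_t^2 = \omega + \sum_{i=1}^{p} \alpha_i \varepsilon_{t-i}^2 + \sum_{j=1}^{q} \beta_j \sigma_{t-j}^2,
\qquad \omega > 0,\ \alpha_i \ge 0,\ \beta_j \ge 0
```

Setting every \beta_j to zero recovers the ARCH(p) model. The \beta_j terms let a period of high variance persist, which is what produces volatility clustering.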

Deployment is the process of integrating a machine learning model into an existing production environment, enabling practical decision-making based on data. MLOps (Machine Learning Operations) emphasizes automation and monitoring at all stages of ML system construction, including integration, testing, releasing, deployment, and infrastructure management.

In this project, we build an MLOps project that deploys the time series ARCH model with Python on Amazon Web Services (AWS), keeping costs low by using as few AWS services as possible.

Data Description

The dataset is the "Call-centers" data, recorded at a monthly level. Calls are categorized by domain because the call center serves several domains. External regressors, such as the number of channels and phone lines, reflect traffic predictions made by in-house analysts and resource availability.

The dataset contains 132 rows and 8 columns:

Month, healthcare, telecom, banking, technology, insurance, number of phone lines, and number of channels.

Aim

Build ARCH and GARCH models on the provided dataset.
Create an MLOps pipeline using the Amazon Web Services (AWS) platform to deploy the time series ARCH model in a production environment.

Tech Stack

Language: Python
Libraries: Flask, pickle, pandas, numpy, matplotlib, seaborn, statsmodels, scipy, arch
Services: Flask, AWS, Docker, Lightsail, EC2

Approach

Import the required libraries and read the dataset.
Perform descriptive analysis.
Data pre-processing:
- Set the date as the index.
- Set the frequency as monthly.
Exploratory Data Analysis (EDA):
- Visualize the data.
Perform train-test split.
Calculate returns and volatility.
ARCH model:
- Install necessary libraries.
- Build ARCH models with varying parameters.
- Build higher-lag ARCH models.
GARCH model:
- Build a GARCH model.
Forecasting the results:
- Forecast results using the best model.

Deployment Approach

Model Creation
- Save the model in a pickle format (.pkl).
Flask App
- Create a Flask application.
EC2 Machine Setup
- Create an EC2 instance using the AWS Management Console.
- Launch the instance.
- Install the PuTTY SSH client on your local machine for remote access.
EC2 and Docker Setup
- Follow the instructions in the 'install-docker.sh' file.
AWS CLI Installation
- Follow the steps in the 'install-aws-cli.sh' file.
Lightsail Installation
- Refer to the steps in the 'install-lightsail-cli.sh' file.
Upload Files to the EC2 Machine
- Method 1:
  - Upload the code as a zip file through the AWS Console (CloudShell).
- Method 2:
  - Create an S3 storage bucket and upload the zipped code to it.
  - Copy the object URL and use it on the EC2 machine to download the code.
  - Unzip the Bitbucket folder.
Deployment
- Follow the installation order as outlined in 'lightsail-deployment.md'.
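
The first two deployment steps (pickling the model and serving it from Flask) might look roughly like the sketch below. Everything specific in it is hypothetical: the /forecast route, the temporary file name, and the pickled dictionary with placeholder numbers, which stands in for the fitted ARCH model that the real project saves. The repository's app.py may be organized quite differently.

```python
import os
import pickle
import tempfile

from flask import Flask, jsonify

# Hypothetical stand-in artifact with placeholder values. The real project
# pickles the fitted ARCH model into the Output folder instead.
MODEL_PATH = os.path.join(tempfile.gettempdir(), "arch_model_demo.pkl")
with open(MODEL_PATH, "wb") as fh:
    pickle.dump({"model": "ARCH(1)", "variance_forecast": [4.2, 4.0, 3.9]}, fh)

app = Flask(__name__)

# Load the pickled artifact once at startup rather than on every request.
with open(MODEL_PATH, "rb") as fh:
    artifact = pickle.load(fh)


@app.route("/forecast")
def forecast():
    """Return the stored forecast as JSON (hypothetical endpoint)."""
    return jsonify(artifact)


# Exercise the endpoint in-process with Flask's test client, no server needed.
with app.test_client() as client:
    response = client.get("/forecast")
    print(response.status_code, response.get_json())
```

Inside a Docker container, the app would instead be started with app.run bound to host 0.0.0.0 so that the port published by the container is reachable from outside. Only unpickle files you produced yourself, since loading a pickle can execute arbitrary code.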

Code Structure

Input: CallCenterData.xlsx
MLPipeline: This folder contains all the functions organized into different Python files.
notebook: Time series ARCH model IPython notebook file.
Output: ARCH model saved in a pickle format.
app.py: Flask app configuration.
Dockerfile: Docker image configuration.
Engine.py: File where the MLPipeline files are called.
install-aws-cli.sh: Steps for AWS CLI installation.
install-docker.sh: Steps for Docker installation.
install-lightsail-cli.sh: Steps for Lightsail installation.
lightsail-deployment.md: Lightsail deployment readme file.
requirements.txt: List of essential libraries with their versions.

AjNavneet/TimeSeries-ARCH-GARCH-AWS-Deployment
