ANALYSIS-SP500

Using pca and transformations to analyze s&p dataset.

General
- Background
Installation
Footnote

General

We used s&p database to find outliers. Due the number of variables, we had to take actions to change the dataset to spot ones and to use to database in general.

Background

The data is formed from two files:

Prices - Includes stock symbol, volume and for each day open, close and high prices. The data is ranging 2010 to 2016.
Securities - Has additional information about the stocks. It includes the stock sector, sub industry, address of headquarters, security, and filling type.

PCA - which stands for Principal Component Analysis is used to represent multivariate data as a new dataset with less variables in order view trades, outliers, and clusters.

Installation

I will use google as an example, but similar process can be performed on other notebook editors

Open google Colab

Clone the project by:

!git clone https://github.com/elaysason/ANALYSIS-SP500.git

Now the folder is in your files on colab. Simpily download the notebook as showed

Footnote

The exercise is focused on the data from 2016 and includes only stocks which had data for each one of the days in the year.

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
HW0.ipynb		HW0.ipynb
PCA_SP500.ipynb		PCA_SP500.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ANALYSIS-SP500

General

Background

Installation

Footnote

About

Releases

Packages

Languages

elaysason/ANALYSIS-SP500

Folders and files

Latest commit

History

Repository files navigation

ANALYSIS-SP500

General

Background

Installation

Footnote

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages