Skip to content

A repository that contains the implementation of the PCA method. It involves working on the famous business problem - Wine Dataset.

Notifications You must be signed in to change notification settings

Madhura12gj/Principal-Component-Analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 

Repository files navigation

Principal-Component-Analysis

Business Problem

The business problem of this famous dataset is that there are too many independent variables and one dependent variable. Assuming that the business owner gathered information of the independent variables and applied some clustering technique, identifying around 3 groups of customers. Each segemnt having a specific preferrence for a specific wine.

Here is a Logistic regression classification model that uses the Dimensionality reduction technique-PCA. For m independent variables in the dataset, PCA extracts p<m new independent variables that explain the most the variance of the dataset, regardless of the dependent variables.

That makes PCA as an unsupervised model,since independent variable is not considered.In this model we reduce the independent varaibles to 2 features and visualize our results.

About

A repository that contains the implementation of the PCA method. It involves working on the famous business problem - Wine Dataset.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages