⚒️ Data Preprocessing Techniques ✨

Data Preprocessing is that step in which the data gets transformed, or Encoded, to bring it to such a state that now the machine can easily parse it. In other words, the features of the data now become Algorithm interpretable.

By the end of this ,you will be equiped to data handle gracefully.so lets gets started 🏃‍♀️

Why Data Preprocessing ⚡

✅Accuracy: To check whether the data entered is correct or not.
✅Believability: The data should be trustable.
✅Completeness: To check whether the data is available or not recorded.
✅Consistency: To check whether the same data is kept in all the places that do or do not match.
✅Interpretability: The understandability of the data.
✅Timeliness: The data should be updated correctly.

Primary Tasks 🎯

Libraries required

Use the package manager pip to install below

pip install numpy
pip install pandas
pip install sklearn

Table of Content:

No	Topics	Code Link 🔗
1	Cardinality Encoding	Code
2	Delete Missing Values	Code
3	Delete outliers	Code
4	Feature Discreatization	Code
5	Feature Rescaling	Code
6	Handling Imbalance	Code
7	Data Imputation - Mean	Code
8	Imputation Missing Labels	Code
9	Normalization	Code
10	One Hot Encoding	Code
11	Outliers Dealing	Code
12	Pandas Categorical with Sklearn	Code
13	Preprocess Categorical Features	Code
14	Standardize IRIS	Code

Want to Stay Updated !!

Fork 🍴 the repository

Learned Something !!

Give a 🌟 to support me 😊

@misc{Charged Neuron,
    author       = {Roja Achary},
    title        = {Data Preprocessing Techniques},
    Credits      = {websites,CA,me,AV},
    month        = {November},
    year         = {2021}
}

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
Poster_Preprocessing.png		Poster_Preprocessing.png
Standardise_IRIS.ipynb		Standardise_IRIS.ipynb
cardinality_encoding.ipynb		cardinality_encoding.ipynb
del_missing_val.ipynb		del_missing_val.ipynb
del_outliers.ipynb		del_outliers.ipynb
feature_discreatization.ipynb		feature_discreatization.ipynb
feature_rescaling.ipynb		feature_rescaling.ipynb
handle_imbalance.ipynb		handle_imbalance.ipynb
imputation_with_mean.ipynb		imputation_with_mean.ipynb
impute_miss_labels.ipynb		impute_miss_labels.ipynb
normalization.ipynb		normalization.ipynb
one_hot_encoding.ipynb		one_hot_encoding.ipynb
outliers_problem.ipynb		outliers_problem.ipynb
pandas_cat_4_sklearn.ipynb		pandas_cat_4_sklearn.ipynb
prepro_cat_feat.ipynb		prepro_cat_feat.ipynb
preprocessing-types.png		preprocessing-types.png
readme.md		readme.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

⚒️ Data Preprocessing Techniques ✨

Why Data Preprocessing ⚡

Primary Tasks 🎯

Libraries required

Table of Content:

Want to Stay Updated !!

Learned Something !!

About

Releases

Packages

Languages

rojaAchary/Data_Preprocessing_Techniques

Folders and files

Latest commit

History

Repository files navigation

⚒️ Data Preprocessing Techniques ✨

Why Data Preprocessing ⚡

Primary Tasks 🎯

Libraries required

Table of Content:

Want to Stay Updated !!

Learned Something !!

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages