Skip to content

Time to get your data sorted! The Data Preparation Handbook, published by Manning within the MEAP release, is the go-to guide for handling messy data. All the book's code and resources can be found here.

Notifications You must be signed in to change notification settings

datacorner/dataprep-handbook

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

72 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Data preparation handbook (code and resources)

The book is currently available in the Manning Early Access Program (MEAP)), please fille free to participate in this awesome opportunity and give me your feedbacks.

How to

How to leverage the resources / Install and configure your environment

Resources available per chapter

Note: Some datasets have been modified from their original versions for compatibility with the provided code examples. To ensure the code works as intended, it is recommended to use the modified datasets (as they are referenced already). However, for reference and additional context, links to the original datasets are also included.

Chapter 1 - Introduction to data preparation

N.A.

Chapter 2 - Unveiling the secrets of data

Chapter 3 - Data quality challenges

Chapter 4 - Techniques for data transformation

Chapter 5 - Reveiling informations

Chapter 6 - Data preparation for machine learning and AI

Chapter 7 - Data preparation for dashboards and reports

Chapter 8 - Generative AI for data preparation

Chapter 9 - Visual data preparation with Alteryx

The Alteryx exports (yxmd files)can be found here, you can just copy the file on your desktop and open them by using the Alteryx client.

Note: the exports have been done with Alteryx v2024.1.1.93 Patch: 3

Chapter 10 - Data preparation at scale

Available soon

Chapter 11 - Trends and future challenges

Available soon

Profiling

Most of the datasets used in this book have already been profiled. The outcomes can be found here.

About

Time to get your data sorted! The Data Preparation Handbook, published by Manning within the MEAP release, is the go-to guide for handling messy data. All the book's code and resources can be found here.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages