Skip to content

Latest commit

 

History

History
69 lines (54 loc) · 3.39 KB

README.md

File metadata and controls

69 lines (54 loc) · 3.39 KB

Data preparation handbook (code and resources)

The book is currently available in the Manning Early Access Program (MEAP)), please fille free to participate in this awesome opportunity and give me your feedbacks.

How to

How to leverage the resources / Install and configure your environment

Resources available per chapter

Note: Some datasets have been modified from their original versions for compatibility with the provided code examples. To ensure the code works as intended, it is recommended to use the modified datasets (as they are referenced already). However, for reference and additional context, links to the original datasets are also included.

Chapter 1 - Introduction to data preparation

N.A.

Chapter 2 - Unveiling the secrets of data

Chapter 3 - Data quality challenges

Chapter 4 - Techniques for data transformation

Chapter 5 - Reveiling informations

Chapter 6 - Data preparation for machine learning and AI

Chapter 7 - Data preparation for dashboards and reports

Chapter 8 - Generative AI for data preparation

Chapter 9 - Visual data preparation with Alteryx

The Alteryx exports (yxmd files)can be found here, you can just copy the file on your desktop and open them by using the Alteryx client.

Note: the exports have been done with Alteryx v2024.1.1.93 Patch: 3

Chapter 10 - Data preparation at scale

Available soon

Chapter 11 - Trends and future challenges

Available soon

Profiling

Most of the datasets used in this book have already been profiled. The outcomes can be found here.