Python Data Science Handbook

This repository contains the entire Python Data Science Handbook, in the form of (free!) Jupyter notebooks.

This repository contains the full listing of IPython notebooks behind the Python Data Science Handbook, including all text, code, and figures. The book was written and tested with Python 3.5, though older Python versions (including Python 2.7) should work in nearly all cases.

The book introduces the core libraries essential for working with data in Python, particularly IPython, NumPy, Pandas, Matplotlib, Scikit-Learn, and related packages. Familiarity with Python as a language is assumed; if you need a quick introduction to the language itself, see the free companion project, A Whirlwind Tour of Python, a fast-paced introduction to the Python language aimed at researchers and scientists.

The following listing links to the notebooks in this repository, rendered through the nbviewer service:

Table of Contents

Preface

1. IPython: Beyond Normal Python

2. Introduction to NumPy

3. Data Manipulation with Pandas

4. Visualization with Matplotlib

5. Machine Learning

Appendix: Figure Code

Required Packages

The code in the book was tested with Python 3.5, though most (but not all) of it will also work correctly with Python 2.7 and other older Python versions.

The packages required to run the code in the book are listed in requirements.txt. To install them using conda, run the following at the command line:

$ conda install --file requirements.txt
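
To preview which packages and versions the install command will request, the pins in a requirements file can be listed with a few lines of Python. This is only a sketch: the helper name `parse_requirements` is hypothetical, and the line format (a bare package name, `name=version` as conda writes it, or `name==version` as pip writes it) is an assumption rather than a description of this repository's actual requirements.txt. The sample text is made up for illustration.

```python
import re

# Assumed line format: "name", "name=version" (conda) or "name==version" (pip).
_PIN = re.compile(r"^([A-Za-z0-9_.\-]+)\s*(?:==?\s*([^\s#]+))?")


def parse_requirements(text):
    """Map each package name to its pinned version, or None if unpinned."""
    pins = {}
    for raw_line in text.splitlines():
        line = raw_line.split("#", 1)[0].strip()  # drop comments and whitespace
        if not line:
            continue  # skip blank and comment-only lines
        match = _PIN.match(line)
        if match:
            pins[match.group(1).lower()] = match.group(2)
    return pins


if __name__ == "__main__":
    # Made-up sample, not the contents of this repository's requirements.txt.
    sample = "# example pins\nnumpy=1.11\npandas==0.18\nseaborn\n"
    for name, version in sorted(parse_requirements(sample).items()):
        print(name, version or "(unpinned)")
```

To inspect the real file, pass the result of `open("requirements.txt").read()` instead of the sample string. Other version specifiers, such as `>=`, are not handled and are reported as unpinned.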

To create a stand-alone environment named PDSH with Python 3.5 and all the required package versions, run the following:

$ conda create -n PDSH python=3.5 --file requirements.txt

You can read more about using conda environments in the Managing Environments section of the conda documentation.
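
After installing the packages or creating the environment, a quick check like the following can confirm that the core libraries import cleanly and show which versions are in use. It is a minimal sketch, not part of the book: the function name `report_versions` is hypothetical, and the module list simply uses the import names of the libraries mentioned above (`sklearn` is the import name of Scikit-Learn). It avoids newer syntax such as f-strings so that it also runs on Python 3.5.

```python
import importlib
import sys

# Import names of the core libraries the book introduces.
# Adjust this list to match what requirements.txt actually installs.
CORE_MODULES = ["IPython", "numpy", "pandas", "matplotlib", "sklearn"]


def report_versions(module_names):
    """Return {module name: version string, or None if it cannot be imported}."""
    versions = {}
    for name in module_names:
        try:
            module = importlib.import_module(name)
        except ImportError:
            versions[name] = None
            continue
        # Not every module defines __version__; fall back to "unknown".
        versions[name] = getattr(module, "__version__", "unknown")
    return versions


if __name__ == "__main__":
    print("Python", sys.version.split()[0])
    for name, version in sorted(report_versions(CORE_MODULES).items()):
        print("{:<12} {}".format(name, version or "NOT INSTALLED"))
```

Run it with the interpreter from the environment you want to check (for example, after activating the PDSH environment), since each environment has its own set of installed packages.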

License

Code

The code in this repository, including all code samples in the notebooks listed above, is released under the MIT license. Read more at the Open Source Initiative.

Text

The text content of the book is released under the CC-BY-NC-ND license. Read more at Creative Commons.
