DASK is a python based parallel computing and task scheduling Library. It's smooth integration with popular python libraries numpy, pandas, sklearn make this an increasingly popular choice for parallel computing.
The intention of this tutorial is to give an overview about DASK to a beginner.
Some interesting tutorials:
- https://github.com/dask/dask-tutorial
- https://github.com/mrocklin/dask-workshop
- Distributed Random Forests: https://gist.github.com/mrocklin/9f5720d8658e5f2f66666815b1f03f00
NOTE: I am not a developer of DASK. It is developed by Continuum Analytics. See http://dask.pydata.org/en/latest/index.html for more information.