R Graphics with ggplot2 pbdR The "Programming with Big Data in R" project (pbdR) is a set of highly scalable R packages for distributed computing and profiling in data science. Motivation: Why pbdR? Basics: Parallel Computing with R and MPI via pbdMPI Task Parallelism with the tasktools Package Distributed Matrices the pbdR Way Parallel I/O with hdfio Machine Specific Information R and pbdR on Summit