Datasets for some of the Advanced Test Suites are not shipped with the repository. You can get them as follows:
For the TPC-H SF100 Parquet tests, download the dataset from Amazon S3
Extract the compressed file and copy the files to MapR-FS / HDFS into /drill/testdata/tpch100/parquet
For the TPC-DS SF100 tests, download the dataset from Amazon S3
Extract the compressed file and copy the files to MapR-FS / HDFS into /drill/testdata/tpcds_sf100/parquet
For the Mondrian tests, download the dataset from Amazon S3
Extract the compressed file and copy the files to MapR-FS / HDFS into /drill/testdata/mondrian
Download the tpch100_dir_partitioned_50000files lineitem dataset from https://s3.amazonaws.com/apache-drill/files/tpch100_dir_partitioned_50000files-lineitem.tgz
Extract the compressed file and copy the files to /drill/testdata/tpch100_dir_partitioned_50000files/lineitem
For the data-shapes widestring 100000rows parquet tests, download the dataset from Amazon S3
Extract the compressed file and copy the files to MapR-FS / HDFS into /drill/testdata/data-shapes/wide-columns/5000/100000rows/parquet
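The download, extract, and copy steps above can be scripted. The following is a minimal sketch, not part of the test framework: the helper names (`extract_archive`, `put_commands`, `install_dataset`) are hypothetical, it assumes the `hadoop` CLI is on the PATH of a node that can write to the cluster, and it assumes the files to copy sit at the top level of the archive (adjust the source directory if the archive wraps them in a folder). Only the lineitem URL is given in the text above, so that is the only URL used here.

```python
import subprocess
import tarfile
import tempfile
import urllib.request
from pathlib import Path

# URL and destination taken from the instructions above.
LINEITEM_URL = (
    "https://s3.amazonaws.com/apache-drill/files/"
    "tpch100_dir_partitioned_50000files-lineitem.tgz"
)
LINEITEM_DEST = "/drill/testdata/tpch100_dir_partitioned_50000files/lineitem"


def extract_archive(archive: Path, workdir: Path) -> Path:
    """Unpack a gzip-compressed tar archive and return the extraction directory."""
    out = workdir / "extracted"
    out.mkdir(parents=True, exist_ok=True)
    with tarfile.open(archive, "r:gz") as tar:
        tar.extractall(out)
    return out


def put_commands(local_dir: Path, dest: str) -> list[list[str]]:
    """Build the `hadoop fs` commands that copy local_dir's entries into dest."""
    # Assumption: everything at the top level of local_dir belongs in dest.
    sources = sorted(str(p) for p in local_dir.iterdir())
    return [
        ["hadoop", "fs", "-mkdir", "-p", dest],
        ["hadoop", "fs", "-put", *sources, dest],
    ]


def install_dataset(url: str, dest: str, dry_run: bool = True) -> list[list[str]]:
    """Download url, extract it, then print (dry run) or execute the copy commands."""
    with tempfile.TemporaryDirectory() as tmp:
        workdir = Path(tmp)
        archive = workdir / "dataset.tgz"
        urllib.request.urlretrieve(url, str(archive))
        extracted = extract_archive(archive, workdir)
        commands = put_commands(extracted, dest)
        for cmd in commands:
            if dry_run:
                print(" ".join(cmd))
            else:
                subprocess.run(cmd, check=True)
        return commands
```

Calling `install_dataset(LINEITEM_URL, LINEITEM_DEST)` downloads the archive and prints the copy commands without running them; pass `dry_run=False` to execute them. The SF100 archives are large, so make sure the temporary directory has enough free space.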