Datasets for some of the Advanced Test Suites are not shipped with the repository. You can get them as follows:
For the TPC-H SF100 Parquet tests, download the dataset from Amazon S3.
Extract the compressed file and copy the files to MapR-FS / HDFS under /drill/testdata/tpch100/parquet
For the TPC-DS SF100 tests, download the dataset from Amazon S3.
Extract the compressed file and copy the files to MapR-FS / HDFS under /drill/testdata/tpcds_sf100/parquet
For the Mondrian tests, download the dataset from Amazon S3.
Extract the compressed file and copy the files to MapR-FS / HDFS under /drill/testdata/mondrian
Download the required dataset from https://s3.amazonaws.com/apache-drill/files/tpch100_dir_partitioned_50000files-lineitem.tgz
Extract the compressed file and copy the files to "/drill/testdata/tpch100_dir_partitioned_50000files/lineitem"
For the data-shapes widestring 100000rows parquet tests, download the dataset from [Amazon S3](http://drill-public.s3.amazonaws.com/data-shapes/wide-columns/5000/100000rows/parquet/widestrings.tar.gz)
Extract the compressed file and copy the files to MapR-FS / HDFS under /drill/testdata/data-shapes/wide-columns/5000/100000rows/parquet
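
Every dataset above follows the same download, extract, and copy pattern, so it can be scripted. The following is a minimal Python sketch, not part of the test framework. The helper names (`download`, `extract`, `copy_commands`, `stage_dataset`) and the `/tmp/drill-datasets` working directory are hypothetical. It assumes the `hadoop` command-line client is on the PATH and configured for your MapR-FS / HDFS cluster, and that the top-level entries of each archive are what belongs in the target directory. If an archive wraps its files in an extra directory, adjust which path is uploaded.

```python
import subprocess
import tarfile
import urllib.request
from pathlib import Path


def download(url, dest):
    """Download url to the local path dest and return dest."""
    dest.parent.mkdir(parents=True, exist_ok=True)
    urllib.request.urlretrieve(url, str(dest))
    return dest


def extract(archive, out_dir):
    """Extract a tar archive (gzip or plain) and return its top-level entries."""
    out_dir.mkdir(parents=True, exist_ok=True)
    with tarfile.open(archive, "r:*") as tar:
        tar.extractall(out_dir)
    return sorted(out_dir.iterdir())


def copy_commands(entries, target_dir):
    """Build the `hadoop fs` commands that create target_dir and upload entries."""
    cmds = [["hadoop", "fs", "-mkdir", "-p", target_dir]]
    # `-put` copies directories recursively; `-f` overwrites existing files.
    cmds += [["hadoop", "fs", "-put", "-f", str(e), target_dir] for e in entries]
    return cmds


def stage_dataset(url, target_dir, work_dir=Path("/tmp/drill-datasets")):
    """Download, extract, and copy one dataset (work_dir is an assumed scratch path)."""
    archive = download(url, work_dir / url.rsplit("/", 1)[-1])
    entries = extract(archive, work_dir / "extracted" / archive.name)
    for cmd in copy_commands(entries, target_dir):
        subprocess.run(cmd, check=True)
```

For example, calling `stage_dataset("https://s3.amazonaws.com/apache-drill/files/tpch100_dir_partitioned_50000files-lineitem.tgz", "/drill/testdata/tpch100_dir_partitioned_50000files/lineitem")` would stage the directory-partitioned lineitem data. The SF100 archives are large, so make sure the working directory has enough free space for both the archive and its extracted contents.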