Skip to content

Latest commit

 

History

History
32 lines (18 loc) · 1.63 KB

File metadata and controls

32 lines (18 loc) · 1.63 KB

Advanced Test Suites

Datasets for some of the Advanced Test Suites are not shipped with the repository. You can get them as follows:

TPC-H SF 100

For the TPC-H SF100 Parquet tests, download dataset from Amazon S3

Extract this compressed file and copy over files to MapR-FS / HDFS into /drill/testdata/tpch100/parquet

TPC-DS SF 100

For the TPC-DS SF100 tests, download dataset from Amazon S3

Extract this compressed file and copy over files to MapR-FS / HDFS into /drill/testdata/tpcds_sf100/parquet

Mondrian

For the Mondrian tests, download dataset from Amazon S3

Extract this compressed file and copy over files to MapR-FS / HDFS into /drill/testdata/mondrian

Metadata Caching

Download the required data set from https://s3.amazonaws.com/apache-drill/files/tpch100_dir_partitioned_50000files-lineitem.tgz Extract this compresses file and copy over files to "/drill/testdata/tpch100_dir_partitioned_50000files/lineitem"

Data-shapes widestring

For the data-shapes widestring 100000rows parquet tests, download dataset from [Amazon S3](http://drill-public.s3.amazonaws .com/data-shapes/wide-columns/5000/100000rows/parquet/widestrings.tar.gz)

Extract this compressed file and copy over files to MapR-FS / HDFS into /drill/testdata/data-shapes/wide-columns/5000/100000rows/parquet