A hand coded program can use the most efficient methods to gain the fastest query running speed, thus we can know the performance gap between current TiDB and the ideal database implementations.
git clone https://github.com/pingcap/tidb-bench.git
cd tidb-bench/tpch
make tbl
the last command will generate 1 csv file for 1 table:
➜ ls -l *.tbl
-rw-r--r-- 1 jianzhang.zj staff 24346144 Mar 30 21:55 customer.tbl
-rw-r--r-- 1 jianzhang.zj staff 759863287 Mar 30 21:55 lineitem.tbl
-rw-r--r-- 1 jianzhang.zj staff 2224 Mar 30 21:55 nation.tbl
-rw-r--r-- 1 jianzhang.zj staff 171952161 Mar 30 21:55 orders.tbl
-rw-r--r-- 1 jianzhang.zj staff 24135125 Mar 30 21:55 part.tbl
-rw-r--r-- 1 jianzhang.zj staff 118984616 Mar 30 21:55 partsupp.tbl
-rw-r--r-- 1 jianzhang.zj staff 389 Mar 30 21:55 region.tbl
-rw-r--r-- 1 jianzhang.zj staff 1409184 Mar 30 21:55 supplier.tbl
The defailt scale factor is 1, to generate a larger csv file, take a look at Makefile
for more details.
- 2011 VLDB: "Efficiently Compiling Efficient Query Plans for Modern Hardware"
- 2011 ICDE: "HyPer: A Hybrid OLTP&OLAP Main Memory Database System Based on Virtual Memory Snapshots"