SQL on Hadoop Benchmarking – Spark 2.0
This post follows on from my previous blog post of 12th Oct 2016. We have recently upgraded CDH from 5.7.1 to 5.8.2 and have been concentrating on getting the TPC-DS benchmarks up and running for Spark. Spark 1.6 ships with the CDH distribution, but we also installed the Spark 2.0 beta available from Cloudera. In the…