Blog

Four ways to get it wrong with SQL on Hadoop

More than half of professional developers work with SQL. And for all their early revolutionary fervor, even NoSQL databases are increasingly turning to SQL-like query languages. Despite its imperfections, SQL is how the world queries data. Hadoop started from a fundamentally different position. Hadoop was about solving the problems of big data: variety, volume and…

Read more

A Review of DataWorks Summit, San Jose 2017

The DataWorks Summit in San Jose was held on June 13-15, and this blog post summarises interesting talks at the event. Keynote section Sumeet Singh (Yahoo) Sumeet talked about Yahoo’s migration from MapReduce jobs to those running on Tez on the 39K+ nodes that they use for Hadoop processing with over 1M jobs per day….

Read more