Posted Jun 19, 2018

In General


Read time 1 min

Yesterday Hortonworks announced Hortonworks 3.0, a big step forward for making Hadoop a more viable option for analytical workloads.

At a high level, Hadoop has historically struggled to perform when SQL-on-Hadoop was used (think IBM Big SQL and Hive), especially compared to the performance of OLAP (cubes or MOLAP) and even star schemas (data warehouses/marts). This meant traditional BI and data biz tools could work on Hadoop but did not perform up to standard. Remember Hadoop is intended to do very big workloads, like aggregation/summation of massive data sets – not joins and filters as BI requires. In Hortonworks 3.0, this has been addressed by supercharging Hive with Apache Druid as a columnar data store, and the details are in the press release I included below.

Big News for Big Data - Hortonworks 3.0

Hortonworks continues to strengthen its partnerships as well, and from IBM there is a brand new service, called IBM Hosted Analytics with Hortonworks (IHAH). This service combines Hortonworks Data Platform, IBM’s Big SQL and the IBM Data Science Experience (Watson Studio).

Details can be found here.

Written by: Chris Foster, Practice Lead, Newcomp Analytics.

Line graphic of a mountain

No matter where you are in your analytics journey, we'll guide you the rest of the way.

Animated Graphic: mountain-cloud
Consultation Form
First Name
Last Name
What Are You Interested In? *
Animated Graphic: mountain