Blogs

Three reasons Hadoop in the cloud is a good option

When it comes to improving big data analytics solutions, one technology that is being increasingly adopted is Hadoop.

With the launch of Hadoop 2.0 in October, this is now a rapidly-maturing technology and the benefits are being seen by a growing number of businesses.

A survey released in October by the Sand Hill Group revealed that while many firms are still dipping their toes in the water when it comes to Hadoop – with 44 percent of respondents still in the exploration and education phase – this is set to change as many companies invest more in the platform.

It found almost a quarter of firms say advanced data analytics will be their top priority for Hadoop in the next 12 to 18 months – a threefold increase from the number of companies focusing on this today.

For many businesses, solutions that take advantage of cloud computing tools will be the way forward for their Hadoop deployments. In a piece for Smart Data Collective, vice-president of corporate marketing at MapR Technologies Michele Nemschoff said: “The cloud offers several advantages for businesses looking to use Hadoop, so all businesses – including small and medium-sized ones – can truly start to take advantage of big data.”

She highlighted three key reasons why this is the case. These are:

Flexibility

Cloud-based Hadoop deployments allow businesses to ensure they always have the right solution for their current needs, with the flexibility to quickly scale up and deploy additional capacity when needed.

Cost-effectiveness

Ms Nemschoff said it is a myth that Hadoop is a cheap technology, as it will often require large investments in hardware, development, maintenance and expertise when it is deployed on-premises. However, with the cloud, many of these expenses are reduced or eliminated completely.

Real-time results

The generic open-source version of Hadoop can struggle to provide results at the speed that many firms expect in today’s environment. However, Ms Nemschoff noted that there are a number of cloud-based distributions available that can help with this by providing real-time analytics capabilities.