Kognitio: ultra-fast SQL engine for Hadoop

Hadoop has fundamentally changed how businesses approach processing and storing very large, fast-moving data sets. It’s flexible too: whether your data is structured, semi-structured or unstructured, Hadoop can handle it. Kognitio on Hadoop lets you query that big data at speed and at scale.

Get faster answers to bigger questions.

Does Hadoop have a problem?

Businesses have spent time and effort getting data into Hadoop, but it takes a serious dose of technical know-how to get the most out of it.

Access to the data can end up limited to expert data scientists rather than the wider business. Hadoop is also less than ideal for giving hundreds of concurrent users super-fast access to big data.

So, we built Kognitio on Hadoop specifically for massively parallel analytical query processing, directly on your Hadoop cluster, for all your business users. Now you can run hundreds of concurrent SQL queries over your big data sets, and the data in your Hadoop cluster is as queryable as your relational database. Sounds more like it, doesn’t it?

Why Kognitio on Hadoop?

With Kognitio, you get huge query throughput and high concurrency to serve all your users, directly on Hadoop.

Kognitio on Hadoop gives you:

One-server-fits-all design

A design that eliminates the need for workload-specific tuning

Mixed-workload use cases

The ability to run mixed-workload use cases at the same time

Ability to write in more languages

Facility to write code in other languages such as Python and R, and invoke that code from inside your queries within Kognitio
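As a rough illustration of the kind of Python code that could be invoked from a query, here is a minimal sketch. It assumes, and this page does not confirm, that rows reach the script as CSV on standard input and that results go back as CSV on standard output. The function name `score_rows` and the two-column layout (an id and a text field) are hypothetical.

```python
import csv
import io
import sys


def score_rows(in_stream, out_stream):
    """Read (id, text) rows as CSV and write (id, word_count) rows as CSV.

    Assumption: the calling engine streams rows in and out as CSV text.
    """
    reader = csv.reader(in_stream)
    writer = csv.writer(out_stream, lineterminator="\n")
    for row in reader:
        if len(row) < 2:
            continue  # skip blank or malformed rows
        row_id, text = row[0], row[1]
        writer.writerow([row_id, len(text.split())])


# When run as a script, the call would be: score_rows(sys.stdin, sys.stdout)
```

Keeping the logic in a function that takes streams means it can be tried locally with `io.StringIO` before being wired into a query.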

Kognitio on Hadoop architecture

Not all SQL on Hadoop solutions are created equal.

There are more than twenty different SQL on Hadoop platforms and multiple use cases for deployment, so the market is far from monocultural. Each SQL on Hadoop engine is optimized for some use cases but not others.

To learn more about how SQL on Hadoop platforms compare, view the full comparison chart.

Installing Kognitio on Hadoop

Before you begin with Kognitio on Hadoop, you’ll need:

A Hadoop cluster, version 2.6 or later

YARN, HDFS, and ZooKeeper running on your cluster

An edge node for the Kognitio installation. This can be any of the existing cluster nodes.

Linux running on your edge, master and worker nodes
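The version requirement above can be checked with a few lines of Python before installing. This is a minimal sketch, not part of the Kognitio installer: it assumes the text it receives is the output of the `hadoop version` command, whose first line has the form `Hadoop 2.7.3`, and the helper names are hypothetical.

```python
import re

MINIMUM_HADOOP = (2, 6)  # minimum version from the prerequisite list above


def parse_hadoop_version(version_output):
    """Extract (major, minor) from text such as 'Hadoop 2.7.3'."""
    match = re.search(r"Hadoop\s+(\d+)\.(\d+)", version_output)
    if match is None:
        raise ValueError("no Hadoop version found in output")
    return int(match.group(1)), int(match.group(2))


def meets_minimum(version_output, minimum=MINIMUM_HADOOP):
    """Return True if the reported Hadoop version is at least `minimum`."""
    return parse_hadoop_version(version_output) >= minimum
```

On an edge node, the input could come from `subprocess.run(["hadoop", "version"], capture_output=True, text=True).stdout`. The sketch does not check that YARN, HDFS and ZooKeeper are running.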

Comparing SQL on Hadoop engines

There are many SQL on Hadoop distributions with different strengths and limitations. Read how we benchmark SQL on Hadoop.

Support for Kognitio

Kognitio is free to use, and both free and paid support options are available.

Community

You can get free support via our Community Forum.

Web support

Access to our customer support portal, and guaranteed response times for web premium support.

Enterprise

For customers who want to include phone support, and for premium enterprise customers to raise priority 1 problems.

Frequently Asked Questions

Is Kognitio really free?

Yes. If you deploy Kognitio on Hadoop, it’s completely free to use, with no restrictions on time, scale or functionality. Paid support contracts are available for customers who want support for production environments.

If you deploy Kognitio on standalone servers or on MapR, it’s free up to 512GB RAM. But should you need more RAM to evaluate Kognitio properly, just get in touch.

Is there a community and an enterprise version?

No, we have one version of Kognitio which comes with full functionality and various deployment options.

Where does your revenue come from if your products are free?

Kognitio on Hadoop is free, and our revenue from it comes from customers who take out paid support.

If you deploy Kognitio on standalone servers or on MapR, you will require a software license for use above 512GB RAM.

What support is available?

We offer a range of support options for Kognitio, from web only to full enterprise support. For more information visit our support page.

Why is Kognitio so much faster than other SQL engines?

There are large variations in the performance, flexibility and maturity of available SQL engines.

Hive, Impala and SparkSQL, for example, are new SQL implementations that were developed from scratch for Hadoop. Yet SQL is a very large, complex standard. It is difficult enough to implement on a serial platform, and implementing it in parallel is incredibly difficult and time-consuming.

Kognitio has been developing parallel SQL for 25 years, so our SQL engine is much more mature and is proven to scale out for the high concurrency required by business users. Kognitio is also a true in-memory engine.

Which version of Kognitio should I be installing?

If you intend to use Kognitio on a Hadoop cluster under YARN, you should install the “on Hadoop” version.

If you intend to use Kognitio on MapR, download the MapR version.

If you are not running Hadoop or MapR, then you should install the standalone version of Kognitio.
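The version and licensing rules in this FAQ can be summarised in a short sketch. The function names and platform strings are hypothetical, chosen only for illustration; the 512GB threshold and the three versions come from the answers above.

```python
FREE_RAM_LIMIT_GB = 512  # free-use limit for standalone and MapR deployments


def edition_for(platform):
    """Map a deployment platform to the Kognitio version to install."""
    platform = platform.lower()
    if platform == "hadoop":
        return "on Hadoop"
    if platform == "mapr":
        return "MapR"
    return "standalone"


def needs_license(platform, ram_gb):
    """Return True if a software license is required for this deployment."""
    if edition_for(platform) == "on Hadoop":
        return False  # free with no restrictions on scale
    return ram_gb > FREE_RAM_LIMIT_GB
```

For example, `needs_license("mapr", 1024)` is true, while `needs_license("hadoop", 1024)` is false.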

Anything we’ve not covered? Read the full list of FAQs here.

Want to know more about Kognitio on Hadoop?

Read our blog.

Hadoop’s biggest problem, and how to fix it

6th August 2018 | By Mark Chopping

Hadoop was seen as a silver bullet for many companies, but recently there has been an increase in critical headlines like “Hadoop Has Failed Us, Tech Experts Say”, “You’re doing Hadoop and Spark wrong, and they will probably fail” and “Has Hadoop Failed? That’s the Wrong Question”. Dig behind the headlines, and a…

What is Hadoop good for?

6th August 2018 | By Roger Gaskell

It must be confusing if you are a Hadoop newbie at the moment. There are so many conflicting opinions about what it should be used for, as anecdotally demonstrated at the end of this beautifully written piece about the Big Data London event. At one end of the spectrum you have people saying, “it’s only…

Big data sets – where to find and how to harvest them

6th August 2018 | By Chak Leung

Finding big data sets to work with isn’t easy. People who’d like to share need to be wary of the sensitivity of the data they’re sharing (e.g. social media data is very personal) and generally a lot of interesting data sets are off limits because of this. I’ve detailed in this blog some repositories for…

Still got questions? Give us a call on +44 (0)1344 300770

Get started with a free download today
and never pay a penny for the software!
