Are there any performance benchmark(genuine ones) that compare Stinger vs Impala vs Drill? Also, which is preferred - my use case will be mainly towards ad-hoc interactive queries on top of Hive. Thanks.
Apache Drill enables analysts, business users, data scientists and developers to explore and analyze this data without sacrificing the flexibility and agility offered by these datastores. Drill processes the data in-situ without requiring users to define schemas or transform data.
At the core of Apache Drill is the 'Drillbit' service, which is responsible for accepting requests from the client, processing the queries, and returning results to the client. A Drillbit service can be installed and run on all of the required nodes in a Hadoop cluster to form a distributed cluster environment.
There are some performance numbers on the site http://allegro.tech/fast-data-hackathon.html.
In general, we see Drill and Impala are comparable in performance for the interactive queries with the differentiation of Drill being its ability to query without metadata definitions and its ease of use working with JSON data.
Note that these tests are on much older versions on Drill such as 0.8/0.9 (also not configured appropriately for data locality). Now Drill is 1.1 with a lot of improvements on SQL (window functions etc) and performance.
You cannot do benchmark like this, it's no sense and you should never trust a such benchmark.
Everything will depend on your own data, you have JSON files ? prefer Drill. You want to query more than 1TB, prefer Hive and so on.
Also, you may consider file format, JSON, Kudu, Parquet or ORC.
Then come the optimization, Hive+Tez seems better for parrarel queries but very slow for single query. Whereas Impala is the opposite (MapReduce versus MassiveParrarelProcessing).
Also, you want to consider the hardware ressource, disk SSD or not etc..
I recommend, start with Apache Drill + JSON file, then try Apache Drill with Parquet or ORC.
If you want help, describe exactly what you have (data + hardware) and what you want.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With