
What are the fundamental architectural, SQL compliance, and data use scenario differences between Presto and Impala?

Tags: impala, presto

Could someone give a succinct summary of the differences between Presto and Impala from these perspectives?

  1. Fundamental architecture design
  2. SQL compliance
  3. Real-world latency
  4. Single points of failure (SPOF) and fault tolerance
  5. Performance on structured and unstructured data

Yellow Duck asked Nov 12 '22 18:11


1 Answer

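Apache Impala is a query engine tied to the Hadoop ecosystem: it queries data stored in HDFS and related Hadoop storage, and relies on the Hive metastore for table definitions.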

Presto, on the other hand, is a generic query engine that supports HDFS as just one of many data sources. It ships with a long list of connectors, and Hive/HDFS is only one of them. This also means that you can query different data sources in the same system, at the same time, for example by joining tables from two catalogs in a single query.
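
To make the connector point concrete, here is a minimal sketch in Python that only builds the text of a cross-source query; it does not connect to anything. Presto addresses tables as `catalog.schema.table`, where each catalog is backed by a connector. The catalog, schema, table, and column names below (`hive.sales.orders`, `mysql.crm.customers`, and so on) are hypothetical and chosen purely for illustration, as are the helper functions.

```python
# Sketch only: every catalog, schema, table, and column name is hypothetical.

def qualified(catalog: str, schema: str, table: str) -> str:
    """Return a Presto-style fully qualified table name: catalog.schema.table."""
    return f"{catalog}.{schema}.{table}"


def build_federated_query() -> str:
    """Build one SQL statement that joins a Hive table with a MySQL table."""
    orders = qualified("hive", "sales", "orders")        # files in HDFS via the Hive connector
    customers = qualified("mysql", "crm", "customers")   # rows in a MySQL database
    return (
        "SELECT c.name, SUM(o.total) AS revenue\n"
        f"FROM {orders} AS o\n"
        f"JOIN {customers} AS c ON o.customer_id = c.id\n"
        "GROUP BY c.name"
    )


if __name__ == "__main__":
    # Print the statement; sending it to a cluster would need a Presto client.
    print(build_federated_query())
```

Assuming both catalogs are configured on the cluster, a statement of this shape could be submitted through any Presto client, and the engine would read from both sources in the same query.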

Manfred Moser answered Dec 26 '22 04:12