Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Athena vs Redshift Spectrum

I am kind of evaluating Athena & Redshift Spectrum. Both serve the same purpose, Spectrum needs a Redshift cluster in place whereas Athena is pure serverless. Athena uses Presto and Spectrum uses its Redshift's engine

Are there any specific disadvantages for Athena or Redshift spectrum? Any limitations on using Athena or Spectrum ?

like image 257
Mukund Avatar asked May 09 '18 09:05

Mukund


People also ask

Is Redshift spectrum faster than Athena?

Query performanceRedshift Spectrum may provide a more stable querying experience than Athena. This is because Redshift Spectrum compute engine distributes queries across thousands of nodes, leading to a fast performance that is not derailed even by lots of parallel queries and high concurrency.

What is the difference between Athena and spectrum?

While both Spectrum and Athena are serverless, they differ in that Athena relies on pooled resources provided by AWS to return query results, whereas Spectrum resources are allocated according to your Redshift cluster size. This means that using Redshift Spectrum gives you more control over performance.

What is the difference between Athena and Redshift?

Athena has an edge in terms of portability and cost, whereas Redshift stands tall in terms of performance and scale. On the other hand, Redshift is a petabyte-scale data warehouse used together with business intelligence tools for modern analytical solutions.

What is the difference between Redshift and Redshift spectrum?

redshift supports automated tasks for configuring, monitoring, backing up, and securing the data warehouse. Redshift Spectrum is the ability to perform analytics directly on the data in the Amazon s3 cluster using a Redshift node.


1 Answers

I have used both across a few different use cases and conclude:

Advantages of Redshift Spectrum:

  • Allows creation of Redshift tables
  • Able to join Redshift tables with Redshift spectrum tables efficiently

If you do not need those things then you should consider Athena as well

Athena differences from Redshift spectrum:

  • Billing. This is the major difference and depending on your use case you may find one much cheaper than the other
  • Performance. I found Athena slightly faster.
  • SQL syntax and features. Athena is derived from presto and is a bit different to Redshift which has its roots in postgres.
  • Connectivity. Its easy enough to connect to Athena using API,JDBC or ODBC but many more products offer "standard out of the box" connection to Redshift

Also, for either solution, make sure you use the AWS Glue metadata, rather than Athena as there are fewer limitations.

like image 113
Jon Scott Avatar answered Sep 23 '22 00:09

Jon Scott