Athena: Query exhausted resources at scale factor

Tags:

I am running a query like:

SELECT f.*, p.countryName, p.airportName, a.name AS agentName
FROM (
    SELECT 
        f.outboundlegid, 
        f.inboundlegid,
        f.querydatetime,
        cast(f.agent as bigint) as agent,
        cast(f.querydestinationplace as bigint) as querydestinationplace,
        f.queryoutbounddate,
        f.queryinbounddate,
        f.quoteageinminutes,
        f.price
    FROM flights f
    WHERE querydatetime >= '2018-01-02'
    AND querydatetime <= '2019-01-10'
) f
INNER JOIN (
  SELECT airportId, airportName, countryName
  FROM airports
  WHERE countryName IN ('Philippines', 'Indonesia', 'Malaysia', 'Hong Kong', 'Thailand', 'Vietnam')
) p
ON f.querydestinationplace = p.airportId
INNER JOIN agents a
ON f.agent = a.id
ORDER BY f.outboundlegid, f.inboundlegid, f.agent, querydatetime DESC

What's wrong with it? Or how can I optimize it? It gives me

Query exhausted resources at this scale factor

I have a flights table and I want to query for flights inside a specific country

938

asked Jan 26 '19 05:01

Jiew Meng

1 Answers

I have been facing this problem since the begining of Athena, the problem is the ORDER BY clause. Athena is just an EMR cluster with hive and prestodb installed. The problem you are facing is: Even if your query is distributed across X numbers of nodes, the ordering phase must be done by just a single node, the master node in this case. So at the end, you can order as much data as memory have the master node.

You can test it by reducing the amount of data the query returns maybe reducing the time range.

114

answered Sep 23 '22 00:09

Roberto

Related questions
                            
                                Oracle SQL pivot query
                            
                                Maximum length of an SQL Query
                            
                                SQL query, select nearest places by a given coordinates [duplicate]
                            
                                Creating PostgreSQL tables + relationships - PROBLEMS with relationships - ONE TO ONE
                            
                                SQL Server - An expression of non-boolean type specified in a context where a condition is expected, near 'RETURN'
                            
                                MySQL Won't let User Login: Error 1524
                            
                                convert any date string to timestamp without timezone
                            
                                MySQL SELECT DISTINCT should be case sensitive?
                            
                                How to add a custom column with a default value in an sql query?
                            
                                Are there any free tools to generate 'INSERT INTO' scripts in MS SQL Server? [closed]
                            
                                Getting the next ID without inserting a row
                            
                                How can I get a list of element names from an XML value in SQL Server
                            
                                oracle call stored procedure inside select
                            
                                What's the R equivalent of SQL's LIKE 'description%' statement?
                            
                                DB2- How to check if varchar field value has integers
                            
                                Trying to sum distinct values SQL
                            
                                counting the amount of rows returned with a query in laravel
                            
                                CONCAT_WS() for SQL Server
                            
                                "Incorrect syntax near 'OFFSET'" modift sql comm 2012 to 2008
                            
                                Saving enumerated values to a database

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Athena: Query exhausted resources at scale factor

Tags:

sql

amazon-web-services

query-optimization

amazon-athena

presto

Jiew Meng

People also ask

1 Answers

Roberto

Recent Activity

Donate For Us