
Redshift WLM config: how is unallocated memory used?

When you define Redshift query queues, you can assign the proportion of memory allocated to each queue. For example, if you had 5 queues, you might assign each of them 20% of the memory. However, you are also allowed to allocate the memory such that a portion of it remains unallocated.

In this documentation: http://docs.aws.amazon.com/redshift/latest/dg/cm-c-defining-query-queues.html it says, "Any unallocated memory is managed by Amazon Redshift and can be temporarily given to a queue if the queue requests additional memory for processing. For example, if you configure four queues, you can allocate memory as follows: 20 percent, 30 percent, 15 percent, 15 percent. The remaining 20 percent is unallocated and managed by the service."

Earlier in the documentation, it says, "If a specific query needs more memory than is allocated to a single query slot, you can increase the available memory by increasing the wlm_query_slot_count parameter. The following example sets wlm_query_slot_count to 10, performs a vacuum, and then resets wlm_query_slot_count to 1."
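The example the documentation refers to is along these lines (the table name here is just a placeholder):

    set wlm_query_slot_count to 10;  -- claim 10 slots in this queue for the session
    vacuum my_table;                 -- placeholder table name
    set wlm_query_slot_count to 1;   -- return to a single slot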

Is this related to the memory allocation? Can the query slot count adjustment be used to temporarily consume more memory than the whole queue is normally allowed?

I think my question is really about this part of the first quote, "Any unallocated memory is managed by Amazon Redshift and can be temporarily given to a queue if the queue requests additional memory for processing."

Does this mean that the user running a query has to specifically request the additional memory? Does this mean that leaving some memory unallocated is of no use unless you make these specific requests?

asked Mar 25 '16 by olanmills




2 Answers

wlm_query_slot_count and memory allocation for queues are two different concepts.

When you set the concurrency level of your cluster to 20, for example, you are creating 20 execution slots. If these smaller slots (compared to the default, larger 5 slots) are too small for some queries (such as VACUUM or large reports), you can give those specific queries multiple slots instead of a single one, using wlm_query_slot_count.

The allocation of resources (CPU, I/O and RAM) to the various slots doesn't have to be uniform: you can give some queues more memory than others if the queries sent to those queues need it. You can tell that more memory is needed when you see queries spilling to disk because they run out of memory during their calculations.
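A quick way to spot that is to look for query steps flagged as disk-based; this is a sketch, assuming the SVL_QUERY_SUMMARY system view and that recent queries are what you care about:

    -- Find recent query steps that spilled to disk (is_diskbased = 't')
    select query, step, label, workmem, is_diskbased
    from svl_query_summary
    where is_diskbased = 't'
    order by query desc
    limit 50;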

For each query that you run, Redshift will estimate the memory requirements based on the columns you are hitting and the functions you are applying to those columns (this is another good reason to keep column definitions as narrow as possible). If the WLM has unallocated memory, it can give some of it to the queries that need it.

Nevertheless, when you create such queue definitions you lose some of the cluster's flexibility to assign resources to queries. For example, you might end up with one queue that is completely jammed while other queues sit idle and waste cluster resources. Therefore, do it with care, and monitor the usage of these queues to verify that you are actually improving your cluster's prioritization and performance rather than hurting it.
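One way to monitor that is to compare time spent queued versus executing per queue; a sketch, assuming the STL_WLM_QUERY system table and the usual manual-WLM numbering where user-defined queues start at service_class 6:

    -- Average queue time vs. execution time per WLM queue (times are in microseconds)
    select service_class,
           count(*) as queries,
           avg(total_queue_time) / 1000000.0 as avg_queue_secs,
           avg(total_exec_time)  / 1000000.0 as avg_exec_secs
    from stl_wlm_query
    where service_class >= 6   -- user-defined queues (assumption: manual WLM numbering)
    group by service_class
    order by service_class;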

answered Oct 14 '22 by Guy


The short answer is: wlm_query_slot_count and unallocated memory management are two different, orthogonal things.

Think of wlm_query_slot_count as merging cells in Excel. If you have 5 cells (5 slots in a queue), each piece of text can by default only take 1 cell (1 slot). By setting wlm_query_slot_count explicitly for a query, you are telling Redshift to merge cells (slots) for that piece of text (query). So if you set wlm_query_slot_count to 3, this particular query will take 3 slots; it's like deciding to spread long text across 3 merged cells in Excel.

From the queue-management point of view, that is as if someone had already taken 3 slots. So only 2 more 1-slot queries are allowed into the queue; everyone else has to wait.

In terms of memory, a queue has a fixed overall memory allocation, spread equally between its slots. So if the whole queue has 100 GB of memory and 5 slots, each slot gets 20 GB. A query that was given 3 slots in this queue would then get 60 GB.
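You can check the actual per-slot numbers for your cluster with something like the following; a sketch, assuming the STV_WLM_SERVICE_CLASS_CONFIG system view, where (as far as I understand) query_working_mem is the per-slot working memory in MB and num_query_tasks is the slot count:

    -- Per-queue slot count and per-slot working memory for user-defined queues
    select service_class,
           num_query_tasks                     as slots,
           query_working_mem                   as mem_mb_per_slot,
           num_query_tasks * query_working_mem as total_queue_mem_mb
    from stv_wlm_service_class_config
    where service_class >= 6;   -- user-defined queues (assumption: manual WLM numbering)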

And "unallocated memory management" is orthogonal to that - regardless of slots and queues, if memory is needed and it is unallocated, Redshift at its own discretion can decide to give it to any query (I think the wording of "if the queue requests additional memory" is misleading), usually based on the plan/table statistics.

answered Oct 14 '22 by denismo