I run my Spark application on a YARN cluster. In my code I use the number of cores available to the queue to create partitions on my dataset:
Dataset ds = ...
ds.coalesce(config.getNumberOfCores());
My question: how can I get the number of cores available to the queue programmatically, rather than from configuration?
From basic math (X * Y = 15), we can see that there are four different executor-and-core combinations that get us to 15 Spark cores per node: 1 executor with 15 cores, 3 executors with 5 cores, 5 executors with 3 cores, or 15 executors with 1 core each.
The number of cores can be specified with the --executor-cores flag when invoking spark-submit, spark-shell, and pyspark from the command line, or by setting the spark.executor.cores property in the spark-defaults.conf file or on a SparkConf object.
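For example, a minimal Java sketch of setting this on a SparkConf before building the session (the value 5 and the app name are just illustrative choices, not taken from the question):

import org.apache.spark.SparkConf;
import org.apache.spark.sql.SparkSession;

// Equivalent to passing --executor-cores 5 on the spark-submit command line
SparkConf conf = new SparkConf()
        .setAppName("executor-cores-example")
        .set("spark.executor.cores", "5"); // illustrative value

SparkSession spark = SparkSession.builder()
        .config(conf)
        .getOrCreate();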
According to Databricks, if the driver and executors are of the same node type, this is the way to go:
java.lang.Runtime.getRuntime.availableProcessors * (sc.statusTracker.getExecutorInfos.length -1)
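A minimal sketch of the same calculation in the Java API, assuming an existing JavaSparkContext named jsc (the name is illustrative). The - 1 excludes the driver from the executor list, and availableProcessors() reports the driver node's cores, which is why the driver and executor node types must match:

import org.apache.spark.api.java.JavaSparkContext;

// Cores on the driver node (assumed equal to an executor node's cores)
int coresPerNode = Runtime.getRuntime().availableProcessors();
// getExecutorInfos() includes the driver, so subtract one
int executorCount = jsc.statusTracker().getExecutorInfos().length - 1;
int totalExecutorCores = coresPerNode * executorCount;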
I found this while looking for the answer to pretty much the same question, and it turns out that:
Dataset ds = ...
ds.coalesce(sc.defaultParallelism());
does exactly what the OP was looking for.
For example, my 5-node x 8-core cluster returns 40 for the defaultParallelism.
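Put together, a minimal Java sketch of that approach (the input path and app name are placeholders):

import org.apache.spark.sql.Dataset;
import org.apache.spark.sql.Row;
import org.apache.spark.sql.SparkSession;

SparkSession spark = SparkSession.builder().appName("coalesce-example").getOrCreate();

// Placeholder source; substitute your own dataset
Dataset<Row> ds = spark.read().parquet("/path/to/data");

// On YARN, defaultParallelism is typically the total number of executor cores,
// e.g. 5 nodes x 8 cores = 40
int parallelism = spark.sparkContext().defaultParallelism();
Dataset<Row> coalesced = ds.coalesce(parallelism);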