For example, I need to get a list of all available executors and their respective multithreading capacity (NOT the total multithreading capacity; sc.defaultParallelism already handles that).
Since this parameter is implementation-dependent (YARN and Spark standalone have different strategies for allocating cores) and situational (it may fluctuate because of dynamic allocation and long-running jobs), I cannot use any other method to estimate it. Is there a way to retrieve this information using the Spark API in a distributed transformation (e.g. TaskContext, SparkEnv)?
UPDATE: As of Spark 1.6, I have tried the following methods:
1) Run a 1-stage job with many partitions ( >> defaultParallelism ) and count the number of distinct thread IDs for each executor ID:
import org.apache.spark.SparkEnv

val n = sc.defaultParallelism * 16
sc.parallelize(1 to n, n)   // parallelize needs a Seq, not a bare count
  .map(v => SparkEnv.get.executorId -> Thread.currentThread().getId)
  .groupByKey()
  .mapValues(_.toSeq.distinct)
  .collect()
This, however, yields an estimate higher than the actual multithreading capacity, because each Spark executor uses an over-provisioned thread pool.
2) Similar to 1), except that n = defaultParallelism, and every task adds a delay to prevent the resource negotiator from imbalanced sharding (a fast node completes its task and asks for more before slow nodes can start running):
val n = sc.defaultParallelism
sc.parallelize(1 to n, n)
  .map { v =>
    Thread.sleep(5000)   // give every executor time to claim at least one task
    SparkEnv.get.executorId -> Thread.currentThread().getId
  }
  .groupByKey()
  .mapValues(_.toSeq.distinct)
  .collect()
It works most of the time, but is much slower than necessary and may be broken by a very imbalanced cluster or by task speculation.
3) I haven't tried this: use Java reflection to read BlockManager.numUsableCores. This is obviously not a stable solution; the internal implementation may change at any time.
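For completeness, a rough sketch of what 3) could look like (untested; the private field name numUsableCores is an assumption and may be renamed or name-mangled in other Spark versions):

import org.apache.spark.SparkEnv

// Untested sketch: read BlockManager's private numUsableCores via reflection.
// The field is an internal implementation detail, so this may break between versions.
val blockManager = SparkEnv.get.blockManager
val field = blockManager.getClass.getDeclaredField("numUsableCores")
field.setAccessible(true)
val numUsableCores = field.getInt(blockManager)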
Please tell me if you have found something better.
According to the recommendations discussed above: leave 1 core per node for the Hadoop/YARN daemons => cores available per node = 16 - 1 = 15. So the total available cores in the cluster = 15 x 10 = 150, and the number of available executors = (total cores / num-cores-per-executor) = 150 / 5 = 30.
The consensus in most Spark tuning guides is that 5 cores per executor is the optimum number of cores in terms of parallel processing.
The --num-executors YARN flag controls the number of executors requested. One executor is created on each node allocated with Slurm when using Spark in standalone mode (so that 5 executors would be created in the above example).
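Written out as code, the same back-of-the-envelope calculation looks like this (the node and core counts are just the figures from the example above):

// 10 nodes with 16 cores each, as in the example above
val numNodes = 10
val coresPerNode = 16
val coresPerExecutor = 5                                       // the commonly recommended value

val usableCoresPerNode = coresPerNode - 1                      // leave 1 core for Hadoop/YARN daemons => 15
val totalUsableCores   = usableCoresPerNode * numNodes         // 15 * 10 = 150
val numExecutors       = totalUsableCores / coresPerExecutor   // 150 / 5 = 30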
It is pretty easy with the Spark REST API. You have to get the application ID:
val applicationId = spark.sparkContext.applicationId
the UI URL:
val baseUrl = spark.sparkContext.uiWebUrl
and the query URL:
val url = baseUrl.map { url =>
  s"${url}/api/v1/applications/${applicationId}/executors"
}
With the Apache HTTP library (already among Spark's dependencies; adapted from https://alvinalexander.com/scala/scala-rest-client-apache-httpclient-restful-clients):
import org.apache.http.impl.client.DefaultHttpClient
import org.apache.http.client.methods.HttpGet
import scala.util.Try

val client = new DefaultHttpClient()

val response = url
  .flatMap(url => Try { client.execute(new HttpGet(url)) }.toOption)
  .flatMap(response => Try {
    // read the response body into a single JSON string
    val s = response.getEntity().getContent()
    val json = scala.io.Source.fromInputStream(s).getLines.mkString
    s.close
    json
  }.toOption)
and json4s:
import org.json4s._
import org.json4s.jackson.JsonMethods._
implicit val formats = DefaultFormats
case class ExecutorInfo(hostPort: String, totalCores: Int)
val executors: Option[List[ExecutorInfo]] = response.flatMap(json => Try {
  parse(json).extract[List[ExecutorInfo]]
}.toOption)
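From there it is a one-liner to get the per-executor core counts the question asks for:

// hostPort -> number of cores for every executor reported by the REST API
val coresByExecutor: Map[String, Int] =
  executors.getOrElse(Nil).map(e => e.hostPort -> e.totalCores).toMap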
As long as you keep the application ID and UI URL at hand, and open the UI port to external connections, you can do the same thing from any task.
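For instance, a minimal sketch of doing it from inside a transformation (it just ships the url built above to the executors via closure capture and fetches the raw JSON there; it assumes the UI port is reachable from the worker nodes):

// runs on the driver: fail early if the UI (and hence the REST API) is disabled
val endpoint = url.getOrElse(sys.error("Spark UI is disabled, no REST endpoint available"))

val executorJson: String = spark.sparkContext
  .parallelize(Seq(1), 1)
  .map { _ =>
    // runs inside a task: call back to the driver's REST API
    scala.io.Source.fromURL(endpoint).mkString
  }
  .collect()
  .head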
I would try to implement a SparkListener, in a way similar to what the web UI does. This code might be helpful as an example.
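For example, a minimal sketch of that idea (the tracker and listener names are mine; the map is a driver-side view kept up to date from executor add/remove events):

import scala.collection.concurrent.TrieMap
import org.apache.spark.scheduler.{SparkListener, SparkListenerExecutorAdded, SparkListenerExecutorRemoved}

// driver-side view of executorId -> total cores, maintained from listener events
object ExecutorCoreTracker {
  val coresByExecutor = TrieMap.empty[String, Int]
}

class ExecutorCoreListener extends SparkListener {
  override def onExecutorAdded(added: SparkListenerExecutorAdded): Unit =
    ExecutorCoreTracker.coresByExecutor(added.executorId) = added.executorInfo.totalCores

  override def onExecutorRemoved(removed: SparkListenerExecutorRemoved): Unit =
    ExecutorCoreTracker.coresByExecutor -= removed.executorId
}

// register it once, right after creating the SparkContext:
// sc.addSparkListener(new ExecutorCoreListener)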