I'm using spark streaming. According to the Spark Programming Guide (see http://spark.apache.org/docs/latest/programming-guide.html#accumulators), named accumulators will be displayed in the WebUI as below: <img src="https://i.stack.imgur.com/kTmlm.png" alt="Accumulators in Spark WebUI"> Unfortunately, I cannot find this anywhere. I am registering the accumulators like this (Java): <pre class="prettyprint"><code>LongAccumulator accumulator = new LongAccumulator(); ssc.sparkContext.sc().register(accumulator, "my accumulator"); </code></pre> I am using Spark 2.0.0.

I do not have a working streaming example but in non streaming example this UI could be found at the stages tab when choosing a specific stage. Also, I generally create the accumulator like this: <pre class="prettyprint"><code>val accum = sc.longAccumulator("My Accumulator") </code></pre> The equivalent in for spark streaming would probably be to replace sc with ssc.SparkContext

It worked for me. Below is my sample code <pre class="prettyprint"><code>Accumulator<Integer> spansWritten = jsc.sparkContext().intAccumulator(0,"Spans_Written"); JavaDStream dStream = SourceFactory.getSource().createStream(jsc) .map( s -> { spansWritten.add(1); return s; }); </code></pre> However, when I tried to use them inside a Decoder while creating stream for kafka, it didn't show up in the UI. Here is how it looks in the UI (select stages tab from the top, and click on one of the stage) screen shot

Spark accumulator not displayed in spark WebUI

Tags:

apache-spark

I'm using spark streaming. According to the Spark Programming Guide (see http://spark.apache.org/docs/latest/programming-guide.html#accumulators), named accumulators will be displayed in the WebUI as below: Accumulators in Spark WebUI Unfortunately, I cannot find this anywhere. I am registering the accumulators like this (Java):

LongAccumulator accumulator = new LongAccumulator();    
ssc.sparkContext.sc().register(accumulator, "my accumulator");

I am using Spark 2.0.0.

692

asked Apr 23 '15 18:04

Jack

2 Answers

I do not have a working streaming example but in non streaming example this UI could be found at the stages tab when choosing a specific stage. Also, I generally create the accumulator like this:

val accum = sc.longAccumulator("My Accumulator")

The equivalent in for spark streaming would probably be to replace sc with ssc.SparkContext

answered Sep 28 '22 15:09

Assaf Mendelson

It worked for me. Below is my sample code

Accumulator<Integer> spansWritten = jsc.sparkContext().intAccumulator(0,"Spans_Written");
JavaDStream<Span> dStream = SourceFactory.getSource().createStream(jsc)
    .map( s -> {
      spansWritten.add(1);
      return s;
    });

However, when I tried to use them inside a Decoder while creating stream for kafka, it didn't show up in the UI.

Here is how it looks in the UI (select stages tab from the top, and click on one of the stage) screen shot

answered Sep 28 '22 16:09

NAbbas

Related questions
                            
                                How to solve "Can't assign requested address: Service 'sparkDriver' failed after 16 retries" when running spark code?
                            
                                map values in a dataframe from a dictionary using pyspark
                            
                                Replacing whitespace in all column names in spark Dataframe
                            
                                Dropping multiple columns from Spark dataframe by Iterating through the columns from a Scala List of Column names
                            
                                pyspark approxQuantile function
                            
                                Spark: error reading DateType columns in partitioned parquet data
                            
                                Apache Spark shell crashes when trying to start executor on worker
                            
                                Spark RDD equivalent to Scala collections partition
                            
                                ON DUPLICATE KEY UPDATE while inserting from pyspark dataframe to an external database table via JDBC
                            
                                Why spark executor receives SIGTERM?
                            
                                Spark ML - MulticlassClassificationEvaluator - can we get precision/recall by each class label?
                            
                                Is proper event-time sessionization possible with Spark Structured Streaming?
                            
                                Python Spark Dataframes: Better way to export groups to text file
                            
                                Proper save/load of MatrixFactorizationModel
                            
                                How does Spark send closures to workers?
                            
                                Pyspark: applying kmeans on different groups of a dataframe
                            
                                Structured streaming - Metrics in Grafana

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With