
How to access Spark Web UI?

Tags:

apache-spark

I'm running a Spark application locally with 4 nodes. When I run the application, it shows my driver at this address, 10.0.2.15:

INFO Utils: Successfully started service 'SparkUI' on port 4040.
INFO SparkUI: Bound SparkUI to 0.0.0.0, and started at http://10.0.2.15:4040

At the end of the run it displays:

INFO SparkUI: Stopped Spark web UI at http://10.0.2.15:4040
INFO MapOutputTrackerMasterEndpoint: MapOutputTrackerMasterEndpoint stopped!
INFO MemoryStore: MemoryStore cleared
INFO BlockManager: BlockManager stopped
INFO BlockManagerMaster: BlockManagerMaster stopped
INFO OutputCommitCoordinator$OutputCommitCoordinatorEndpoint: OutputCommitCoordinator stopped!
INFO SparkContext: Successfully stopped SparkContext

I tried to access the Spark web UI at 10.0.2.15:4040, but the page is inaccessible. Trying the address below didn't help either:

 http://localhost:18080

Using ping 10.0.2.15 the result is:

Send a request 'Ping' 10.0.2.15 with 32 bytes of data
Waiting time exceeded
Waiting time exceeded
Waiting time exceeded
Waiting time exceeded
Ping statistics for 10.0.2.15: Packages: sent = 4, received = 0, lost = 4 (100% loss)

I checked port 4040 with netstat -a to see which ports are in use. The result is as follows:

   Active connections:

    Proto        Local address        Remote address                      State
    TCP          127.0.0.1:4040      DESKTOP-FF4U.....:0                 Listening

P.S. My code itself runs successfully. What could be the reason?

942

asked Dec 25 '16 16:12

sirine

1 Answer

INFO Utils: Successfully started service 'SparkUI' on port 4040.
INFO SparkUI: Bound SparkUI to 0.0.0.0, and started at http://10.0.2.15:4040

That's how Spark reports that the web UI (which is known as SparkUI internally) is bound to the port 4040.

As long as the Spark application is up and running, you can access the web UI at http://10.0.2.15:4040.

INFO SparkUI: Stopped Spark web UI at http://10.0.2.15:4040
...
INFO SparkContext: Successfully stopped SparkContext

This is when a Spark application has finished (it does not really matter whether it finished properly or not). From now on, the web UI (at http://10.0.2.15:4040) is no longer available.

I tried to access the Spark Web by: 10.0.2.15:4040 but the page is inaccessible.

That's the expected behaviour of a Spark application. Once it has completed, port 4040 (the default port of the web UI) is no longer available.
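To see this for yourself, you can probe the port while the application runs and again after it stops. The snippet below is only a minimal sketch using the Python standard library; the helper name `is_port_open` is made up for illustration and is not part of Spark.

```python
import socket


def is_port_open(host: str, port: int, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        # create_connection raises OSError (refused, unreachable, timed out)
        # when nothing accepts the connection.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False
```

Calling `is_port_open("127.0.0.1", 4040)` should return True only while a Spark application is still serving its web UI on that address, and False once the SparkContext has been stopped. The host to probe is an assumption; use whichever address the UI is actually bound to on your machine.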

Trying the address below didn't help either: http://localhost:18080

18080 is the default port of Spark History Server. It is a separate process and may or may not be running, regardless of whether any Spark applications are running.

Spark History Server is completely different from a Spark application. Quoting the official Spark docs:

It is still possible to construct the UI of an application through Spark’s history server, provided that the application’s event logs exist. You can start the history server by executing:
./sbin/start-history-server.sh
This creates a web interface at http://<server-url>:18080 by default, listing incomplete and completed applications and attempts.

As the docs say, you have to start Spark History Server yourself for 18080 to be available.
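Once the history server is up, one way to confirm it is serving anything is to query its REST API, which lists applications under /api/v1/applications. The following is a rough sketch, assuming the server runs on localhost with the default port; the function name `list_applications` is hypothetical.

```python
import json
from urllib.request import urlopen


def list_applications(base_url: str = "http://localhost:18080", timeout: float = 5.0):
    """Return (id, name) pairs for the applications the server knows about.

    base_url is an assumption: point it at wherever the history server runs.
    """
    with urlopen(f"{base_url}/api/v1/applications", timeout=timeout) as resp:
        apps = json.load(resp)  # the endpoint returns a JSON array of objects
    return [(app["id"], app["name"]) for app in apps]
```

An empty list would suggest the server is running but has found no event logs, which points at the event log configuration rather than at the server itself. If the call raises a connection error, the history server is most likely not running.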

Moreover, you have to set the spark.eventLog.enabled and spark.eventLog.dir configuration properties to be able to view the logs of Spark applications once they've completed their execution. Quoting the official Spark docs:

The spark jobs themselves must be configured to log events, and to log them to the same shared, writable directory. For example, if the server was configured with a log directory of hdfs://namenode/shared/spark-logs, then the client-side options would be:
spark.eventLog.enabled true
spark.eventLog.dir hdfs://namenode/shared/spark-logs
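If you would rather not edit the configuration file, the same two properties can be passed on the command line with spark-submit's --conf option. Here is a small sketch that builds those arguments; the helper `event_log_conf_args` is hypothetical, and the log directory is whatever your history server was configured to read.

```python
from typing import List


def event_log_conf_args(log_dir: str) -> List[str]:
    """Build spark-submit --conf arguments that turn on event logging."""
    settings = {
        "spark.eventLog.enabled": "true",
        "spark.eventLog.dir": log_dir,  # must match the history server's log directory
    }
    args: List[str] = []
    for key, value in settings.items():
        args += ["--conf", f"{key}={value}"]
    return args
```

For example, `event_log_conf_args("hdfs://namenode/shared/spark-logs")` yields `["--conf", "spark.eventLog.enabled=true", "--conf", "spark.eventLog.dir=hdfs://namenode/shared/spark-logs"]`, which would go between `spark-submit` and the application jar or script.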

152

answered Oct 29 '22 14:10

Jacek Laskowski
