Spark UI History server on Kubernetes?

With spark-submit I launch an application on a Kubernetes cluster, and I can see the Spark UI only by going to http://driver-pod:port.

How can I start a Spark UI History Server on the cluster? And how can I make sure that all running Spark jobs are registered with the History Server?

Is this possible?

Asked Aug 11 '18 by JDev
People also ask

How do I get Spark UI in Kubernetes?

Accessing the Driver UI: the UI associated with any application can be accessed locally using kubectl port-forward. The Spark driver UI can then be reached at http://localhost:4040.
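For example (the pod name below is a placeholder; Spark on Kubernetes labels driver pods with spark-role=driver):

```shell
# Find the driver pod for your application (actual name will differ)
kubectl get pods -l spark-role=driver

# Forward local port 4040 to the driver's UI port
kubectl port-forward <driver-pod-name> 4040:4040
# The UI is then reachable at http://localhost:4040
```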

Can we run Spark on Kubernetes?

Spark can run on clusters managed by Kubernetes. This feature makes use of native Kubernetes scheduler that has been added to Spark. The Kubernetes scheduler is currently experimental. In future versions, there may be behavioral changes around configuration, container images and entrypoints.


1 Answer

Yes, it is possible. Briefly, you will need to ensure the following:

  • Make sure all your applications store event logs in a specific location (filesystem, S3, HDFS, etc.).
  • Deploy the history server in your cluster with access to that event log location.
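With plain spark-submit (outside the operator), the same two requirements map onto configuration flags; a sketch, where the API server address, bucket, and image are placeholders:

```shell
spark-submit \
  --master k8s://https://<k8s-apiserver>:6443 \
  --deploy-mode cluster \
  --conf spark.eventLog.enabled=true \
  --conf spark.eventLog.dir=s3a://my-bucket/spark-events \
  --conf spark.kubernetes.container.image=<spark-image> \
  --class org.apache.spark.examples.SparkPi \
  local:///opt/spark/examples/jars/spark-examples_2.11-2.4.4.jar
```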

Now, Spark by default only reads from a filesystem path, so I will elaborate on this case in detail using the spark-operator:

  • Create a PVC with a volume type that supports ReadWriteMany mode, for example an NFS volume. The following snippet assumes you already have a storage class for NFS (nfs-volume) configured:
apiVersion: v1
kind: PersistentVolumeClaim
metadata:
  name: spark-pvc
  namespace: spark-apps
spec:
  accessModes:
    - ReadWriteMany
  volumeMode: Filesystem
  resources:
    requests:
      storage: 5Gi
  storageClassName: nfs-volume
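Once the claim is created, verify it is bound before pointing applications at it (the file name below is a placeholder for wherever you saved the manifest):

```shell
kubectl apply -f spark-pvc.yaml
kubectl get pvc spark-pvc -n spark-apps   # STATUS should show Bound
```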
  • Make sure all your Spark applications have event logging enabled and pointed at the correct path:
  sparkConf:
    "spark.eventLog.enabled": "true"
    "spark.eventLog.dir": "file:/mnt"
  • Mount the event logs volume into each application pod (you can also use the operator's mutating webhook to centralize this). An example manifest with the mentioned configuration is shown below:
---
apiVersion: "sparkoperator.k8s.io/v1beta2"
kind: SparkApplication
metadata:
  name: spark-java-pi
  namespace: spark-apps

spec:
  type: Java
  mode: cluster

  image: gcr.io/spark-operator/spark:v2.4.4
  mainClass: org.apache.spark.examples.SparkPi
  mainApplicationFile: "local:///opt/spark/examples/jars/spark-examples_2.11-2.4.4.jar"

  imagePullPolicy: Always
  sparkVersion: 2.4.4
  sparkConf:
    "spark.eventLog.enabled": "true"
    "spark.eventLog.dir": "file:/mnt"
  restartPolicy:
    type: Never
  volumes:
    - name: spark-data
      persistentVolumeClaim:
        claimName: spark-pvc
  driver:
    cores: 1
    coreLimit: "1200m"
    memory: "512m"
    labels:
      version: 2.4.4
    serviceAccount: spark
    volumeMounts:
      - name: spark-data
        mountPath: /mnt
  executor:
    cores: 1
    instances: 1
    memory: "512m"
    labels:
      version: 2.4.4
    volumeMounts:
      - name: spark-data
        mountPath: /mnt
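The manifest above can be submitted and checked like any other operator resource (the file name is a placeholder; the operator labels pods with sparkoperator.k8s.io/app-name):

```shell
kubectl apply -f spark-java-pi.yaml
kubectl get sparkapplication spark-java-pi -n spark-apps

# Inspect the pods created for the application
kubectl get pods -n spark-apps -l sparkoperator.k8s.io/app-name=spark-java-pi
```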

  • Install the Spark history server, mounting the shared volume. You will then have access to the events in the history server UI:
apiVersion: apps/v1
kind: Deployment

metadata:
  name: spark-history-server
  namespace: spark-apps

spec:
  replicas: 1
  selector:
    matchLabels:
      app: spark-history-server

  template:
    metadata:
      name: spark-history-server
      labels:
        app: spark-history-server

    spec:
      containers:
        - name: spark-history-server
          image: gcr.io/spark-operator/spark:v2.4.0

          resources:
            requests:
              memory: "512Mi"
              cpu: "100m"

          env:
            - name: SPARK_HISTORY_OPTS
              value: "-Dspark.history.fs.logDirectory=file:/data/"

          command:
            -  /sbin/tini
            - -s
            - --
            - /opt/spark/bin/spark-class
            - org.apache.spark.deploy.history.HistoryServer

          ports:
            - name: http
              protocol: TCP
              containerPort: 18080

          readinessProbe:
            timeoutSeconds: 4
            httpGet:
              path: /
              port: http

          livenessProbe:
            timeoutSeconds: 4
            httpGet:
              path: /
              port: http

          volumeMounts:
            - name: data
              mountPath: /data
      volumes:
      - name: data
        persistentVolumeClaim:
          claimName: spark-pvc
          readOnly: true

Feel free to configure an Ingress or a Service for accessing the UI.
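A minimal Service for the deployment above could look like this (the exposure details — NodePort, LoadBalancer, Ingress — depend on your cluster):

```yaml
apiVersion: v1
kind: Service
metadata:
  name: spark-history-server
  namespace: spark-apps
spec:
  selector:
    app: spark-history-server
  ports:
    - name: http
      port: 18080
      targetPort: 18080
```

With that in place, kubectl port-forward svc/spark-history-server 18080:18080 -n spark-apps gives quick access without an Ingress.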

Also, you can use Google Cloud Storage, Azure Blob Storage, or AWS S3 as the event log location. For this you will need to install some extra JARs, so I would recommend having a look at the Lightbend Spark history server image and charts.
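As a sketch, pointing event logs at S3 would look like the following sparkConf fragment (the bucket name is a placeholder, the credentials mechanism is up to you, and the hadoop-aws and AWS SDK JARs must be on the classpath):

```yaml
sparkConf:
  "spark.eventLog.enabled": "true"
  "spark.eventLog.dir": "s3a://my-bucket/spark-events"
  "spark.hadoop.fs.s3a.impl": "org.apache.hadoop.fs.s3a.S3AFileSystem"
```

The history server side would then set spark.history.fs.logDirectory to the same s3a:// path instead of the mounted volume.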

Answered Sep 18 '22 by Qasim Sarfraz