I cannot use the --packages option on the bitnami/spark Docker container

I pulled the Docker image and ran the following commands:

  1. docker run -it bitnami/spark:latest /bin/bash

  2. spark-shell --packages="org.elasticsearch:elasticsearch-spark-20_2.11:7.5.0"

and I got the error message below:

Ivy Default Cache set to: /opt/bitnami/spark/.ivy2/cache
The jars for the packages stored in: /opt/bitnami/spark/.ivy2/jars
:: loading settings :: url = jar:file:/opt/bitnami/spark/jars/ivy-2.4.0.jar!/org/apache/ivy/core/settings/ivysettings.xml
org.elasticsearch#elasticsearch-spark-20_2.11 added as a dependency
:: resolving dependencies :: org.apache.spark#spark-submit-parent-c785f3e6-7c78-469f-ab46-451f8be61a4c;1.0
        confs: [default]
Exception in thread "main" java.io.FileNotFoundException: /opt/bitnami/spark/.ivy2/cache/resolved-org.apache.spark-spark-submit-parent-c785f3e6-7c78-469f-ab46-451f8be61a4c-1.0.xml (No such file or directory)
        at java.io.FileOutputStream.open0(Native Method)
        at java.io.FileOutputStream.open(FileOutputStream.java:270)
        at java.io.FileOutputStream.<init>(FileOutputStream.java:213)
        at java.io.FileOutputStream.<init>(FileOutputStream.java:162)
        at org.apache.ivy.plugins.parser.xml.XmlModuleDescriptorWriter.write(XmlModuleDescriptorWriter.java:70)
        at org.apache.ivy.plugins.parser.xml.XmlModuleDescriptorWriter.write(XmlModuleDescriptorWriter.java:62)
        at org.apache.ivy.core.module.descriptor.DefaultModuleDescriptor.toIvyFile(DefaultModuleDescriptor.java:563)
        at org.apache.ivy.core.cache.DefaultResolutionCacheManager.saveResolvedModuleDescriptor(DefaultResolutionCacheManager.java:176)
        at org.apache.ivy.core.resolve.ResolveEngine.resolve(ResolveEngine.java:245)
        at org.apache.ivy.Ivy.resolve(Ivy.java:523)
        at org.apache.spark.deploy.SparkSubmitUtils$.resolveMavenCoordinates(SparkSubmit.scala:1300)
        at org.apache.spark.deploy.DependencyUtils$.resolveMavenDependencies(DependencyUtils.scala:54)
        at org.apache.spark.deploy.SparkSubmit.prepareSubmitEnvironment(SparkSubmit.scala:304)
        at org.apache.spark.deploy.SparkSubmit.org$apache$spark$deploy$SparkSubmit$$runMain(SparkSubmit.scala:774)
        at org.apache.spark.deploy.SparkSubmit.doRunMain$1(SparkSubmit.scala:161)
        at org.apache.spark.deploy.SparkSubmit.submit(SparkSubmit.scala:184)
        at org.apache.spark.deploy.SparkSubmit.doSubmit(SparkSubmit.scala:86)
        at org.apache.spark.deploy.SparkSubmit$$anon$2.doSubmit(SparkSubmit.scala:920)
        at org.apache.spark.deploy.SparkSubmit$.main(SparkSubmit.scala:929)
        at org.apache.spark.deploy.SparkSubmit.main(SparkSubmit.scala)

I tried other packages, but they all fail with the same error message.

Can you give me some advice on how to avoid this error?

asked Mar 11 '20 by sun lim



2 Answers

Found the solution, as described in https://github.com/bitnami/bitnami-docker-spark/issues/7. What we have to do is create a volume on the host mapped to a path inside the container:

volumes:
  - ./jars_dir:/opt/bitnami/spark/ivy:z

Then pass this path as the Ivy cache path, like this:

spark-shell \
  --conf spark.jars.ivy=/opt/bitnami/spark/ivy \
  --conf spark.cassandra.connection.host=127.0.0.1 \
  --packages com.datastax.spark:spark-cassandra-connector_2.12:3.0.0-beta \
  --conf spark.sql.extensions=com.datastax.spark.connector.CassandraSparkExtensions

This all happens because /opt/bitnami/spark is not writable, so we have to mount a volume to work around that.
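If you are not using docker-compose, the same workaround can be applied with a plain docker run. A minimal sketch, assuming a host directory named jars_dir and the elasticsearch package from the question:

# mount a writable host dir and point spark.jars.ivy at it inside the container
docker run -it \
  -v "$(pwd)/jars_dir:/opt/bitnami/spark/ivy:z" \
  bitnami/spark:latest \
  spark-shell \
    --conf spark.jars.ivy=/opt/bitnami/spark/ivy \
    --packages org.elasticsearch:elasticsearch-spark-20_2.11:7.5.0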

answered Oct 19 '22 by palash kulshreshtha

The error "java.io.FileNotFoundException: /opt/bitnami/spark/.ivy2/" occurred because the location /opt/bitnami/spark/ is not writable. To resolve the issue, modify the Spark master service as follows: run the container as the root user and mount a volume for the required jars.

See the working spark service block from the docker-compose file:

spark:
  image: docker.io/bitnami/spark:3
  container_name: spark
  environment:
    - SPARK_MODE=master
    - SPARK_RPC_AUTHENTICATION_ENABLED=no
    - SPARK_RPC_ENCRYPTION_ENABLED=no
    - SPARK_LOCAL_STORAGE_ENCRYPTION_ENABLED=no
    - SPARK_SSL_ENABLED=no
  user: root
  ports:
    - '8880:8080'
  volumes:
    - ./spark-defaults.conf:/opt/bitnami/spark/conf/spark-defaults.conf
    - ./jars_dir:/opt/bitnami/spark/ivy:z
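The contents of the mounted spark-defaults.conf are not shown in the answer; presumably it points Spark's Ivy directory at the mounted volume so that --packages works without passing --conf every time. A minimal sketch of such a file (an assumption, not part of the original answer):

# spark-defaults.conf: resolve --packages dependencies into the writable mounted dir
spark.jars.ivy /opt/bitnami/spark/ivy

With that in place you can attach to the running container (for example, docker exec -it spark spark-shell --packages ...) and dependencies resolve into the writable /opt/bitnami/spark/ivy directory instead of the read-only default location.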
answered Oct 19 '22 by krishna kumar mishra