So I've been using sbt with assembly to package all my dependencies into a single jar for my Spark jobs. I've got several jobs where I was using c3p0 to set up connection pool information, broadcast that out, and then use foreachPartition on the RDD to grab a connection and insert the data into the database. In my sbt build script, I include
"mysql" % "mysql-connector-java" % "5.1.33"
This makes sure the JDBC connector is packaged up with the job. Everything works great.
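Roughly the pattern I mean, as a sketch (the RDD, table, and column names here are made up, and I'm assuming an existing SparkContext sc):

import com.mchange.v2.c3p0.ComboPooledDataSource

// Broadcast only the connection settings; build the pool/connection on the executors.
val jdbcSettings = sc.broadcast(Map(
  "url" -> "jdbc:mysql://some.domain.com/myschema",
  "user" -> "user",
  "password" -> "password"))

rdd.foreachPartition { rows =>
  val cpds = new ComboPooledDataSource()
  cpds.setDriverClass("com.mysql.jdbc.Driver") // the Connector/J class bundled by assembly
  cpds.setJdbcUrl(jdbcSettings.value("url"))
  cpds.setUser(jdbcSettings.value("user"))
  cpds.setPassword(jdbcSettings.value("password"))
  val conn = cpds.getConnection
  try {
    val stmt = conn.prepareStatement("INSERT INTO mytable (col1) VALUES (?)")
    rows.foreach { r =>
      stmt.setString(1, r.toString)
      stmt.addBatch()
    }
    stmt.executeBatch()
    stmt.close()
  } finally {
    conn.close()
    cpds.close()
  }
}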
So recently I started playing around with Spark SQL and realized it's much easier to simply take a DataFrame and save it to a JDBC source with the new features in 1.3.0.
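Concretely, something like this (a sketch of the 1.3.0-era calls; the DataFrame df and the table name are placeholders):

val url = "jdbc:mysql://some.domain.com/myschema?user=user&password=password"
df.createJDBCTable(url, "mytable", false)    // third argument is the allowExisting flag
// or, for a table that already exists:
// df.insertIntoJDBC(url, "mytable", false)  // third argument is the overwrite flag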
I'm getting the following exception:
java.sql.SQLException: No suitable driver found for jdbc:mysql://some.domain.com/myschema?user=user&password=password
    at java.sql.DriverManager.getConnection(DriverManager.java:596)
    at java.sql.DriverManager.getConnection(DriverManager.java:233)
When I was running this locally, I got around it by setting
SPARK_CLASSPATH=/path/where/mysql-connector-is.jar
Ultimately what I want to know is: why can't the job find the driver when it should be packaged up with it? My other jobs never had this problem. From what I can tell, both c3p0 and the DataFrame code make use of java.sql.DriverManager (which handles loading the driver for you, from what I can tell), so it should work just fine?? If there is something that prevents the assembly method from working, what do I need to do to make this work?
Connecting to any database requires a common set of properties: the database driver, the database URL, the username, and the password. Connecting from PySpark code requires the same set of properties; url is the JDBC URL of the database to connect to.
Spark SQL also includes a data source that can read data from other databases using JDBC. This functionality should be preferred over using JdbcRDD. This is because the results are returned as a DataFrame and they can easily be processed in Spark SQL or joined with other data sources.
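For example, reading through the JDBC data source looks roughly like this in Scala with the 1.3-era API (the url, table, and driver values are placeholders):

val jdbcDF = sqlContext.load("jdbc", Map(
  "url" -> "jdbc:mysql://some.domain.com/myschema?user=user&password=password",
  "dbtable" -> "mytable",
  "driver" -> "com.mysql.jdbc.Driver"))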
This person was having a similar issue: http://apache-spark-user-list.1001560.n3.nabble.com/How-to-use-DataFrame-with-MySQL-td22178.html
Have you updated your connector drivers to the most recent version? Also, did you specify the driver class when you called load()?
Map<String, String> options = new HashMap<String, String>();
options.put("url", "jdbc:mysql://localhost:3306/video_rcmd?user=root&password=123456");
options.put("dbtable", "video");
options.put("driver", "com.mysql.cj.jdbc.Driver"); // here: specify the driver class explicitly
DataFrame jdbcDF = sqlContext.load("jdbc", options);
In spark/conf/spark-defaults.conf, you can also set spark.driver.extraClassPath and spark.executor.extraClassPath to the path of your MySql driver .jar
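For example (the jar path is a placeholder for wherever the connector lives on your machines):

spark.driver.extraClassPath   /path/to/mysql-connector-java-5.1.33-bin.jar
spark.executor.extraClassPath /path/to/mysql-connector-java-5.1.33-bin.jar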
These options are clearly mentioned in the Spark docs: --driver-class-path postgresql-9.4.1207.jar --jars postgresql-9.4.1207.jar
The mistake I was making was specifying these options after my application's jar.
However, the correct way is to specify these options immediately after spark-submit:
spark-submit --driver-class-path /somepath/project/mysql-connector-java-5.1.30-bin.jar --jars /somepath/project/mysql-connector-java-5.1.30-bin.jar --class com.package.MyClass target/scala-2.11/project_2.11-1.0.jar