In Apache Spark 2.0.0, is it possible to fetch a query from an external database (rather than grab the whole table)?

Using pyspark:

from pyspark.sql import SparkSession

# Build (or reuse) a SparkSession.
spark = SparkSession \
    .builder \
    .appName("spark play") \
    .getOrCreate()

# Read the whole table over JDBC ("port" is a placeholder for the MySQL port).
df = spark.read \
    .format("jdbc") \
    .option("url", "jdbc:mysql://localhost:port") \
    .option("dbtable", "schema.tablename") \
    .option("user", "username") \
    .option("password", "password") \
    .load()

Rather than fetch "schema.tablename", I would prefer to grab the result set of a query.

asked Aug 02 '16 by PBL

People also ask

Can Spark SQL read data from other databases?

Spark SQL also includes a data source that can read data from other databases using JDBC.

Can we use SQL queries directly in Spark?

Spark SQL lets you query structured data inside Spark programs, using either SQL or the familiar DataFrame API, and it is usable from Java, Scala, Python and R. The results of SQL queries are themselves DataFrames, so further functions can be applied to them directly.
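
As a minimal pyspark sketch of mixing the two (the view name "people" and its columns are illustrative, not from the original post):

# Register an existing DataFrame as a temporary view...
df.createOrReplaceTempView("people")

# ...then query it with plain SQL; the result is again a DataFrame.
adults = spark.sql("SELECT name, age FROM people WHERE age >= 18")
adults.show()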

Can Spark read from database?

Spark provides an API for reading from and writing to external databases as Spark DataFrames. It requires the JDBC driver class and its jar to be on the classpath, and all the connection properties to be specified, in order to load or unload data from the external source.
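
For example, the connector jar can be supplied when the session is built via the spark.jars configuration; this is a hedged sketch, and the jar path below is a placeholder, not from the original post:

from pyspark.sql import SparkSession

# The path to the MySQL JDBC connector jar is hypothetical; adjust to your setup.
spark = SparkSession \
    .builder \
    .appName("jdbc example") \
    .config("spark.jars", "/path/to/mysql-connector-java.jar") \
    .getOrCreate()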

Which option can be used in Spark SQL if you need to use an in memory columnar structure to cache tables?

Spark SQL can cache tables using an in-memory columnar format by calling spark.catalog.cacheTable("tableName") or dataFrame.cache().
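
A minimal illustration of both calls, assuming a DataFrame df that has been registered as a temporary view (names are illustrative):

# Cache by table name through the catalog...
df.createOrReplaceTempView("tablename")
spark.catalog.cacheTable("tablename")

# ...or cache the DataFrame directly.
df.cache()

# Release the cached data when it is no longer needed.
spark.catalog.uncacheTable("tablename")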


1 Answer

Same as in 1.x, you can pass a valid subquery as the dbtable argument, for example:

...
.option("dbtable", "(SELECT foo, bar FROM schema.tablename) AS tmp")
...
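
Put together with the question's reader, a complete sketch might look like this (the connection details are the placeholders from the question, not a tested configuration):

query = "(SELECT foo, bar FROM schema.tablename) AS tmp"

df = spark.read \
    .format("jdbc") \
    .option("url", "jdbc:mysql://localhost:port") \
    .option("dbtable", query) \
    .option("user", "username") \
    .option("password", "password") \
    .load()

The parentheses and the alias (AS tmp) matter: Spark inlines the dbtable value into its own SELECT statement, and most databases, MySQL included, require a derived table to have an alias. (In Spark 2.4 and later there is also a separate query option that accepts the bare SELECT without the wrapper.)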
answered Nov 14 '22 by zero323