How is spark HiveContext/SQLContext retrieving schema/data?

Question

I can't seem to find much documentation on it but when I pull data from Hive in Spark SQL how is it retrieving the schema, is it automatically looking in the Hive Metastore? Also is it Hive telling spark to look at the file location to pull the data into a DataFrame? And how does it handle a view or can it not handle a view yet?

ayan guha · Accepted Answer

Yes, it looks up hive metastore.
Spark delegates hive queries to hive. It captures output and turn it to a dataframe of rows. From docs:

When working with Hive one must construct a HiveContext, which inherits from SQLContext, and adds support for finding tables in the MetaStore and writing queries using HiveQL

How is spark HiveContext/SQLContext retrieving schema/data?

Tags:

apache-spark

apache-spark-sql

theMadKing

1 Answers

ayan guha

Recent Activity

Donate For Us

How is spark HiveContext/SQLContext retrieving schema/data?

Tags:

apache-spark

apache-spark-sql

theMadKing

1 Answers

ayan guha

Related questions

Recent Activity

Donate For Us