Unable to use an existing Hive permanent UDF from Spark SQL

I have previously registered a UDF with Hive. It is permanent, not TEMPORARY, and it works in beeline.

CREATE FUNCTION normaliseURL AS 'com.example.hive.udfs.NormaliseURL' USING JAR 'hdfs://udfs/hive-udfs.jar';

I have Spark configured to use the Hive metastore. The config is working, as I can query Hive tables. I can see the UDF:

In [9]: spark.sql('describe function normaliseURL').show(truncate=False)
+-------------------------------------------+
|function_desc                              |
+-------------------------------------------+
|Function: default.normaliseURL             |
|Class: com.example.hive.udfs.NormaliseURL  |
|Usage: N/A.                                |
+-------------------------------------------+

However, I cannot use the UDF in a SQL statement:

spark.sql('SELECT normaliseURL("value")')
AnalysisException: "Undefined function: 'default.normaliseURL'. This function is neither a registered temporary function nor a permanent function registered in the database 'default'.; line 1 pos 7"

If I attempt to register the UDF with Spark (bypassing the metastore), registration fails with an error saying that the function already exists.

In [12]: spark.sql("create function normaliseURL as 'com.example.hive.udfs.NormaliseURL'")
AnalysisException: "Function 'default.normaliseURL' already exists in database 'default';"

I'm using Spark 2.0 and Hive metastore 1.1.0. The UDF is written in Scala; my Spark driver code is Python.

I'm stumped.

Am I correct in my assumption that Spark can utilise metastore-defined permanent UDFs?
Am I creating the function correctly in Hive?

asked Aug 18 '16 16:08

Rob Cowie

1 Answer

The issue is that Spark 2.0 cannot execute functions whose JARs are located on HDFS.

Related: Spark SQL: Thriftserver unable to run a registered Hive UDTF

One workaround is to define the function as a temporary function in the Spark job, with the JAR path pointing to a local path on the edge node, and then call the function in the same Spark job.

CREATE TEMPORARY FUNCTION functionName as 'com.test.HiveUDF' USING JAR '/user/home/dir1/functions.jar'
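
Since the driver code in the question is Python, the workaround above might be wrapped roughly as follows. This is only a sketch: the helper names temp_function_ddl and register_temp_udf are made up for illustration, and the spark argument is assumed to be an existing Hive-enabled SparkSession such as the one in the question. The helpers only build and submit the same CREATE TEMPORARY FUNCTION statement shown above.

```python
import re


def temp_function_ddl(name, class_name, jar_path):
    """Build a CREATE TEMPORARY FUNCTION statement for a Hive UDF class.

    Hypothetical helper: it only formats the SQL text and does not
    check that the class or the JAR actually exists.
    """
    # Keep the function name to a plain identifier so it can be
    # placed into the statement unquoted.
    if not re.match(r"^[A-Za-z_][A-Za-z0-9_]*$", name):
        raise ValueError("unexpected function name: %r" % name)
    # The class name and JAR path go inside single-quoted SQL literals,
    # so refuse values that would terminate the literal early.
    for literal in (class_name, jar_path):
        if "'" in literal:
            raise ValueError("single quote not allowed in: %r" % literal)
    return "CREATE TEMPORARY FUNCTION {0} AS '{1}' USING JAR '{2}'".format(
        name, class_name, jar_path
    )


def register_temp_udf(spark, name, class_name, jar_path):
    """Register the UDF as a temporary function on the given session.

    `spark` is assumed to be a SparkSession (anything with a .sql method).
    The temporary function is only visible to that session.
    """
    spark.sql(temp_function_ddl(name, class_name, jar_path))
```

In the question's setup this would be called once per job, for example register_temp_udf(spark, "normaliseURL", "com.example.hive.udfs.NormaliseURL", "/path/on/edge/node/hive-udfs.jar"), where the local JAR path is a placeholder, and then spark.sql('SELECT normaliseURL("value")') would be run in the same session.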

answered Sep 19 '22 15:09

Manmohan
