I got a oozie workflow, running on a CDH4 cluster of 4 machines (one master-for-everything, three "dumb" workers). The hive metastore runs on the master using mysql (driver is present), the oozie server also runs on the master using mysql, too. Using the web interface I can import and query hive as expected, but when I do the same queries within an oozie workflow it fails. Even the addition of the "IF EXISTS" leads to the error below. I tried to add the connection information as properties to the hive job without any success. Can anybody give me a hint? Did I miss anything? Any further information needed? This is the output of the job's log: <pre class="prettyprint"><code> Script [drop.sql] content: ------------------------ DROP TABLE IF EXISTS performance_log; ------------------------ Hive command arguments : -f drop.sql ================================================================= >>> Invoking Hive command line now >>> Intercepting System.exit(10001) <<< Invocation of Main class completed <<< Failing Oozie Launcher, Main class [org.apache.oozie.action.hadoop.HiveMain], exit code [10001] Oozie Launcher failed, finishing Hadoop job gracefully </code></pre> And this is the error message: <pre class="prettyprint"><code> FAILED: SemanticException [Error 10001]: Table not found performance_log Intercepting System.exit(10001) Failing Oozie Launcher, Main class [org.apache.oozie.action.hadoop.HiveMain], exit code [10001] </code></pre>

The problem is other nodes don't know where your MYSQL is , so you are getting error table not found. You need to do 2 things <ol> <li>Copy hive-site.xml in the oozie workflow directory</li> <li>In your Hive action tell oozie that use my hive-site.xml</li> </ol> Something like below <code>action name="hive-node"> <hive xmlns="uri:oozie:hive-action:0.2"> <job-tracker>${jobTracker}</job-tracker> <name-node>${nameNode}</name-node> <job-xml>hive-site.xml</job-xml></code> This should work. Thanks

Oozie workflow: Hive table not found but it does exist

Tags:

hive

cloudera

oozie

I got a oozie workflow, running on a CDH4 cluster of 4 machines (one master-for-everything, three "dumb" workers). The hive metastore runs on the master using mysql (driver is present), the oozie server also runs on the master using mysql, too. Using the web interface I can import and query hive as expected, but when I do the same queries within an oozie workflow it fails. Even the addition of the "IF EXISTS" leads to the error below. I tried to add the connection information as properties to the hive job without any success.

Can anybody give me a hint? Did I miss anything? Any further information needed?

This is the output of the job's log:

  Script [drop.sql] content:
  ------------------------
  DROP TABLE IF EXISTS performance_log;

  ------------------------

  Hive command arguments :
  -f
  drop.sql

  =================================================================

  >>> Invoking Hive command line now >>>

  Intercepting System.exit(10001)

  <<< Invocation of Main class completed <<<

  Failing Oozie Launcher, Main class [org.apache.oozie.action.hadoop.HiveMain], exit code [10001]

  Oozie Launcher failed, finishing Hadoop job gracefully

And this is the error message:

  FAILED: SemanticException [Error 10001]: Table not found performance_log
  Intercepting System.exit(10001)
  Failing Oozie Launcher, Main class [org.apache.oozie.action.hadoop.HiveMain], exit code [10001]

923

asked Apr 01 '13 19:04

Mario Mueller

1 Answers

The problem is other nodes don't know where your MYSQL is , so you are getting error table not found.

You need to do 2 things

Copy hive-site.xml in the oozie workflow directory
In your Hive action tell oozie that use my hive-site.xml

Something like below

action name="hive-node"> <hive xmlns="uri:oozie:hive-action:0.2"> <job-tracker>${jobTracker}</job-tracker> <name-node>${nameNode}</name-node> <job-xml>hive-site.xml</job-xml>

This should work.

Thanks

150

answered Jan 04 '23 06:01

user2230605

Related questions
                            
                                Sql Query: co-occurrence of column values
                            
                                Counting in Hadoop Hive
                            
                                FAILED: Error in semantic analysis: Column Found in more than One Tables/Subqueries
                            
                                Spark job did not find table in Hive database
                            
                                Hive on Spark list all partitions for specific hive table and adding a partition
                            
                                The root scratch dir: /tmp/hive on HDFS should be writable. Current permissions are: -wx------
                            
                                Hive doesn't work on install
                            
                                error in hive metadata: org.apache.thrift.transport.TTransportException: java.net
                            
                                Hive -- split data across files
                            
                                split string that includes semicolons in Hive
                            
                                Hive is not showing tables
                            
                                Data visualisation tools availble on hive hadoop
                            
                                Create HIVE partitioned table HDFS location assistance
                            
                                Spark on embedded mode - user/hive/warehouse not found
                            
                                Creating a partitioned hive table from a non partitioned table
                            
                                spark returns error libsnappyjava.so: failed to map segment from shared object: Operation not permitted
                            
                                Hive: Best way to do incremetal updates on a main table
                            
                                Merging small files in hadoop
                            
                                Hive and SparkSQL do not support datetime type?
                            
                                Hadoop - Hive : Delete data which is older than specified no of days

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With