I am using Ubuntu 12.04 (32-bit) and have installed Hadoop 2.2.0 and Pig 0.12 successfully. Hadoop runs properly on my system.
However, whenever I run the following script:

aatoz = load 'atoz.csv' using PigStorage(',') as (aa1:int, bb1:int, cc1:int, dd1:chararray);
dump aatoz;

I get the following error:
ERROR org.apache.hadoop.mapreduce.lib.jobcontrol.JobControl - Error while trying to run jobs. java.lang.IncompatibleClassChangeError: Found interface org.apache.hadoop.mapreduce.JobContext, but class was expected
Here is the full stack trace:
> 2014-01-23 10:41:44,998 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - 1 map-reduce job(s) waiting for submission.
> 2014-01-23 10:41:45,000 [Thread-9] INFO org.apache.hadoop.metrics.jvm.JvmMetrics - Cannot initialize JVM Metrics with processName=JobTracker, sessionId= - already initialized
> 2014-01-23 10:41:45,001 [Thread-9] ERROR org.apache.hadoop.mapreduce.lib.jobcontrol.JobControl - Error while trying to run jobs.
> java.lang.IncompatibleClassChangeError: Found interface org.apache.hadoop.mapreduce.JobContext, but class was expected
>     at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigOutputFormat.setupUdfEnvAndStores(PigOutputFormat.java:225)
>     at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigOutputFormat.checkOutputSpecs(PigOutputFormat.java:186)
>     at org.apache.hadoop.mapreduce.JobSubmitter.checkSpecs(JobSubmitter.java:456)
>     at org.apache.hadoop.mapreduce.JobSubmitter.submitJobInternal(JobSubmitter.java:342)
>     at org.apache.hadoop.mapreduce.Job$10.run(Job.java:1268)
>     at org.apache.hadoop.mapreduce.Job$10.run(Job.java:1265)
>     at java.security.AccessController.doPrivileged(Native Method)
>     at javax.security.auth.Subject.doAs(Subject.java:415)
>     at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1491)
>     at org.apache.hadoop.mapreduce.Job.submit(Job.java:1265)
>     at org.apache.hadoop.mapreduce.lib.jobcontrol.ControlledJob.submit(ControlledJob.java:335)
>     at org.apache.hadoop.mapreduce.lib.jobcontrol.JobControl.run(JobControl.java:240)
>     at java.lang.Thread.run(Thread.java:724)
>     at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher$1.run(MapReduceLauncher.java:260)
> 2014-01-23 10:41:45,498 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - 0% complete
> 2014-01-23 10:41:45,502 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - job null has failed! Stop running all dependent jobs
> 2014-01-23 10:41:45,503 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - 100% complete
> 2014-01-23 10:41:45,507 [main] ERROR org.apache.pig.tools.pigstats.SimplePigStats - ERROR 2997: Unable to recreate exception from backend error: Unexpected System Error Occured: java.lang.IncompatibleClassChangeError: Found interface org.apache.hadoop.mapreduce.JobContext, but class was expected
>     at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigOutputFormat.setupUdfEnvAndStores(PigOutputFormat.java:225)
>     at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigOutputFormat.checkOutputSpecs(PigOutputFormat.java:186)
>     at org.apache.hadoop.mapreduce.JobSubmitter.checkSpecs(JobSubmitter.java:456)
>     at org.apache.hadoop.mapreduce.JobSubmitter.submitJobInternal(JobSubmitter.java:342)
>     at org.apache.hadoop.mapreduce.Job$10.run(Job.java:1268)
>     at org.apache.hadoop.mapreduce.Job$10.run(Job.java:1265)
>     at java.security.AccessController.doPrivileged(Native Method)
>     at javax.security.auth.Subject.doAs(Subject.java:415)
>     at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1491)
>     at org.apache.hadoop.mapreduce.Job.submit(Job.java:1265)
>     at org.apache.hadoop.mapreduce.lib.jobcontrol.ControlledJob.submit(ControlledJob.java:335)
>     at org.apache.hadoop.mapreduce.lib.jobcontrol.JobControl.run(JobControl.java:240)
>     at java.lang.Thread.run(Thread.java:724)
>     at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher$1.run(MapReduceLauncher.java:260)
> 2014-01-23 10:41:45,507 [main] ERROR org.apache.pig.tools.pigstats.PigStatsUtil - 1 map reduce job(s) failed!
> 2014-01-23 10:41:45,507 [main] INFO org.apache.pig.tools.pigstats.SimplePigStats - Detected Local mode. Stats reported below may be incomplete
> 2014-01-23 10:41:45,508 [main] INFO org.apache.pig.tools.pigstats.SimplePigStats - Script Statistics:
>
> HadoopVersion  PigVersion  UserId  StartedAt            FinishedAt           Features
> 2.2.0          0.10.1      hardik  2014-01-23 10:41:44  2014-01-23 10:41:45  UNKNOWN
>
> Failed!
>
> Failed Jobs:
> JobId  Alias  Feature   Message  Outputs
> N/A    aatoz  MAP_ONLY  Message: Unexpected System Error Occured: java.lang.IncompatibleClassChangeError: Found interface org.apache.hadoop.mapreduce.JobContext, but class was expected
>     at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigOutputFormat.setupUdfEnvAndStores(PigOutputFormat.java:225)
>     at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigOutputFormat.checkOutputSpecs(PigOutputFormat.java:186)
>     at org.apache.hadoop.mapreduce.JobSubmitter.checkSpecs(JobSubmitter.java:456)
>     at org.apache.hadoop.mapreduce.JobSubmitter.submitJobInternal(JobSubmitter.java:342)
>     at org.apache.hadoop.mapreduce.Job$10.run(Job.java:1268)
>     at org.apache.hadoop.mapreduce.Job$10.run(Job.java:1265)
>     at java.security.AccessController.doPrivileged(Native Method)
>     at javax.security.auth.Subject.doAs(Subject.java:415)
>     at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1491)
>     at org.apache.hadoop.mapreduce.Job.submit(Job.java:1265)
>     at org.apache.hadoop.mapreduce.lib.jobcontrol.ControlledJob.submit(ControlledJob.java:335)
>     at org.apache.hadoop.mapreduce.lib.jobcontrol.JobControl.run(JobControl.java:240)
>     at java.lang.Thread.run(Thread.java:724)
>     at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher$1.run(MapReduceLauncher.java:260)
>     file:/tmp/temp1979716161/tmp-189979005,
>
> Input(s):
> Failed to read data from "file:///home/hardik/pig10/bin/input/atoz.csv"
>
> Output(s):
> Failed to produce result in "file:/tmp/temp1979716161/tmp-189979005"
>
> Job DAG:
> null
>
> 2014-01-23 10:41:45,509 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - Failed!
> 2014-01-23 10:41:45,510 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 1066: Unable to open iterator for alias aatoz
> Details at logfile: /home/hardik/pig10/bin/pig_1390453192689.log
Just building with the command "ant clean jar-all -Dhadoopversion=23" is not enough if you are using Maven dependencies in your project. You will need to install the jar created by this build into your local Maven repository, or use the following dependency (note the "classifier" tag for Hadoop 2) in your pom.xml:
<dependency>
<groupId>org.apache.pig</groupId>
<artifactId>pig</artifactId>
<classifier>h2</classifier>
<version>0.13.0</version>
</dependency>
Apache Pig 0.12.0 expects an older version of Hadoop by default. You must recompile Pig for Hadoop 2.2.0 and replace the two original jars with the newly built pig-0.12.1-SNAPSHOT.jar and pig-0.12.1-SNAPSHOT-withouthadoop.jar.
To recompile, unpack the Pig archive, go into the "pig-0.12.0" directory, and just run:
ant clean jar-all -Dhadoopversion=23
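End to end, the sequence might look roughly like this (a sketch: the tarball name and PIG_HOME are illustrative, and the exact location of the rebuilt jars in the build tree can vary between Pig versions, hence the find):

# Unpack the Pig source release and rebuild it against Hadoop 2.x
tar xzf pig-0.12.0.tar.gz
cd pig-0.12.0
ant clean jar-all -Dhadoopversion=23
# Locate the rebuilt jars; where they land in the build tree can vary
find . -name "pig-*SNAPSHOT*.jar"
# Copy them over the jars your Pig installation currently uses
cp pig-0.12.1-SNAPSHOT.jar pig-0.12.1-SNAPSHOT-withouthadoop.jar "$PIG_HOME/"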
I resolved it in another way.
I hit the same problem on CDH 4.4 and Pig 0.11.0 when my Pig script invoked a UDF from my Java project, which was compiled using Maven.
I looked at the /usr/lib/pig/conf/build.properties file and verified the versions listed against the hadoop-core, hadoop-common, and hadoop-mapreduce properties, then made sure all of those artifacts, at those same versions, were included as dependencies in my Java project's pom.xml.
(In fact, hadoop-mapreduce has 6 artifact IDs, per http://www.cloudera.com/content/cloudera-content/cloudera-docs/CDH4/4.2.0/CDH4-Installation-Guide/cdh4ig_topic_31.html. I included all of them in the dependencies list.)
After building my project's jar file with these POM settings, the Pig script was able to invoke the UDF without any problem.
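A quick way to see which Hadoop versions your Pig build expects is to grep that file; a minimal sketch (the exact property names in build.properties may differ between distributions):

# Print the Hadoop artifact versions this Pig build was compiled against;
# property names may vary by distribution, so adjust the pattern if needed.
grep -E 'hadoop.(core|common|mapreduce)' /usr/lib/pig/conf/build.properties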