I have a class ImageInputFormat in Hadoop which reads images from HDFS. How can I use my InputFormat in Spark? Here is my ImageInputFormat:
import java.io.IOException;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.InputSplit;
import org.apache.hadoop.mapreduce.JobContext;
import org.apache.hadoop.mapreduce.TaskAttemptContext;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;

public class ImageInputFormat extends FileInputFormat<Text, ImageWritable> {

    @Override
    public ImageRecordReader createRecordReader(InputSplit split,
            TaskAttemptContext context) throws IOException, InterruptedException {
        return new ImageRecordReader();
    }

    // Images are binary; never split a file across record boundaries.
    @Override
    protected boolean isSplitable(JobContext context, Path filename) {
        return false;
    }
}
The default input format is TextInputFormat, which uses the byte offset as the key and the entire line as the value.
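As a minimal sketch of that default in Spark (the HDFS paths are placeholders), textFile wraps TextInputFormat and keeps only the value, while hadoopFile exposes the (byte offset, line) pairs directly:

import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapred.TextInputFormat;
import org.apache.spark.api.java.JavaPairRDD;
import org.apache.spark.api.java.JavaRDD;
import org.apache.spark.api.java.JavaSparkContext;

public class TextDefaultExample {
    public static void main(String[] args) {
        JavaSparkContext sc = new JavaSparkContext("local[*]", "TextDefaultExample");

        // textFile uses TextInputFormat under the hood; only the line is kept.
        JavaRDD<String> lines = sc.textFile("hdfs:///path/to/input"); // placeholder path

        // The same read spelled out with the old-API TextInputFormat yields
        // (byte offset, line) pairs.
        JavaPairRDD<LongWritable, Text> pairs = sc.hadoopFile(
                "hdfs:///path/to/input", TextInputFormat.class,
                LongWritable.class, Text.class);

        System.out.println(lines.count() + " lines, " + pairs.count() + " pairs");
        sc.stop();
    }
}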
For DataFrames, Spark's default data source is Parquet, a columnar format whose column pruning and predicate pushdown improve the performance of querying and filtering data.
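A quick sketch of that default (paths are placeholders): a read or write with no explicit format falls back to Parquet.

import org.apache.spark.sql.Dataset;
import org.apache.spark.sql.Row;
import org.apache.spark.sql.SparkSession;

public class ParquetDefaultExample {
    public static void main(String[] args) {
        SparkSession spark = SparkSession.builder()
                .appName("ParquetDefaultExample").getOrCreate();

        // With no format specified, spark.sql.sources.default (parquet) is used.
        Dataset<Row> df = spark.read().load("hdfs:///path/to/data"); // placeholder path
        df.write().save("hdfs:///path/to/out");                      // written as Parquet

        spark.stop();
    }
}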
FileInputFormat is the base class for all file-based InputFormats in Hadoop. It specifies the input directory where the data files are located: when a Hadoop job starts, FileInputFormat is given a path containing the files to read, and it divides those files into one or more InputSplits.
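In a plain MapReduce job that wiring would look roughly like this (the job name and path are placeholders):

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;

public class JobSetupExample {
    public static void main(String[] args) throws Exception {
        Job job = Job.getInstance(new Configuration(), "image-job"); // placeholder name

        // FileInputFormat is handed the input directory; at submission time it
        // lists the files there and carves them into InputSplits.
        FileInputFormat.addInputPath(job, new Path("hdfs:///path/to/images"));
        job.setInputFormatClass(ImageInputFormat.class);
    }
}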
The SparkContext has a method called hadoopFile. Its description says "Get an RDD for a Hadoop file with an arbitrary InputFormat", and it accepts classes implementing the old interface org.apache.hadoop.mapred.InputFormat. Your ImageInputFormat, however, extends the new-API org.apache.hadoop.mapreduce.lib.input.FileInputFormat (its createRecordReader takes a TaskAttemptContext), so the companion method newAPIHadoopFile is the one to use. Also have a look at the Spark documentation.
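A minimal sketch of calling it from Java, assuming your ImageWritable and ImageRecordReader classes are on the classpath (the app name and HDFS path are placeholders):

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.io.Text;
import org.apache.spark.SparkConf;
import org.apache.spark.api.java.JavaPairRDD;
import org.apache.spark.api.java.JavaSparkContext;

public class ImageReadExample {
    public static void main(String[] args) {
        SparkConf conf = new SparkConf().setAppName("ImageReadExample"); // placeholder name
        JavaSparkContext sc = new JavaSparkContext(conf);

        // newAPIHadoopFile takes the path, the new-API InputFormat class, and
        // the key/value classes; the Configuration is passed through to the reader.
        JavaPairRDD<Text, ImageWritable> images = sc.newAPIHadoopFile(
                "hdfs:///path/to/images", // placeholder input directory
                ImageInputFormat.class,
                Text.class,
                ImageWritable.class,
                new Configuration());

        System.out.println("Number of images: " + images.count());
        sc.stop();
    }
}

One caveat worth knowing: Hadoop RecordReaders typically reuse the same Writable object for every record, so map the values to your own objects (or copy them) before caching or collecting the RDD.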