Java - Spark SQL DataFrame map function is not working

In Spark SQL, when I try to use the map function on a DataFrame, I get the error below.

The method map(Function1, ClassTag) in the type DataFrame is not applicable for the arguments (new Function(){})

I am following the Spark 1.3 documentation: https://spark.apache.org/docs/latest/sql-programming-guide.html#inferring-the-schema-using-reflection. Does anyone have a solution?

Here is my test code.

// SQL can be run over RDDs that have been registered as tables.
DataFrame teenagers = sqlContext.sql("SELECT name FROM people WHERE age >= 13 AND age <= 19");

List<String> teenagerNames = teenagers.map(
    new Function<Row, String>() {
        public String call(Row row) {
            return "Name: " + row.getString(0);
        }
    }).collect();
asked Apr 22 '15 by user3206330


People also ask

Can we use map function in DataFrame spark?

Spark provides two map transformation signatures on DataFrame: one takes a scala.Function1 as its argument and the other takes a Spark MapFunction. If you look at the signatures, note that both return Dataset[U], not DataFrame (a DataFrame is a Dataset[Row]).
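
For illustration, here is a minimal sketch of the MapFunction overload in Java, assuming Spark 2.x (where DataFrame is an alias for Dataset<Row>) and an existing Dataset<Row> named df whose first column is a string:

import org.apache.spark.api.java.function.MapFunction;
import org.apache.spark.sql.Dataset;
import org.apache.spark.sql.Encoders;
import org.apache.spark.sql.Row;

// The result is a Dataset<String>, not a DataFrame (Dataset<Row>).
Dataset<String> names = df.map(
    (MapFunction<Row, String>) row -> row.getString(0),
    Encoders.STRING());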

How does map function work in spark?

Spark's map function takes one element as input, processes it according to custom code (specified by the developer), and returns one element at a time. Map transforms an RDD of length N into another RDD of length N; the input and output RDDs will typically have the same number of records.
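
As a quick illustration (a minimal sketch; jsc stands in for an existing JavaSparkContext):

import java.util.Arrays;
import org.apache.spark.api.java.JavaRDD;

// map is one-to-one: three elements in, three elements out.
JavaRDD<Integer> nums = jsc.parallelize(Arrays.asList(1, 2, 3));
JavaRDD<String> labels = nums.map(n -> "value: " + n);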

How do I create a column map in spark?

We can create a map column using the createMapType() function on the DataTypes class. This method takes two arguments, keyType and valueType, both of which must be types that extend DataType.
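
For example (the field names here are illustrative):

import org.apache.spark.sql.types.DataTypes;
import org.apache.spark.sql.types.MapType;
import org.apache.spark.sql.types.StructField;
import org.apache.spark.sql.types.StructType;

// A map column with String keys and Integer values.
MapType scoresType = DataTypes.createMapType(DataTypes.StringType, DataTypes.IntegerType);

// Use it as a field in a schema.
StructType schema = DataTypes.createStructType(new StructField[] {
    DataTypes.createStructField("name", DataTypes.StringType, false),
    DataTypes.createStructField("scores", scoresType, true)
});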


2 Answers

Change this to:

Java 6 & 7

List<String> teenagerNames = teenagers.javaRDD().map(
    new Function<Row, String>() {
        public String call(Row row) {
            return "Name: " + row.getString(0);
        }
    }).collect();

Java 8

List<String> t2 = teenagers.javaRDD().map(
    row -> "Name: " + row.getString(0)
).collect();

Once you call javaRDD() you have a JavaRDD<Row>, whose map takes an org.apache.spark.api.java.function.Function, so it works just like any other Java RDD map. The original error occurs because DataFrame.map in Spark 1.3 expects a Scala Function1 plus a ClassTag, which a Java anonymous Function cannot satisfy.

This works with Spark 1.3.0 and up.
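
The same conversion opens up the rest of the Java RDD API as well; for instance (the filter predicate is purely illustrative):

long count = teenagers.javaRDD()
    .map(row -> row.getString(0))
    .filter(name -> name.startsWith("A"))
    .count();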

answered Oct 03 '22 by econn


There is no need to convert to an RDD, which delays execution; it can be done as below:

public static void mapMethod() {
    // Read the data from a file on the classpath.
    Dataset<Row> df = sparkSession.read().json("file1.json");

    Encoder<String> encoder = Encoders.STRING();

    // Prior to Java 8
    List<String> rowsList = df.map(new MapFunction<Row, String>() {
        private static final long serialVersionUID = 1L;

        @Override
        public String call(Row row) throws Exception {
            return "string:>" + row.getString(0) + "<";
        }
    }, encoder).collectAsList();

    // From Java 8 onwards. The cast is needed because Dataset.map is
    // overloaded for both scala.Function1 and MapFunction, so an uncast
    // lambda is ambiguous to the Java compiler.
    List<String> rowsList1 = df.map(
        (MapFunction<Row, String>) row -> "string >" + row.getString(0) + "<",
        encoder).collectAsList();

    System.out.println(">>> " + rowsList);
    System.out.println(">>> " + rowsList1);
}
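
If the cast on the lambda feels awkward, assigning it to a typed variable first resolves the overload just as well (same behavior, purely stylistic):

MapFunction<Row, String> toLabel = row -> "string >" + row.getString(0) + "<";
List<String> rowsList2 = df.map(toLabel, encoder).collectAsList();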

answered Oct 04 '22 by Vijay Anantharamu