
Creating a simple 1-row Spark DataFrame with Java API

In Scala, I can create a single-row DataFrame from an in-memory string like so:

val stringAsList = List("buzz")
val df = sqlContext.sparkContext.parallelize(stringAsList).toDF("fizz")
df.show()

When df.show() runs, it outputs:

+-----+
| fizz|
+-----+
| buzz|
+-----+

Now I'm trying to do this from inside a Java class. Apparently JavaRDDs don't have a toDF(String) method. I've tried:

List<String> stringAsList = new ArrayList<String>();
stringAsList.add("buzz");
SQLContext sqlContext = new SQLContext(sparkContext);
DataFrame df = sqlContext.createDataFrame(sparkContext
    .parallelize(stringAsList), StringType);
df.show();

...but I still seem to be coming up short. Now when df.show(); executes, I get:

++
||
++
||
++

(An empty DF.) So I ask: Using the Java API, how do I read an in-memory string into a DataFrame that has only 1 row and 1 column in it, and also specify the name of that column? (So that the df.show() is identical to the Scala one above)?

asked Oct 10 '16 by smeeb

People also ask

How do I create a DataFrame in Java Spark?

Create a DataFrame from a list of Row objects: populate a list with Row objects, build the StructFields and add them to a list, pass that list to createStructType, and then pass the resulting schema (along with the rows) to createDataFrame.

How do I create a row in Spark?

To create a new Row, use RowFactory.create() in Java or Row.apply() in Scala. A Row object can be constructed by providing field values.
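
The steps above can be sketched as follows. This is a minimal example, assuming Spark 2.x with a local SparkSession (the class name and master setting are illustrative, not from the question):

```java
import java.util.Collections;
import java.util.List;

import org.apache.spark.sql.Dataset;
import org.apache.spark.sql.Row;
import org.apache.spark.sql.RowFactory;
import org.apache.spark.sql.SparkSession;
import org.apache.spark.sql.types.DataTypes;
import org.apache.spark.sql.types.StructType;

public class SingleRowExample {
    public static void main(String[] args) {
        SparkSession spark = SparkSession.builder()
                .master("local[*]")
                .appName("single-row")
                .getOrCreate();

        // One Row holding the single in-memory value, built via RowFactory.create()
        List<Row> rows = Collections.singletonList(RowFactory.create("buzz"));

        // Schema with a single, explicitly named string column
        StructType schema = new StructType()
                .add("fizz", DataTypes.StringType, false);

        Dataset<Row> df = spark.createDataFrame(rows, schema);
        df.show();

        spark.stop();
    }
}
```

This prints the same one-row, one-column table as the Scala snippet above.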


1 Answer

Building on what @jgp suggested: if you want to do this for mixed types, you can do:

List<Tuple2<Integer, Boolean>> mixedTypes = Arrays.asList(
                new Tuple2<>(1, false),
                new Tuple2<>(1, false),
                new Tuple2<>(1, false));

// sparkContext is a JavaSparkContext; map each tuple to a Row
JavaRDD<Row> rowRDD = sparkContext.parallelize(mixedTypes)
                .map(row -> RowFactory.create(row._1, row._2));

// Schema naming and typing the two tuple fields
StructType mySchema = new StructType()
                .add("id", DataTypes.IntegerType, false)
                .add("flag", DataTypes.BooleanType, false);

// createDataFrame already returns a Dataset<Row>, so no toDF() is needed
Dataset<Row> df = spark.createDataFrame(rowRDD, mySchema);

This might also help with @jdk2588's question.
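
For the original single-column case there is also a shorter route. This is a sketch, assuming Spark 2.x where SparkSession.createDataset and Encoders are available (the class name and master setting are illustrative): build a Dataset&lt;String&gt; from the in-memory list, then rename its default "value" column with toDF:

```java
import java.util.Collections;

import org.apache.spark.sql.Dataset;
import org.apache.spark.sql.Encoders;
import org.apache.spark.sql.Row;
import org.apache.spark.sql.SparkSession;

public class OneColumnExample {
    public static void main(String[] args) {
        SparkSession spark = SparkSession.builder()
                .master("local[*]")
                .appName("one-column")
                .getOrCreate();

        // Dataset<String> gets a default column named "value"; toDF renames it
        Dataset<Row> df = spark
                .createDataset(Collections.singletonList("buzz"), Encoders.STRING())
                .toDF("fizz");
        df.show();

        spark.stop();
    }
}
```

This avoids building a Row and StructType by hand when there is only one column of a primitive type.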

answered Sep 22 '22 by cauchy_cat