Exception: 'writeStream' can be called only on streaming Dataset/DataFrame

I am trying to create a test for the Spark Structured Streaming writeStream function, as shown below:

val spark = SparkSession.builder().master("local").appName("spark session").getOrCreate()

val lakeDF = spark.createDF(List(("hi")), List(("word", StringType, true)))

lakeDF.writeStream
  .trigger(Trigger.Once)
  .format("parquet")
  .option("checkpointLocation", checkpointPath)
  .start(dataPath)

But I am getting the following exception: org.apache.spark.sql.AnalysisException: 'writeStream' can be called only on streaming Dataset/DataFrame;

I am very new to Spark streaming. Please let me know how I can create a streaming DataFrame, or convert the regular DataFrame above into a streaming DataFrame, for my test suite.

Asked Jul 18 '18 by Dhruvajyoti Chatterjee

People also ask

Which of the following are supported in Spark structured streaming?

Spark Streaming is a processing engine that processes real-time data from sources and outputs it to external storage systems. It has three major components: input sources, the streaming engine, and the sink. Input sources such as Kafka, Flume, and HDFS/S3 generate the data.

Which method is used to count the streaming words and aggregate the previous data?

Streaming – Complete Output Mode. This mode is used only when you have aggregated streaming data. One example is counting the words on streaming data, aggregating the counts with previous data, and outputting the results to the sink.
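The running word count described above can be sketched as follows. This is a minimal illustration, assuming a text stream is available on a socket (for example via `nc -lk 9999`); the host, port, and app name are placeholders, not values from the question.

```scala
import org.apache.spark.sql.SparkSession

val spark = SparkSession.builder().master("local[2]").appName("wordcount").getOrCreate()
import spark.implicits._

// Read lines of text from a socket source (test/demo source, not for production).
val lines = spark.readStream
  .format("socket")
  .option("host", "localhost")
  .option("port", 9999)
  .load()

// Split each line into words and keep a running count across all micro-batches.
val counts = lines.as[String]
  .flatMap(_.split("\\s+"))
  .groupBy("value")
  .count()

// Complete mode re-emits the entire aggregated result table on every trigger.
counts.writeStream
  .outputMode("complete")
  .format("console")
  .start()
  .awaitTermination()
```

Because the aggregation is unbounded and the whole result table is rewritten each trigger, complete mode only makes sense for aggregated queries like this one.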

What is the difference between Spark streaming and structured streaming?

Spark Streaming receives real-time data and divides it into smaller batches for the execution engine. In contrast, Structured Streaming is built on the Spark SQL API for data stream processing. In the end, both APIs are optimized by the Catalyst optimizer and translated into RDDs for execution under the hood.

In which of the following modes output can be configured for structured streaming programming model?

The output can be defined in different modes: Complete Mode – the entire updated Result Table is written to external storage; Append Mode – only the new rows appended to the Result Table since the last trigger are written; Update Mode – only the rows updated in the Result Table since the last trigger are written.


1 Answer

In Spark Structured Streaming, streaming DataFrames/Datasets are created from a stream using readStream on a SparkSession. If a DataFrame/Dataset was not created from a stream, you are not allowed to store it using writeStream — which is exactly what the AnalysisException is telling you.

So create the DataFrame/Dataset using readStream, and store it using writeStream:

val kafkaStream = sparkSession.readStream
  .format("kafka")
  .option("kafka.bootstrap.servers", "kafka-broker-hostname:port")
  .option("subscribe", "topicname")
  .load()
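For a test suite like the one in the question, a Kafka broker is usually overkill. One common approach is MemoryStream, Spark's in-memory streaming source intended for tests. The sketch below assumes the `checkpointPath` and `dataPath` values from the question are defined elsewhere in the test:

```scala
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.execution.streaming.MemoryStream
import org.apache.spark.sql.streaming.Trigger

val spark = SparkSession.builder().master("local").appName("spark session").getOrCreate()
import spark.implicits._
// MemoryStream needs an implicit SQLContext in scope.
implicit val sqlCtx = spark.sqlContext

// An in-memory source we can push test rows into.
val inputStream = MemoryStream[String]
inputStream.addData("hi")

// toDF() yields a *streaming* DataFrame, so writeStream is now allowed.
val lakeDF = inputStream.toDF().toDF("word")

lakeDF.writeStream
  .trigger(Trigger.Once)
  .format("parquet")
  .option("checkpointLocation", checkpointPath)
  .start(dataPath)
  .awaitTermination()
```

With Trigger.Once the query processes whatever data has been added and then stops, which makes assertions on the written parquet output straightforward.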
Answered Sep 26 '22 by Naga