
Spark Streaming Window Operation

The following is simple code to get the word count over a window size of 30 seconds and a slide size of 10 seconds.

import org.apache.spark.SparkConf
import org.apache.spark.streaming._
import org.apache.spark.streaming.StreamingContext._
import org.apache.spark.api.java.function._
import org.apache.spark.streaming.api._
import org.apache.spark.storage.StorageLevel

val ssc = new StreamingContext(sc, Seconds(5))

// read from text file
val lines0 = ssc.textFileStream("test")
val words0 = lines0.flatMap(_.split(" "))

// read from socket
val lines1 = ssc.socketTextStream("localhost", 9999, StorageLevel.MEMORY_AND_DISK_SER)
val words1 = lines1.flatMap(_.split(" "))

val words = words0.union(words1)
val wordCounts = words.map((_, 1)).reduceByKeyAndWindow(_ + _, Seconds(30), Seconds(10))

wordCounts.print()
ssc.checkpoint(".")
ssc.start()
ssc.awaitTermination()

However, I am getting an error from this line:

val wordCounts = words.map((_, 1)).reduceByKeyAndWindow(_ + _, Seconds(30), Seconds(10))

specifically from _ + _. The error is:

51: error: missing parameter type for expanded function ((x$2, x$3) => x$2.$plus(x$3))

Could anybody tell me what the problem is? Thanks!

asked Jul 22 '14 by user2895478

People also ask

What is windowing in spark streaming?

The simplest windowing function is window, which creates a new DStream computed by applying the window parameters to the old DStream. You can use any of the DStream operations on the new stream, so you get all the flexibility you want.
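
For illustration, here is a minimal sketch (the socket source, host, port, and durations are placeholders):

import org.apache.spark.streaming.{Seconds, StreamingContext}

val ssc = new StreamingContext(sc, Seconds(5))
val lines = ssc.socketTextStream("localhost", 9999)

// window(windowDuration, slideDuration) yields a new DStream that covers
// the last 30 seconds of data and is recomputed every 10 seconds
val windowed = lines.window(Seconds(30), Seconds(10))
windowed.count().print()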

What method does spark use to perform streaming operations?

Spark Streaming receives live input data streams and divides the data into batches, which are then processed by the Spark engine to generate the final stream of results in batches. Spark Streaming provides a high-level abstraction called discretized stream or DStream, which represents a continuous stream of data.

What is the use of saveAsObjectFiles () operation on Dstreams?

def saveAsObjectFiles(prefix: String, suffix: String = ""): Unit saves each RDD in this DStream as a sequence file of serialized objects. The file name at each batch interval is generated based on the prefix and suffix: "prefix-TIME_IN_MS.suffix".
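
For example, applied to the word counts from the question above (the prefix and suffix here are arbitrary):

// Writes one sequence file of serialized objects per batch,
// named counts-<TIME_IN_MS>.obj
wordCounts.saveAsObjectFiles("counts", "obj")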

How does spark checkpoint streaming work?

A checkpoint helps build fault-tolerant and resilient Spark applications. Spark Structured Streaming maintains intermediate state on HDFS-compatible file systems to recover from failures. To specify the checkpoint location in a streaming query, we use the checkpointLocation parameter.
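
As a minimal sketch of the Structured Streaming form (the socket source, app name, and paths are placeholders):

import org.apache.spark.sql.SparkSession

val spark = SparkSession.builder.appName("CheckpointDemo").getOrCreate()

// A streaming source; the socket source is used purely for illustration
val lines = spark.readStream
  .format("socket")
  .option("host", "localhost")
  .option("port", 9999)
  .load()

// checkpointLocation tells Spark where to persist state for recovery
val query = lines.writeStream
  .format("console")
  .option("checkpointLocation", "/tmp/checkpoints/demo")
  .start()

query.awaitTermination()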


1 Answer

This is extremely easy to fix: just be explicit about the types.

val wordCounts = words.map((_, 1)).reduceByKeyAndWindow((a: Int, b: Int) => a + b, Seconds(30), Seconds(10))

The reason Scala can't infer the types in this case is explained in this answer.
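
Equivalently, you can name the reduce function so its parameter types are declared up front; a minimal sketch (the val name is arbitrary):

// A named function carries explicit parameter types,
// so the compiler no longer has to infer them at the call site
val add: (Int, Int) => Int = _ + _
val wordCounts = words.map((_, 1)).reduceByKeyAndWindow(add, Seconds(30), Seconds(10))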

answered Sep 28 '22 by aaronman