Imagine the following code: <pre class="prettyprint"><code>def myUdf(arg: Int) = udf((vector: MyData) => { // complex logic that returns a Double }) </code></pre> How can I define the return type for myUdf so that people looking at the code will know immediately that it returns a Double?

I see two ways to do it, either define a method first and then lift it to a function <pre class="prettyprint"><code>def myMethod(vector:MyData) : Double = { // complex logic that returns a Double } val myUdf = udf(myMethod _) </code></pre> or define a function first with explicit type: <pre class="prettyprint"><code>val myFunction: Function1[MyData,Double] = (vector:MyData) => { // complex logic that returns a Double } val myUdf = udf(myFunction) </code></pre> I normally use the firt approach for my UDFs

Define return value in Spark Scala UDF

Tags:

scala

apache-spark

user-defined-functions

udf

Imagine the following code:

def myUdf(arg: Int) = udf((vector: MyData) => {
  // complex logic that returns a Double
})

How can I define the return type for myUdf so that people looking at the code will know immediately that it returns a Double?

468

asked May 31 '17 18:05

Marsellus Wallace

1 Answers

I see two ways to do it, either define a method first and then lift it to a function

def myMethod(vector:MyData) : Double = {
  // complex logic that returns a Double
}

val myUdf = udf(myMethod _)

or define a function first with explicit type:

val myFunction: Function1[MyData,Double] = (vector:MyData) => {
  // complex logic that returns a Double
}

val myUdf = udf(myFunction)

I normally use the firt approach for my UDFs

answered Sep 29 '22 22:09

Raphael Roth

Related questions
                            
                                Scala pattern matching on generic type with TypeTag generates a warning while ClassTag not?
                            
                                Adding Two Lists of Same Size at Compile-time [duplicate]
                            
                                How to join two lists in Scala?
                            
                                Missing parameter type
                            
                                Akka Future - Parallel versus Concurrent?
                            
                                Convert scala to native binary
                            
                                Access Spark broadcast variable in different classes
                            
                                How to normalize or standardize the data having multiple columns/variables in spark using scala?
                            
                                providing a constructor for a scala trait
                            
                                array transpose in scala
                            
                                play silhouette is not inserting password into database table
                            
                                Akka stream - List to mapAsync of individual elements
                            
                                What is the advantage of using Option.map over Option.isEmpty and Option.get?
                            
                                SBT - Multi project merge strategy and build sbt structure when using assembly
                            
                                What is '_=' in scala?
                            
                                Scala: Spark SQL to_date(unix_timestamp) returning NULL
                            
                                Scala combinations function not terminating
                            
                                Tuple to data frame in spark scala
                            
                                What is the usage of _ in Int => Int = _ + 1 [duplicate]
                            
                                Spark create UDF that doesn't take in input

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With