
Difference between Apache Spark mllib.linalg vectors and spark.util vectors for machine learning

I'm trying to implement neural networks in Spark and Scala, but I'm unable to perform any vector or matrix multiplication. Spark provides two vector types: spark.util.Vector supports the dot operation but is deprecated, and mllib.linalg vectors do not support arithmetic operations in Scala.

Which one should I use to store weights and training data?

How do I perform vector multiplication in Spark with Scala and mllib, such as w * x, where w is a vector or matrix of weights and x is the input? The PySpark vector supports the dot product, but in Scala I can't find such a function on the vectors.

asked Jan 20 '16 by gaurav.rai

People also ask

What is the difference between Spark ML and Spark MLlib?

At first glance, the most obvious difference between MLlib and ML is the data types they work on: MLlib supports RDDs, while ML supports DataFrames and Datasets.
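For illustration, a minimal sketch of the two entry points (assuming an existing SparkSession named spark; the column names are made up):

import org.apache.spark.mllib.linalg.Vectors          // RDD-based MLlib API
import org.apache.spark.ml.feature.VectorAssembler    // DataFrame-based ML API

// MLlib operates on RDDs of mllib.linalg.Vector.
val rddData = spark.sparkContext.parallelize(Seq(Vectors.dense(1.0, 2.0)))

// ML operates on DataFrames; VectorAssembler builds a vector column.
val df = spark.createDataFrame(Seq((1.0, 2.0))).toDF("f1", "f2")
val features = new VectorAssembler()
  .setInputCols(Array("f1", "f2"))
  .setOutputCol("features")
  .transform(df)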

Can Apache Spark be used for machine learning?

Apache Spark is known as a fast, easy-to-use, general-purpose engine for big data processing, with built-in modules for streaming, SQL, machine learning (ML), and graph processing.

What is spark MLlib used for?

Spark MLlib is used to perform machine learning in Apache Spark. MLlib is a scalable machine learning library consisting of popular algorithms and utilities, designed to provide both high-quality algorithms and high speed.


1 Answer

Well, if you need full support for linear algebra operators, you have to either implement these yourself or use an external library. In the latter case the obvious choice is Breeze.

It is already used by Spark behind the scenes, so it doesn't introduce additional dependencies, and you can easily adapt existing Spark code for the conversions:

import org.apache.spark.mllib.linalg.{DenseVector, SparseVector, Vector}
import breeze.linalg.{DenseVector => BDV, SparseVector => BSV, Vector => BV}

// Convert a Spark mllib vector to its Breeze counterpart.
def toBreeze(v: Vector): BV[Double] = v match {
  case DenseVector(values) => new BDV[Double](values)
  case SparseVector(size, indices, values) =>
    new BSV[Double](indices, values, size)
}

// Convert a Breeze vector back to a Spark mllib vector.
def toSpark(v: BV[Double]): Vector = v match {
  case v: BDV[Double] => new DenseVector(v.toArray)
  case v: BSV[Double] => new SparseVector(v.length, v.index, v.data)
}
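With these helpers in place, the w * x dot product from the question becomes straightforward. A minimal sketch (the w and x values here are made up for illustration):

import org.apache.spark.mllib.linalg.Vectors

// Hypothetical weight vector w and input vector x.
val w = Vectors.dense(0.5, -1.0, 2.0)
val x = Vectors.dense(1.0, 0.0, 3.0)

// w * x as a scalar, computed via Breeze's dot.
val wx: Double = toBreeze(w) dot toBreeze(x)

// Any Breeze result can be converted back to a Spark vector.
val scaled = toSpark(toBreeze(x) * 2.0)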

Mahout provides Spark and Scala bindings you may find interesting as well.

For simple matrix-vector multiplications it can be easier to leverage the existing distributed matrix methods. For example, IndexedRowMatrix and RowMatrix provide multiply methods that accept a local matrix. You can check Matrix Multiplication in Apache Spark for an example usage.
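As a rough sketch of that approach (assuming an existing SparkContext sc; the matrix values are made up):

import org.apache.spark.mllib.linalg.{Matrices, Vectors}
import org.apache.spark.mllib.linalg.distributed.RowMatrix

// Distributed data matrix X, one input row per record.
val X = new RowMatrix(sc.parallelize(Seq(
  Vectors.dense(1.0, 2.0),
  Vectors.dense(3.0, 4.0)
)))

// Local weight matrix W (2 x 1, column-major), i.e. a single column of weights.
val W = Matrices.dense(2, 1, Array(0.5, -1.0))

// X.multiply(W) computes X * W and returns another distributed RowMatrix.
val XW = X.multiply(W)
XW.rows.collect().foreach(println)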

answered by zero323