SparkR vs sparklyr [closed]

Tags:

Does someone have an overview with respect to advantages/disadvantages of SparkR vs sparklyr? Google does not yield any satisfactory results and both seem fairly similar. Trying both out, SparkR appears a lot more cumbersome, whereas sparklyr is pretty straight forward (both to install but also to use, especially with the dplyr inputs). Can sparklyr only be used to run dplyr functions in parallel or also "normal" R-Code?

Best

476

asked Sep 14 '16 15:09

koVex

2 Answers

The biggest advantage of SparkR is the ability to run on Spark arbitrary user-defined functions written in R:

https://spark.apache.org/docs/2.0.1/sparkr.html#applying-user-defined-function

Since sparklyr translates R to SQL, you can only use very small set of functions in mutate statements:

http://spark.rstudio.com/dplyr.html#sql_translation

That deficiency is somewhat alleviated by Extensions (http://spark.rstudio.com/extensions.html#wrapper_functions).

Other than that, sparklyr is a winner (in my opinion). Aside from the obvious advantage of using familiar dplyr functions, sparklyr has much more comprehensive API for MLlib (http://spark.rstudio.com/mllib.html) and the Extensions mentioned above.

answered Sep 19 '22 15:09

Alex Vorobiev

Being a wrapper, there are some limitations to sparklyr. For example, using copy_to() to create a Spark dataframe does not preserve columns formatted as dates. With SparkR, as.Dataframe() preserves dates.

answered Sep 16 '22 15:09

Reuben L.

Related questions
                            
                                How do I change a single value in a data.frame?
                            
                                Producing subscripts in R markdown
                            
                                Unable to load rJava on R
                            
                                How to output text in the R console without creating new lines?
                            
                                Get the mean across multiple Pandas DataFrames
                            
                                Write a data frame to csv file without column header in R [duplicate]
                            
                                Return row number(s) for a particular value in a column in a dataframe
                            
                                R - test if first occurrence of string1 is followed by string2
                            
                                How do I save warnings and errors as output from a function?
                            
                                Extract R-square value with R in linear models [duplicate]
                            
                                Practical limits of R data frame
                            
                                remove all line breaks (enter symbols) from the string using R
                            
                                Finding percentage in a sub-group using group_by and summarise
                            
                                How to order a data frame by one descending and one ascending column?
                            
                                Why do I get "warning longer object length is not a multiple of shorter object length"?
                            
                                How to Reverse a string in R
                            
                                How to control ordering of stacked bar chart using identity on ggplot2
                            
                                Calculate AUC in R?
                            
                                How to do a data.table merge operation
                            
                                Specify widths and heights of plots with grid.arrange

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

SparkR vs sparklyr [closed]

Tags:

r

apache-spark

sparkr

sparklyr

koVex

People also ask

2 Answers

Alex Vorobiev

Reuben L.

Recent Activity

Donate For Us