apache-spark tutorials and guides

Scala String Variable Substitution

Dec 26, 2022

scala apache-spark apache-spark-sql

Reading multiple csv files at different folder depths

Dec 26, 2022

scala csv apache-spark dataframe wildcard

How to replace elements of a breeze matrix in Scala based on some condition?

Dec 26, 2022

scala apache-spark scala-breeze

Why Spark ML ALS algorithm print RMSE = NaN?

Dec 26, 2022

scala apache-spark machine-learning

Getting a date x days back from a custom date in Scala

Dec 26, 2022

scala apache-spark

How to create DataFrame with nulls using toDF?

Dec 25, 2022

scala apache-spark apache-spark-sql

Using custome UDF withColumn in a Spark Dataset<Row>; java.lang.String cannot be cast to org.apache.spark.sql.Row

Dec 25, 2022

java apache-spark apache-spark-sql user-defined-functions apache-spark-dataset

Spark job fails on java 9 NumberFormatException for input string ea

Dec 26, 2022

java scala apache-spark java-9

How can dataframereader read http?

Dec 26, 2022

scala apache-spark intellij-idea apache-spark-sql hdfs

Spark Dataframe - Implement Oracle NVL Function while joining

Dec 25, 2022

scala apache-spark apache-spark-sql

How to convert from org.apache.spark.mllib.linalg.SparseVector to org.apache.spark.ml.linalg.SparseVector?

Dec 26, 2022

scala apache-spark rdd apache-spark-mllib apache-spark-ml

What's the difference between SparkSession.sql and Dataset.sqlContext.sql?

Dec 25, 2022

apache-spark apache-spark-sql

how to make string as parameters that include several strings

Dec 25, 2022

scala apache-spark

PySpark- How to use a row value from one column to access another column which has the same name as of the row value

Dec 25, 2022

apache-spark pyspark apache-spark-sql apache-spark-1.6

If I already have Hadoop installed, should I download Apache Spark WITH Hadoop or WITHOUT Hadoop?

Dec 25, 2022

apache-spark hadoop hadoop3

How to use SparkSession and StreamingContext together?

Dec 24, 2022

scala apache-spark spark-dataframe spark-streaming

How can I export Scala Spark DataFrames schema to a Json file?

Dec 26, 2022

dataframe scala apache-spark apache-spark-sql

How can I read from S3 in pyspark running in local mode?

Dec 25, 2022

python apache-spark amazon-s3 pyspark

Spark on Dataproc: possible to run more executors per CPU?

Dec 25, 2022

apache-spark google-cloud-dataproc

How to change the location of _spark_metadata directory?

Dec 25, 2022

apache-spark amazon-s3 parquet spark-structured-streaming

New posts in apache-spark