Questions Linux Laravel Mysql Ubuntu Git Menu

HTML CSS JAVASCRIPT SQL PYTHON PHP BOOTSTRAP JAVA JQUERY R React Kotlin

New posts in apache-spark

How can I obtain the DAG of an Apache Spark job without running it?

Apr 14, 2022

scala apache-spark

Why is no map function for dataframe in pyspark while the spark equivalent has it?

Nov 06, 2022

apache-spark pyspark

How to set spark.driver.memory for Spark/Zeppelin on EMR

Apr 20, 2019

apache-spark emr amazon-emr apache-zeppelin

Is there a way to validate the syntax of raw spark sql query?

May 21, 2022

scala apache-spark

java.lang.UnsupportedOperationExceptionfieldIndex on a Row without schema is undefined: Exception on row.getAs[String]

Sep 05, 2022

scala apache-spark

How to select multiple columns of dataset, given a list of column names?

May 08, 2022

java apache-spark apache-spark-sql

Spark decimal type precision loss

Jun 16, 2022

scala apache-spark apache-spark-sql

Comparison of a `float` to `np.nan` in Spark Dataframe

Sep 07, 2022

python numpy apache-spark pyspark nan

How do I get a spark dataframe to print it's explain plan to a string

Nov 17, 2022

scala apache-spark dataframe

How to find the max String length of a column in Spark using dataframe?

Sep 15, 2022

scala apache-spark apache-spark-sql

Spark: How to aggregate/reduce records based on time difference?

Sep 15, 2022

dataframe apache-spark pyspark apache-spark-sql rdd

Reading Excel (.xlsx) file in pyspark

Nov 04, 2022

apache-spark pyspark spark-excel

What is the optimal way to read from multiple Kafka topics and write to different sinks using Spark Structured Streaming?

Aug 26, 2022

apache-spark pyspark apache-kafka spark-structured-streaming

Elasticsearch for spark 3.0

Feb 20, 2022

apache-spark elasticsearch

"'JavaPackage' object is not callable" error executing explain() in Pyspark 3.0.1 via Zeppelin

Aug 29, 2022

apache-spark pyspark

Workaround for Scala RDD not being covariant

Oct 28, 2022

scala types covariance apache-spark

Apache Spark ALS Recommendation Rating values higher than range

Oct 31, 2022

apache-spark machine-learning apache-spark-mllib collaborative-filtering

Spark: Counting co-occurrence - Algorithm for efficient multi-pass filtering of huge collections

Mar 29, 2022

algorithm scala group-by apache-spark filtering

Joining two spark dataframes on time (TimestampType) in python

Oct 02, 2019

join apache-spark apache-spark-sql pyspark

write an RDD into HDFS in a spark-streaming context

Oct 18, 2022

scala hadoop apache-spark hdfs spark-streaming

« Newer Entries Older Entries »