New posts in apache-spark
Pivot String column on Pyspark Dataframe
Aug 30, 2022
python
apache-spark
dataframe
pyspark
apache-spark-sql
Difference between SparkContext, JavaSparkContext, SQLContext, and SparkSession?
Aug 30, 2022
java
scala
apache-spark
rdd
apache-spark-dataset
What is the difference between rowsBetween and rangeBetween?
Oct 22, 2022
sql
apache-spark
pyspark
apache-spark-sql
window-functions
Calculating the averages for each KEY in a Pairwise (K,V) RDD in Spark with Python
Aug 30, 2022
python
apache-spark
aggregate
average
rdd
How do I split an RDD into two or more RDDs?
Aug 22, 2022
apache-spark
pyspark
rdd
Encoder error while trying to map dataframe row to updated row
Oct 29, 2022
scala
apache-spark
apache-spark-sql
apache-spark-dataset
apache-spark-encoders
How to convert unix timestamp to date in Spark
Aug 30, 2022
scala
datetime
apache-spark
timestamp
nscala-time
NoClassDefFoundError com.apache.hadoop.fs.FSDataInputStream when execute spark-shell
Apr 20, 2022
apache-spark
Drop spark dataframe from cache
Aug 30, 2022
apache-spark
apache-spark-sql
spark-streaming
Why does spark-submit and spark-shell fail with "Failed to find Spark assembly JAR. You need to build Spark before running this program."?
Oct 09, 2022
apache-spark
Spark using python: How to resolve Stage x contains a task of very large size (xxx KB). The maximum recommended task size is 100 KB
Jul 30, 2022
apache-spark
spark-streaming
How can I connect to a PostgreSQL database from Apache Spark using Scala?
Aug 30, 2022
scala
apache-spark
psql
Cleanest, most efficient syntax to perform DataFrame self-join in Spark
Aug 30, 2022
apache-spark
dataframe
apache-spark-sql
SparkSQL vs Hive on Spark - Difference and pros and cons?
Aug 30, 2022
apache-spark
hadoop
hive
apache-spark-sql
Compute size of Spark dataframe - SizeEstimator gives unexpected results
Aug 30, 2022
apache-spark
spark-dataframe
build.sbt: how to add spark dependencies
Oct 17, 2022
scala
apache-spark
sbt
spark-streaming
Why spark-shell fails with NullPointerException?
Aug 30, 2022
scala
hadoop
apache-spark
Pyspark convert a standard list to data frame [duplicate]
Aug 26, 2022
python
apache-spark
pyspark
pyspark-sql
What should be the optimal value for spark.sql.shuffle.partitions or how do we increase partitions when using Spark SQL?
Aug 30, 2022
apache-spark
apache-spark-sql
Adding a new column in Data Frame derived from other columns (Spark)
Aug 30, 2022
python
apache-spark
apache-spark-sql
pyspark