New posts in apache-spark
Spark Object (singleton) serialization on executors
Dec 24, 2025
scala
apache-spark
serialization
singleton
Spark two level aggregation
Dec 24, 2025
apache-spark
Error when reading a file in Spark
Dec 23, 2025
scala
cassandra
apache-spark
datastax-enterprise
pyspark function.lag on condition
Dec 24, 2025
apache-spark
pyspark
apache-spark-sql
Spark/Scala parallel write to redis
Dec 22, 2025
scala
apache-spark
redis
spark-redis
how should I express the hdfs path in spark textfile?
Dec 23, 2025
scala
apache-spark
hdfs
Merge two RDDs in Spark Scala
Dec 23, 2025
scala
apache-spark
Compare rows of two dataframes to find the matching column count of 1's
Dec 23, 2025
apache-spark
pyspark
apache-spark-sql
rdd.saveAsTextFile doesn't seem to work, but repetitions throw FileAlreadyExistsException
Dec 23, 2025
hadoop
apache-spark
Flatten any nested json string and convert to dataframe using spark scala
Dec 23, 2025
json
scala
apache-spark
apache-spark-sql
flatten
how to index categorical features in another way when using spark ml
Dec 23, 2025
apache-spark
apache-spark-mllib
How to get job or application IDs from SparkSession?
Dec 22, 2025
apache-spark
apache-spark-sql
Connect to Spark running on VM
Dec 23, 2025
apache-spark
virtualbox
bigdata
How to get new/updated records from Delta table after upsert using merge?
Dec 23, 2025
apache-spark
databricks
spark-structured-streaming
delta-lake
Spark: RDD Left Outer Join Optimization for Duplicate Keys
Dec 22, 2025
apache-spark
join
rdd
Why does Databricks Connect Test not work on Mac?
Dec 21, 2025
apache-spark
pyspark
databricks