New posts in apache-spark
Passing typesafe config conf files to DataProcSparkOperator
Feb 21, 2026
apache-spark
airflow
google-cloud-dataproc
typesafe-config
google-cloud-composer
Google Dataproc in-cluster encryption
Feb 21, 2026
apache-spark
encryption
google-cloud-platform
google-cloud-dataproc
Adding an extra column that represents the difference between the closest difference of a previous column
Feb 22, 2026
sql
apache-spark
apache-spark-sql
Livy curl request error for Kerberos Cloudera Hadoop
Feb 21, 2026
apache-spark
hadoop
cloudera
livy
What nodes are used in aggregation and reduction for reduce?
Feb 20, 2026
apache-spark
Flattening JSON into Tabular Structure using Spark-Scala RDD only function
Feb 21, 2026
scala
apache-spark
rdd
saveAsNewAPIHadoopFile() giving error when used as output format
Feb 20, 2026
scala
apache-spark
Is there a way to sample a Spark RDD for exactly a specified number of elements instead of a percentage?
Feb 20, 2026
apache-spark
rdd
scala - convert each json row to table
Feb 20, 2026
scala
apache-spark
apache-spark-sql
Schema order change after join operation in Spark (JAVA)
Feb 21, 2026
java
join
apache-spark
multiple-columns
Rename all columns after all columns aggregation [duplicate]
Feb 20, 2026
python
apache-spark
dataframe
pyspark
aggregate
Handle null/NaN values in Spark MLlib classifier
Feb 21, 2026
apache-spark
classification
random-forest
apache-spark-mllib
What is a good number of partitions in Spark as a function of number of executors and threads?
Feb 19, 2026
scala
amazon-web-services
apache-spark
scalability
amazon-emr
See progress while "iterating" over Dataframe
Feb 19, 2026
dataframe
apache-spark
plsql
pyspark
progress-bar
No such table while writing to sqlite3 database from Pyspark via JDBC
Feb 19, 2026
sqlite
jdbc
apache-spark
pyspark