New posts in apache-spark
Gradle download sources failed
Mar 22, 2026
java
apache-spark
gradle
intellij-idea
Null values best practices in Parquet files
Mar 24, 2026
apache-spark
null
parquet
apache-drill
Incrementally add data to Parquet tables in S3
Mar 22, 2026
amazon-s3
apache-spark
apache-spark-sql
parquet
With Delta Lake, how to remove original file after compaction
Mar 22, 2026
apache-spark
spark-streaming
databricks
delta-lake
Spark 1.6: Token can be issued only with Kerberos or web authentication
Mar 23, 2026
hadoop
apache-spark
kerberos
gssapi
keytab
How to define schema of streaming dataset dynamically to write to csv?
Mar 23, 2026
scala
apache-spark
apache-kafka
spark-structured-streaming
spark-csv
How to use "sqlContext" in different notebooks when using one of them as a module (Pyspark)
Mar 23, 2026
python
apache-spark
pyspark
jupyter-notebook
jupyter
AttributeError: 'NoneType' object has no attribute 'write' in PySpark
Mar 23, 2026
apache-spark
apache-spark-sql
pyspark
How to get or create a Hadoop client from a Spark Executor
Mar 23, 2026
scala
apache-spark
hadoop
apache-spark-sql
hdfs
Impala is converting time into GMT: how to avoid that?
Mar 23, 2026
scala
hadoop
apache-spark
hive
impala
Wrapping pyspark Pipeline.__init__ and decorators
Mar 23, 2026
python
apache-spark
pyspark
Pyspark RDD aggregate different value fields differently
Mar 23, 2026
python
apache-spark
pyspark
aggregate
rdd
Databricks: Z-order vs partitionBy
Mar 21, 2026
apache-spark
databricks
partitioning
delta-lake
z-order
Read only the delta between 2 versions of Delta Lake
Mar 23, 2026
apache-spark
pyspark
databricks
azure-synapse
delta-lake
Pass a function with any case class return type as parameter
Mar 22, 2026
scala
apache-spark
dataframe
case-class
classtag
Developing a spark streaming application
Mar 21, 2026
apache-spark
spark-streaming
Convert csv.gz files into Parquet using Spark
Mar 22, 2026
scala
hadoop
amazon-s3
apache-spark
parquet