Questions
Linux
Laravel
Mysql
Ubuntu
Git
Menu
HTML
CSS
JAVASCRIPT
SQL
PYTHON
PHP
BOOTSTRAP
JAVA
JQUERY
R
React
Kotlin
×
Linux
Laravel
Mysql
Ubuntu
Git
New posts in apache-spark
Attach column names to elements with Spark and Scala using FlatMap
Nov 09, 2025
scala
apache-spark
flatmap
Impossible to operate on custom type after it is encoded? Spark Dataset
Nov 09, 2025
apache-spark
apache-spark-dataset
kryo
apache-spark-encoders
Validate CSV file columns with Spark
Nov 08, 2025
java
csv
apache-spark
What is the meaning of : Warning in do.call(.f, args, envir = .env) : "what" must be a function or character string
Nov 08, 2025
r
apache-spark
tidyverse
databricks
azure-databricks
The difference on reading files in PySpark between reading the whole directory then filtering and reading a part of the directory?
Nov 08, 2025
apache-spark
pyspark
apache-spark-sql
What is the compatible datatype for bigint in Spark and how can we cast bigint into a spark compatible datatype?
Nov 08, 2025
apache-spark
hadoop
hive
apache-spark-sql
How to aggregate columns into a JSON array?
Nov 07, 2025
apache-spark
apache-spark-sql
Pyspark - Join timestamp window against timestamp values
Nov 06, 2025
apache-spark
pyspark
SparkSQL function require type Decimal
Nov 07, 2025
scala
types
apache-spark
apache-spark-sql
How to set Hadoop fs.s3a.acl.default on AWS EMR?
Nov 06, 2025
scala
apache-spark
hadoop
amazon-s3
amazon-emr
how to add JVM option -Xss512m to spark-submit?
Nov 06, 2025
apache-spark
Writing BigQuery Table from PySpark Dataframe using Dataproc Servereless
Nov 05, 2025
apache-spark
google-bigquery
google-cloud-dataproc
Check every column in a spark dataframe has a certain value
Nov 06, 2025
scala
apache-spark
dataframe
apache-spark-sql
Pyspark handle multiple datetime formats when casting from string to timestamp
Nov 06, 2025
python
apache-spark
pyspark
Scala Spark - empty map on DataFrame column for map(String, Int)
Nov 04, 2025
scala
dictionary
apache-spark
dataframe
to_date gives null on format yyyyww (202001 and 202053)
Nov 03, 2025
date
apache-spark
pyspark
apache-spark-sql
week-number
Minio in docker cluster is not reachable from spark container
Nov 03, 2025
python
python-3.x
apache-spark
pyspark
minio
DeltaTable schema not updating when using `ALTER TABLE ADD COLUMNS`
Nov 04, 2025
python
apache-spark
pyspark
delta-lake
Overwrite a Parquet file with Pyspark
Nov 04, 2025
apache-spark
hadoop
pyspark
parquet
Merging multiple parquet files and creating a larger parquet file in s3 using AWS glue
Nov 04, 2025
amazon-web-services
scala
apache-spark
amazon-s3
aws-glue
« Newer Entries
Older Entries »