Questions
Linux
Laravel
Mysql
Ubuntu
Git
Menu
HTML
CSS
JAVASCRIPT
SQL
PYTHON
PHP
BOOTSTRAP
JAVA
JQUERY
R
React
Kotlin
×
Linux
Laravel
Mysql
Ubuntu
Git
New posts in apache-spark
Window Function Tie breaker on other field to get the Latest Record
Oct 18, 2022
sql
apache-spark
pyspark
apache-spark-sql
pyspark-sql
How to set optimal config values - trigger time, maxOffsetsPerTrigger - for Spark Structured Streaming while reading messages from Kafka?
Oct 17, 2022
apache-spark
apache-kafka
spark-streaming
spark-structured-streaming
structured streaming Kafka 2.1->Zeppelin 0.8->Spark 2.4: spark does not use jar
Oct 18, 2022
python
apache-spark
pyspark
apache-kafka
apache-zeppelin
Cross account GCS access using Spark on Dataproc
Oct 18, 2022
apache-spark
google-cloud-platform
google-bigquery
google-cloud-storage
google-cloud-dataproc
How to overwrite a parquet file from where DataFrame is being read in Spark
Oct 18, 2022
python
apache-spark
metadata
parquet
How to call a web service called from a Spark job?
Oct 18, 2022
apache-spark
apache-spark-sql
spark-structured-streaming
How does parquet determine which encoding to use?
Oct 16, 2022
apache-spark
hadoop
hive
parquet
Scala module requiring specific version of data bind for Spark
Oct 17, 2022
java
scala
apache-spark
jackson-databind
Spark: Dataframe Serialization
Jun 14, 2022
scala
apache-spark
serialization
spark-dataframe
kryo
How to encode optional fields in spark dataset with java?
Aug 12, 2022
java
apache-spark
option-type
encoder
How can PySpark be called in debug mode?
Sep 11, 2022
python
python-2.7
hadoop
intellij-idea
apache-spark
spark streaming checkpoint recovery is very very slow
Sep 13, 2022
apache-spark
amazon-s3
spark-streaming
amazon-kinesis
checkpointing
How to change case of whole column to lowercase?
Oct 03, 2022
java
apache-spark
apache-spark-sql
apache-spark-dataset
Spark Standalone Mode: How to compress spark output written to HDFS
Feb 23, 2022
scala
compression
hdfs
apache-spark
How to create a custom Estimator in PySpark
May 21, 2020
python
apache-spark
pyspark
apache-spark-mllib
apache-spark-ml
Error to start pre-built spark-master when slf4j is not installed
Oct 30, 2022
apache-spark
Spark sql queries vs dataframe functions
Oct 21, 2022
sql
performance
apache-spark
dataframe
apache-spark-sql
SparkContext Error - File not found /tmp/spark-events does not exist
Oct 05, 2021
python
amazon-web-services
apache-spark
amazon-ec2
pyspark
How to shuffle the rows in a Spark dataframe?
Sep 08, 2022
scala
apache-spark
dataframe
apache-spark-sql
Spark : Read file only if the path exists
Apr 30, 2022
scala
apache-spark
parquet
« Newer Entries
Older Entries »