Questions
Linux
Laravel
Mysql
Ubuntu
Git
Menu
HTML
CSS
JAVASCRIPT
SQL
PYTHON
PHP
BOOTSTRAP
JAVA
JQUERY
R
React
Kotlin
×
Linux
Laravel
Mysql
Ubuntu
Git
New posts in apache-spark
Rerun Scala code with -deprecation using Apache Zeppelin
Mar 29, 2022
scala
apache-spark
apache-zeppelin
one-hot encode of multiple string categorical features using Spark DataFrames
Jun 21, 2022
python
apache-spark
pyspark
apache-spark-sql
bigdata
Getting error while reading from S3 server using pyspark : [java.lang.IllegalArgumentException]
Mar 01, 2022
python
apache-spark
amazon-s3
pyspark
Spark/k8s: How to run spark submit on Kubernetes with client mode
Apr 30, 2022
docker
apache-spark
kubernetes
Aggregate while dropping duplicates in pyspark
Jul 02, 2022
dataframe
apache-spark
pyspark
apache-spark-sql
databricks
Spark not ignoring empty partitions
Sep 27, 2022
performance
apache-spark
amazon-s3
partitioning
parquet
Low parallelism when running Apache Beam wordcount pipeline on Spark with Python SDK
Jul 02, 2022
python
apache-spark
apache-beam
How to run a Spark-java program from command line [closed]
Aug 26, 2022
hadoop
hdfs
apache-spark
Apache Spark Throws java.lang.IllegalStateException: unread block data
Aug 07, 2021
scala
hadoop
hdfs
apache-spark
Spark Standalone Mode multiple shell sessions (applications)
Jul 02, 2022
apache-spark
Specifying the output file name in Apache Spark
Aug 25, 2022
python
apache-spark
Spark - convert string IDs to unique integer IDs
Jan 26, 2022
apache-spark
Usage of local variables in closures when accessing Spark RDDs
Mar 26, 2022
closures
apache-spark
rdd
pyspark
How do you read and write from/into different ElasticSearch clusters using spark and elasticsearch-hadoop?
Nov 12, 2022
apache-spark
elasticsearch
hdfs
elasticsearch-hadoop
distributed-filesystem
How to format data for the spark mlib kmeans clustering algorithm?
Nov 06, 2022
java
algorithm
machine-learning
apache-spark
How to extract complex JSON structures using Apache Spark 1.4.0 Data Frames
Nov 21, 2022
apache-spark
apache-spark-sql
If the one partition is lost, we can use lineage to reconstruct it. Will the base RDD be loaded again?
Oct 31, 2022
apache-spark
rdd
Use Serializable lambda in Spark JavaRDD transformation
Aug 27, 2022
java
lambda
apache-spark
serializable
How does Scala compiler handle unused variable values?
Oct 13, 2020
performance
scala
memory
apache-spark
Can I run a Time Series Database (TSDB) over Apache Spark?
May 04, 2021
database
apache-spark
time-series
bigdata
« Newer Entries
Older Entries »