New posts in apache-spark
java.io.InvalidClassException: org.apache.spark.internal.io.HadoopMapReduceCommitProtocol; local class incompatible
Nov 08, 2022
java
hadoop
apache-spark
cluster-computing
Spark deploy-related properties in spark-submit
Nov 08, 2022
java
apache-spark
Spark Structured Streaming with Kafka - How to repartition the data and distribute the processing among worker nodes
Nov 09, 2022
scala
apache-spark
apache-kafka
spark-structured-streaming
spark-kafka-integration
Pyspark - Failed to locate the winutils binary in the hadoop binary path [duplicate]
Nov 09, 2022
python
apache-spark
pyspark
Custom state store provider for Apache Spark on Mesos
Nov 08, 2022
apache-spark
mesos
spark-structured-streaming
Convert Spark DataFrame schema to new schema
Nov 09, 2022
scala
apache-spark
dataframe
Java Read Parquet File to JSON Output
Nov 10, 2022
java
json
apache-spark
hadoop
parquet
Pyspark SQL Pandas UDF: Returning an array
Nov 08, 2022
python
apache-spark
pyspark
databricks
user-defined-functions
Spark 2.x + Tika: java.lang.NoSuchMethodError: org.apache.commons.compress.archivers.ArchiveStreamFactory.detect
Nov 09, 2022
apache-spark
apache-tika
cloudera-cdh
Writing Parquet files with Scala for spark without spark as dependency
Nov 09, 2022
scala
apache-spark
parquet
Compile multiple jars from single source project using Gradle
Nov 08, 2022
scala
apache-spark
gradle
Merging rows into a single struct column in Spark Scala has efficiency problems. How do we do it better?
Nov 10, 2022
scala
apache-spark
Handling schema mismatches in Spark
Nov 09, 2022
scala
apache-spark
How can I maintain a temporary dictionary in a PySpark application?
Nov 09, 2022
python
apache-spark
pyspark
word2vec
fasttext
Is there a compatibility matrix for Hadoop components?
Nov 08, 2022
apache-spark
hadoop
PySpark Array<double> is not Array<double>
Nov 09, 2022
apache-spark
pyspark
apache-spark-ml
Read timed out Httpfs HDFS
Nov 09, 2022
scala
apache-spark
kubernetes
hdfs
Unable to groupBy MapType column within Spark DataFrame
Nov 09, 2022
scala
apache-spark
Why am I getting an exception when using a Range Join hint?
Nov 08, 2022
python
apache-spark
pyspark-sql
databricks
azure-databricks
could not find function "switch_lang"
Nov 09, 2022
r
apache-spark
dplyr
sparklyr
rlang