Questions
Linux
Laravel
Mysql
Ubuntu
Git
Menu
HTML
CSS
JAVASCRIPT
SQL
PYTHON
PHP
BOOTSTRAP
JAVA
JQUERY
R
React
Kotlin
×
Linux
Laravel
Mysql
Ubuntu
Git
New posts in apache-spark
Extracting several regex matches in PySpark
Apr 24, 2026
python
regex
string
apache-spark
pyspark
How to combine or merge two sparse vectors in Spark using Java?
Apr 24, 2026
java
apache-spark
sparse-matrix
apache-spark-mllib
Spark get datatype of nested object
Apr 24, 2026
arrays
apache-spark
dataframe
apache-spark-sql
DataFrame.count() == 0 Vs DataFrame.rdd.isEmpty(): please compare for execution speed
Apr 25, 2026
scala
apache-spark
apache-spark-sql
Compare and Highlight the differences of two dataframes using spark and java
Apr 26, 2026
java
dataframe
apache-spark
apache-spark-sql
Where is Spark Streamings state stored?
Apr 25, 2026
apache-spark
spark-streaming
Local Kafka Application failing with: NoSuchMethodError: createEphemeral
Apr 26, 2026
apache-spark
apache-kafka
producer-consumer
apache-zookeeper
How to count the number of occurence of a key in pyspark dataframe (2.1.0)
Apr 25, 2026
python
apache-spark
pyspark
apache-spark-2.0
Dynamically select multiple columns while joining different Dataframe in Scala Spark
Apr 25, 2026
scala
apache-spark
dataframe
apache-spark-sql
NoSuchMethodError while running Spark Streaming job on HDP 2.2
Apr 25, 2026
scala
apache-spark
hortonworks-data-platform
spark-streaming
why spark sort is slower than scala original sort method
Apr 24, 2026
scala
sorting
apache-spark
Spark structured streaming of Kafka protobuf
Apr 25, 2026
scala
apache-spark
protocol-buffers
spark-streaming
scalapb
Apache Spark write to MySQL with JDBC connector (Write Mode: Ignore) is not performing as expected [duplicate]
Apr 24, 2026
mysql
apache-spark
jdbc
pyspark
apache-spark-sql
How to pass DataSet(s) to a function that accepts DataFrame(s) as arguments in Apache Spark using Scala?
Apr 25, 2026
scala
apache-spark
apache-spark-sql
apache-spark-dataset
How to implement a custom Pyspark explode (for array of structs), 4 columns in 1 explode?
Apr 23, 2026
python-3.x
apache-spark
pyspark
apache-spark-sql
Add batch number to DataFrame based on moving sum in spark
Apr 23, 2026
python
dataframe
apache-spark
pyspark
spark streaming DirectKafkaInputDStream: kafka data source can easily stress the driver node
Apr 24, 2026
apache-spark
apache-kafka
spark-streaming
« Newer Entries
Older Entries »