Questions
Linux
Laravel
Mysql
Ubuntu
Git
Menu
HTML
CSS
JAVASCRIPT
SQL
PYTHON
PHP
BOOTSTRAP
JAVA
JQUERY
R
React
Kotlin
×
Linux
Laravel
Mysql
Ubuntu
Git
New posts in apache-spark
Databricks SQL - CTE namespace (bug?) with temporary views
Dec 07, 2025
sql
apache-spark
databricks
databricks-sql
How to strip headers from all files in RDD, where RDD = sc.textFile("s3n://bucket/*.csv")?
Dec 07, 2025
csv
amazon-s3
header
apache-spark
rdd
Spark LuceneRDD - how does it work
Dec 06, 2025
java
scala
apache-spark
lucene
apache-spark-2.0
Why does collecting dataset fail with org.apache.spark.shuffle.FetchFailedException?
Dec 08, 2025
scala
apache-spark
apache-spark-sql
cassandra
spark-cassandra-connector
Using windowing functions in Spark
Dec 08, 2025
apache-spark
apache-spark-sql
window-functions
How to insert (not save or update) RDD into Cassandra?
Dec 07, 2025
cassandra
apache-spark
Unable to load 25GB dataset in PySpark local mode with 56GB RAM free
Dec 07, 2025
java
python
apache-spark
pyspark
heap-memory
How to load history data when starting Spark Streaming process, and calculate running aggregations
Dec 06, 2025
apache-spark
apache-kafka
spark-streaming
apache-spark-sql
apache-spark-1.4
Linear regression with Spark MLlib only returns monotonic predictions
Dec 06, 2025
scala
apache-spark
linear-regression
apache-spark-mllib
What is appName in SparkContext constructor and what is the usage of it?
Dec 07, 2025
hadoop
apache-spark
How can I configure spark-submit (or DataProc) to download maven dependencies (jars) from GitHub packages?
Dec 06, 2025
apache-spark
ivy
google-cloud-dataproc
spark-submit
github-package-registry
How to get top N elements from an Apache Spark RDD for large N
Dec 07, 2025
algorithm
apache-spark
rdd
Apache spark (graphx) probably not utilizing all the cores and memory
Dec 06, 2025
apache-spark
Calculate time difference between consecutive rows in pairs per group in pyspark
Dec 05, 2025
apache-spark
pyspark
apache-spark-sql
« Newer Entries
Older Entries »