Questions
Linux
Laravel
Mysql
Ubuntu
Git
Menu
HTML
CSS
JAVASCRIPT
SQL
PYTHON
PHP
BOOTSTRAP
JAVA
JQUERY
R
React
Kotlin
×
Linux
Laravel
Mysql
Ubuntu
Git
New posts in apache-spark
Why does the Scala compiler fail with missing parameter type for filter with JavaSparkContext?
Apr 20, 2026
scala
apache-spark
calculate percentile of column over window in pyspark
Apr 20, 2026
apache-spark
pyspark
apache-spark-sql
How to Split the Predicted Probabilities Produced by ML Pileline Logistic Regression
Apr 20, 2026
scala
apache-spark
apache-spark-ml
What happens if we use broadcast in the larger table?
Apr 20, 2026
apache-spark
pyspark
Resource optimization/utilization in EMR for long running job and multiple small running jobs
Apr 20, 2026
apache-spark
hadoop
hadoop-yarn
amazon-emr
long-running-processes
PySpark Distinct List of Each of the Keys from an RDD
Apr 20, 2026
python
apache-spark
pyspark
rdd
Spark Streaming reading from local file gives NullPointerException
Apr 19, 2026
apache-spark
nullpointerexception
spark-streaming
How to extract values from key value map?
Apr 17, 2026
dataframe
scala
apache-spark
dictionary
apache-spark-sql
Spark SVD is not reproducible
Apr 18, 2026
apache-spark
apache-spark-mllib
apache-spark-ml
svd
non-deterministic
Not enough replicas available for query at consistency LOCAL_ONE (1 required but only 0 alive)
Apr 19, 2026
apache-spark
cassandra
spark-cassandra-connector
Spark last 30 days filter, best approach to improve performance
Apr 19, 2026
performance
scala
hadoop
apache-spark
statistics
Requirement failed in LogisticRegressionModel.predict
Apr 18, 2026
java
apache-spark
apache-spark-mllib
Scala Spark sort RDD by index of substring
Apr 19, 2026
scala
apache-spark
« Newer Entries
Older Entries »