Questions
Linux
Laravel
Mysql
Ubuntu
Git
Menu
HTML
CSS
JAVASCRIPT
SQL
PYTHON
PHP
BOOTSTRAP
JAVA
JQUERY
R
React
Kotlin
×
Linux
Laravel
Mysql
Ubuntu
Git
New posts in apache-spark
Add all the dates (week) between two dates in new Row in spark Scala
Nov 30, 2025
scala
apache-spark
apache-spark-sql
partitioning
Create a new column by replacing comma-separated column's values with a lookup based on another dataframe
Nov 29, 2025
python
apache-spark
pyspark
apache-spark-sql
How is task distributed in spark
Nov 30, 2025
apache-spark
distributed-system
How to read a Json file with a specific format with Spark Scala?
Nov 29, 2025
json
scala
apache-spark
How to get the latest date from listed dates along with the total count?
Nov 29, 2025
scala
apache-spark
apache-spark-sql
Spark saving RDD[(Int, Array[Double])] to text file got strange result
Nov 29, 2025
apache-spark
apache-spark-mllib
How to make predictions with Linear Regression Model?
Nov 28, 2025
java
apache-spark
linear-regression
apache-spark-ml
How to broadcast large variable to local disk of each node in Spark
Nov 29, 2025
hadoop
apache-spark
broadcast
Spark history server filter jobs by user id or time
Nov 29, 2025
apache-spark
apache-spark-sql
spark-streaming
Spark not able to find checkpointed data in HDFS after executor fails
Nov 29, 2025
apache-spark
spark-streaming
spark-checkpoint
Does PySpark code run in JVM or Python subprocess?
Nov 28, 2025
python
apache-spark
pyspark
Spark read JDBC from SAS IOM
Nov 29, 2025
apache-spark
sas
Spark + Yarn: How to retain logs of lost-executors
Nov 28, 2025
hadoop
logging
apache-spark
hadoop-yarn
How many times K-means Spark Streaming processed the same data?
Nov 28, 2025
algorithm
apache-spark
k-means
spark-streaming
How to drop duplicates using conditions [duplicate]
Nov 28, 2025
scala
apache-spark
apache-spark-sql
« Newer Entries
Older Entries »