New posts in apache-spark
How to pull client logs for Spark jobs submitted via the Apache Livy batches POST method using Airflow
Sep 18, 2025
apache-spark
airflow
livy
Transform column with seconds to human readable duration
Sep 18, 2025
python
apache-spark
apache-spark-sql
pyspark
Distributed Rules Engine
Sep 19, 2025
apache-spark
drools
rule-engine
complex-event-processing
Spark GraphFrames: large dataset and memory issues
Sep 17, 2025
apache-spark
pyspark
amazon-emr
graphframes
List S3 files in PySpark
Sep 18, 2025
python
apache-spark
amazon-s3
pyspark
boto3
Value split is not a member of (String, String)
Sep 18, 2025
scala
apache-spark
apache-kafka
spark-streaming
spark-submit
Generate database schema diagram for Databricks
Sep 18, 2025
apache-spark
database-schema
databricks
diagram
Merge two tables in Scala/Spark
Sep 18, 2025
scala
apache-spark
Spark/Scala load Oracle Table to Hive
Sep 18, 2025
oracle-database
apache-spark
hive
How to find out the driver node for my Spark?
Sep 17, 2025
apache-spark
port
driver
hadoop-yarn
Spark:executor.CoarseGrainedExecutorBackend: Driver Disassociated disassociated
Sep 17, 2025
apache-spark
rdd
Spark: How to parse an array of JSON objects using Spark
Sep 18, 2025
json
apache-spark
apache-spark-sql
schema
How to save data in HDFS with Spark?
Sep 16, 2025
hadoop
apache-spark
hdfs
spark-streaming
Exception in thread "main" java.lang.NoClassDefFoundError: org/apache/spark/streaming/StreamingContext
Sep 16, 2025
scala
apache-spark
intellij-idea
sbt
spark-streaming
AWS EMR - EMR_DefaultRole has insufficient EC2 permissions
Sep 18, 2025
amazon-web-services
apache-spark
amazon-iam
amazon-emr
Is there a way to set a minimum batch size for a pandas_udf in PySpark?
Sep 17, 2025
python
pandas
apache-spark
pyspark
apache-arrow
PySpark - Loop in ForEachBatch leads to "SparkContext should only be created and accessed on the driver" Error
Sep 17, 2025
python
python-3.x
apache-spark
pyspark
Need to release the memory used by unused spark dataframes
Sep 17, 2025
apache-spark
memory
pyspark
How to add an extra column with the current date to a Spark DataFrame
Sep 17, 2025
dataframe
apache-spark
pyspark
apache-spark-sql
Using pyspark groupBy with a custom function in agg
Sep 17, 2025
python
pandas
apache-spark
pyspark