Questions Linux Laravel Mysql Ubuntu Git Menu

HTML CSS JAVASCRIPT SQL PYTHON PHP BOOTSTRAP JAVA JQUERY R React Kotlin

New posts in apache-spark

Why Iterator of Series to Iterator of Series pandasUDF (PandasUDFType.SCALAR_ITER) when Series to Series (PandasUDFType.SCALAR) is available?

Jan 31, 2026

apache-spark pyspark apache-spark-sql

How to calculate percentage over a dataframe

Jan 31, 2026

python apache-spark pyspark

spark repartition data for small file

Jan 31, 2026

java hadoop apache-spark hadoop-partitioning

How to build and run Scala Spark locally

Jan 31, 2026

eclipse scala maven apache-spark

Delta lake incremental manifest files generation

Jan 30, 2026

python apache-spark amazon-athena delta-lake

How to find the top level hierarchy of one column from another column in pyspark?

Jan 31, 2026

python apache-spark pyspark apache-spark-sql

Start spark standalone master with Upstart

Jan 31, 2026

apache-spark upstart

spark master goes down with out of memory exception

Jan 31, 2026

apache-spark

Sorting a DStream and taking topN

Jan 30, 2026

scala apache-spark spark-streaming top-n dstream

In Apache Spark how can I group all the rows of an RDD by two shared values?

Jan 31, 2026

scala apache-spark cassandra rdd

slf4j-log4j12.jar and log4j-over-slf4j.jar in same path while dependency is getting resolved in Maven POM

Jan 29, 2026

apache-spark slf4j apache-drill log4j

Remove a suffix if present on a string column of a DataFrame

Jan 31, 2026

apache-spark dataframe

Spark Scala CSV Input to Nested Json

Jan 30, 2026

scala apache-spark apache-spark-sql

How should I configure Spark to correctly prune Hive Metastore partitions?

Jan 28, 2026

apache-spark hive apache-spark-sql

« Newer Entries Older Entries »