apache-spark-sql tutorials

How to get columns from an org.apache.spark.sql row by name?

Oct 26, 2025

Combining csv files with mismatched columns

Oct 25, 2025

csv apache-spark pyspark apache-spark-sql data-analysis

How to create table with nested map on databricks using sql

Oct 24, 2025

sql arrays apache-spark apache-spark-sql databricks

XOR logical condition in PySpark

Oct 24, 2025

pyspark apache-spark-sql

Convert date to ISO week date in Spark

Oct 23, 2025

apache-spark date pyspark apache-spark-sql spark3

How can I append to the same file in HDFS (Spark 2.11)?

Oct 23, 2025

apache-spark apache-spark-sql spark-streaming

How to merge two rows in Spark SQL?

Oct 25, 2025

scala apache-spark apache-spark-sql

Split a column in multiple columns using Spark SQL

Oct 24, 2025

sql apache-spark apache-spark-sql

Relative path in absolute URI Exception while accessing DynamoDB via Glue Data Catalogue in PySpark running on EMR

Oct 24, 2025

amazon-dynamodb apache-spark-sql amazon-emr spark-hive aws-glue-data-catalog

Databricks notebook time out error when calling other notebooks: com.databricks.WorkflowException: java.net.SocketTimeoutException: Read timed out

Oct 24, 2025

apache-spark apache-spark-sql databricks socket-timeout-exception

How to prevent processing files twice with Spark DataFrames

Oct 24, 2025

apache-spark amazon-s3 apache-spark-sql aws-glue

How to read Parquet files under a directory using PySpark?

Oct 22, 2025

python pyspark apache-spark-sql databricks azure-databricks

Is there any way to get max value from a column in Pyspark other than collect()?

Oct 24, 2025

apache-spark pyspark apache-spark-sql

How to express a hex literal in Spark SQL?

Oct 23, 2025

hive apache-spark-sql

Can we create a new table from an existing table with data in PySpark?

Oct 23, 2025

apache-spark-sql

Spark dataframe filter both nulls and spaces

Oct 24, 2025

scala apache-spark-sql

Create a map column in Apache Spark from other columns

Oct 22, 2025

scala apache-spark apache-spark-sql

Replace a for loop with parallel processing in PySpark

Oct 23, 2025

python apache-spark pyspark apache-spark-sql

How to specify sql dialect when creating spark dataframe from JDBC?

Oct 23, 2025

apache-spark jdbc apache-spark-sql apache-spark-2.0 vitess

Maximum number of concurrent tasks in 1 DPU in AWS Glue

Oct 23, 2025

amazon-web-services apache-spark apache-spark-sql aws-glue
