apache-spark-sql tutorials

remove last character from string

Oct 26, 2025

apache-spark pyspark apache-spark-sql

Spark CSV package not able to handle \n within fields

Oct 25, 2025

scala apache-spark apache-spark-sql spark-csv apache-spark-1.6

Execution of cmd cells in databricks notebook based on some condition

Oct 27, 2025

pyspark apache-spark-sql databricks azure-databricks spark-notebook

pyspark dataframe cube method returning duplicate null values

Oct 27, 2025

python python-2.7 apache-spark pyspark apache-spark-sql

PySpark: How to extract variables from a struct nested in a struct inside an array?

Oct 27, 2025

python dataframe pyspark apache-spark-sql

Strange error while writing parquet file to s3

Oct 27, 2025

scala apache-spark amazon-s3 apache-spark-sql amazon-emr

Usage of custom Python object in Pyspark UDF

Oct 26, 2025

python apache-spark pyspark apache-spark-sql

Is there an idiomatic way to cache Spark dataframes?

Oct 26, 2025

dataframe apache-spark pyspark apache-spark-sql

How to use salting technique for joining data frames having skewed data

Oct 25, 2025

apache-spark pyspark apache-spark-sql skew

Is it possible to force schema definition when loading tables from AWS RDS (MySQL)

Oct 25, 2025

mysql amazon-web-services apache-spark apache-spark-sql

Adding line numbers when parsing many CSV files with Spark

Oct 25, 2025

csv apache-spark apache-spark-sql

Filtering and counting negative/positive values from a Spark dataframe using pyspark?

Oct 26, 2025

apache-spark pyspark apache-spark-sql

List to DataFrame in pyspark

Oct 26, 2025

pyspark apache-spark-sql

How to conditionally remove the first two characters from a column

Oct 25, 2025

scala apache-spark hadoop apache-spark-sql hive

pyspark: groupby and aggregate avg and first on multiple columns

Oct 25, 2025

pyspark apache-spark-sql

Explode array values using PySpark

Oct 26, 2025

apache-spark hadoop pyspark apache-spark-sql

How to get columns from an org.apache.spark.sql row by name?

Oct 26, 2025

scala apache-spark apache-spark-sql spark-streaming

Combining csv files with mismatched columns

Oct 25, 2025

csv apache-spark pyspark apache-spark-sql data-analysis

How to create table with nested map on databricks using sql

Oct 24, 2025

sql arrays apache-spark apache-spark-sql databricks

Xor logical condition in pyspark

Oct 24, 2025

pyspark apache-spark-sql

New posts in apache-spark-sql