
New posts in apache-spark-sql

Spark - Reading many small parquet files gets status of each file beforehand

Spark 1.6: filtering DataFrames generated by describe()

Does registerTempTable cause the table to get cached?

What does the 'pyspark.sql.functions.window' function's 'startTime' argument do?

How can I print nulls when converting a DataFrame to JSON in Spark

SparkSession initialization error - Unable to use spark.read

Getting OutOfMemoryError: GC overhead limit exceeded in PySpark

Trying to write dataframe to file, getting org.apache.spark.SparkException: Task failed while writing rows

No suitable driver found for JDBC in Spark

How to load CSVs with timestamps in custom format?

Number of partitions of a Spark DataFrame

How to use a subquery for dbtable option in jdbc data source?

Pass variables from Scala to Python in Databricks

How to convert a pyspark.rdd.PipelinedRDD to a DataFrame without using the collect() method in PySpark?

How to use spark-avro package to read avro file from spark-shell?

What row is used in dropDuplicates operator?

How to CREATE TABLE USING delta with Spark 2.4.4?

Find the minimum of a timestamp through a Spark DataFrame groupBy

Config file to define JSON Schema Structure in PySpark

How many SparkSessions can a single application have?