Questions Linux Laravel Mysql Ubuntu Git Menu

HTML CSS JAVASCRIPT SQL PYTHON PHP BOOTSTRAP JAVA JQUERY R React Kotlin

New posts in apache-spark

Spark SQL: How to append new row to dataframe table (from another table)

Feb 02, 2019

scala apache-spark apache-spark-sql

How to save a partitioned parquet file in Spark 2.1?

Nov 08, 2022

scala apache-spark apache-spark-sql parquet

How do I delete files in hdfs directory after reading it using scala?

Sep 15, 2022

scala hadoop apache-spark spark-streaming

File already exists error writing new files from dataframe

Dec 03, 2020

apache-spark emr

Kafka Structured Streaming KafkaSourceProvider could not be instantiated

Apr 19, 2021

java python apache-spark pyspark apache-kafka

How to get rid of "Using Spark's default log4j profile: org/apache/spark/log4j-defaults.properties" message?

Sep 06, 2022

log4j apache-spark

Is there a way to filter a field not containing something in a spark dataframe using scala?

Mar 09, 2022

scala apache-spark apache-spark-sql

Spark SQL change format of the number

Nov 18, 2022

scala apache-spark apache-spark-sql

key not found: _PYSPARK_DRIVER_CALLBACK_HOST

Apr 16, 2022

python apache-spark pyspark

Error while using Hive context in spark : object hive is not a member of package org.apache.spark.sql

Mar 27, 2022

apache-spark apache-spark-sql

Scala/Spark version compatibility

Aug 26, 2022

scala apache-spark

Selecting only numeric/string columns names from a Spark DF in pyspark

Dec 21, 2017

python apache-spark pyspark apache-spark-sql

How to allocate more executors per worker in Standalone cluster mode?

Oct 18, 2022

apache-spark

PySpark - Adding a Column from a list of values using a UDF

Oct 03, 2019

python list apache-spark pyspark apache-spark-sql

spark partition data writing by timestamp

Oct 25, 2022

scala apache-spark apache-spark-sql

Invalid Spark URL in local spark session

Jul 26, 2022

apache-spark

UnsatisfiedLinkError: no snappyjava in java.library.path when running Spark MLLib Unit test within Intellij

Feb 10, 2022

scala unit-testing intellij-idea apache-spark

How can I efficiently read multiple json files into a Dataframe or JavaRDD?

Aug 31, 2018

java json apache-spark

spark error RDD type not found when creating RDD

Jun 12, 2019

scala apache-spark apache-spark-sql

What is the best way to define custom methods on a DataFrame?

May 28, 2021

scala apache-spark apache-spark-sql

« Newer Entries Older Entries »