Questions
Linux
Laravel
Mysql
Ubuntu
Git
Menu
HTML
CSS
JAVASCRIPT
SQL
PYTHON
PHP
BOOTSTRAP
JAVA
JQUERY
R
React
Kotlin
×
Linux
Laravel
Mysql
Ubuntu
Git
New posts in apache-spark
Cannot resolve given input columns while sql on dataframe
Mar 07, 2023
scala
apache-spark
Sorting numeric String in Spark Dataset
Mar 07, 2023
scala
apache-spark
apache-spark-dataset
How to pass Spark job properties to DataProcSparkOperator in Airflow?
Mar 07, 2023
apache-spark
airflow
google-cloud-dataproc
airflow-scheduler
google-cloud-composer
How to fix "ImportError: PyArrow >= 0.8.0 must be installed; however, it was not found."?
Mar 05, 2023
apache-spark
pyspark
apache-spark-sql
Spark infer schema with limit during a read.csv
Mar 07, 2023
apache-spark
Remove spaces between single character in string
Mar 06, 2023
regex
scala
apache-spark
regex-group
Why is the "topics" argument of KafkaUtils.createStream() a Map rather then array?
Mar 06, 2023
java
apache-spark
apache-kafka
spark-streaming
How to save spark dataframe to parquet without using INT96 format for timestamp columns?
Mar 04, 2023
apache-spark
avro
parquet
Getting HDFS Location of Hive Table in Spark
Mar 06, 2023
scala
apache-spark
hive
apache-spark-sql
hiveql
Spark-Streaming hangs with kafka starting offset at earliest (Kafka 2, spark 2.4.3)
Mar 05, 2023
apache-spark
apache-kafka
kafka-consumer-api
spark-structured-streaming
Refresh metadata for Dataframe while reading parquet file
Mar 05, 2023
apache-spark
apache-spark-sql
parquet
apache-spark-dataset
Add a new column to a PySpark DataFrame from a Python list
Mar 04, 2023
python
apache-spark
pyspark
apache-spark-sql
pandas_udf error RuntimeError: Result vector from pandas_udf was not the required length: expected 12, got 35
Mar 05, 2023
python
apache-spark
pyspark
What is the Difference between Broadcast hash join and Broadcast Nested loop join in Spark?
Mar 04, 2023
apache-spark
flattening array of struct in pyspark
Mar 05, 2023
apache-spark
pyspark
apache-spark-sql
How to write Kafka Producer in Scala
Mar 05, 2023
scala
apache-spark
apache-kafka
kafka-producer-api
Azure Databricks, could not initialize class org.apache.spark.eventhubs.EventHubsConf
Mar 05, 2023
scala
azure
apache-spark
databricks
azure-databricks
How to use variables in SQL queries?
Mar 04, 2023
apache-spark
apache-spark-sql
databricks
Writing to Google Cloud Storage with v2 algorithm safe?
Mar 04, 2023
apache-spark
apache-spark-sql
google-cloud-storage
Populate a column based on previous value and row Pyspark
Mar 03, 2023
apache-spark
pyspark
apache-spark-sql
« Newer Entries
Older Entries »