New posts in apache-spark
How do we rank dataframe?
Nov 01, 2022
scala
apache-spark
apache-spark-sql
Submitting spring boot application jar to spark-submit
Feb 07, 2022
spring
apache-spark
spring-boot
Pass system property to spark-submit and read file from classpath or custom path
Feb 28, 2018
java
scala
apache-spark
apache-spark-2.0
spark-submit
How to list files in S3 bucket using Spark Session?
Aug 30, 2022
apache-spark
amazon-s3
apache-spark-sql
Spark: Sort records in groups?
Mar 16, 2017
scala
sorting
apache-spark
SPARK : failure: ``union'' expected but `(' found
Jun 24, 2021
sql
scala
apache-spark
dataframe
apache-spark-sql
How to convert a JSON file to parquet using Apache Spark?
Oct 21, 2022
json
apache-spark
apache-spark-sql
parquet
Spark CrossValidatorModel access other models than the bestModel?
Apr 03, 2022
apache-spark
apache-spark-mllib
cross-validation
apache-spark-1.6
Emit multiple pairs in map operation
Dec 21, 2019
apache-spark
pyspark
Which is efficient, Dataframe or RDD or hiveql?
Aug 24, 2022
apache-spark
apache-spark-sql
spark-dataframe
Error ExecutorLostFailure when running a task in Spark
Aug 28, 2022
apache-spark
pyspark
apache-spark-mllib
collect
Spark Scala Understanding reduceByKey(_ + _)
Oct 14, 2022
scala
apache-spark
word-count
bigdata
Spark Standalone Number Executors/Cores Control
Nov 10, 2022
apache-spark
apache-spark-standalone
Missing SPARK_HOME when using SparkLauncher on AWS EMR cluster
Aug 12, 2017
amazon-web-services
apache-spark
pyspark
emr
amazon-emr
Scalatest and Spark giving "java.io.NotSerializableException: org.scalatest.Assertions$AssertionsHelper"
Mar 09, 2021
scala
apache-spark
serialization
rdd
scalatest
How to skip lines while reading a CSV file as a dataFrame using PySpark?
Apr 23, 2022
apache-spark
pyspark
spark-dataframe
pyspark-sql
How to process a range of hbase rows using spark?
Apr 01, 2022
java
hadoop
bigdata
apache-spark
How to process multi line input records in Spark
Nov 08, 2022
scala
apache-spark
Hive doesn't read partitioned parquet files generated by Spark
Aug 21, 2022
apache-spark
hive
partitioning
partition
parquet
Kafka Producer - org.apache.kafka.common.serialization.StringSerializer could not be found
Sep 15, 2022
apache-spark
apache-kafka
apache-karaf
spark-streaming-kafka