Questions
Linux
Laravel
Mysql
Ubuntu
Git
Menu
HTML
CSS
JAVASCRIPT
SQL
PYTHON
PHP
BOOTSTRAP
JAVA
JQUERY
R
React
Kotlin
×
Linux
Laravel
Mysql
Ubuntu
Git
New posts in apache-spark
How to process logs from distributed log broker (Eg Kafka) exactly after 1 week?
Nov 09, 2022
java
python
apache-spark
apache-kafka
apache-storm
spark-nlp : DocumentAssembler initializing failing with 'java.lang.NoClassDefFoundError: org/apache/spark/ml/util/MLWritable$class'
Nov 09, 2022
python
apache-spark
pyspark
johnsnowlabs-spark-nlp
Why is Pandas UDF not being parallelized?
Nov 07, 2022
python
apache-spark
pyspark
databricks
azure-databricks
Get difference between two version of delta lake table
Nov 07, 2022
scala
apache-spark
delta-lake
Spark Structured Streaming program that reads from non-empty Kafka topic (starting from earliest) triggers batches locally, but not on EMR cluster
Nov 08, 2022
apache-spark
apache-kafka
amazon-emr
spark-structured-streaming
saveAsTextFile to s3 on spark does not work, just hangs
Nov 03, 2022
amazon-s3
apache-spark
Apache Spark Native Libraries
Nov 03, 2022
hadoop
64-bit
apache-spark
hadoop-yarn
Drawbacks of Spark Streaming in Comparison With Real Streaming Computing Systems
Nov 01, 2022
distributed-computing
apache-spark
apache-storm
Multipart uploads to Amazon S3 from Apache Spark
Nov 03, 2022
file-upload
amazon-s3
apache-spark
jets3t
How can I make Spark Streaming count the words in a file in a unit test?
Nov 02, 2022
java
unit-testing
apache-spark
spark-streaming
How do I use infinite Scala streams as source in Spark Streaming?
Nov 03, 2022
scala
apache-spark
spark-streaming
Spark MLLib Collaborative Filtering with new user
Nov 03, 2022
apache-spark
apache-spark-mllib
collaborative-filtering
Unable to add a new service with Cloudera Manager within Cloudera Quickstart VM 5.3.0
Nov 03, 2022
apache-spark
cloudera
cloudera-manager
cloudera-quickstart-vm
How does partitions map to tasks in Spark?
Nov 02, 2022
apache-spark
rdd
Spark 1.3.1: cannot read file from S3 bucket, org/jets3t/service/ServiceException
Nov 03, 2022
amazon-ec2
amazon-s3
apache-spark
hadoop2
Apache Spark-Kafka.TaskCompletionListenerException & KafkaRDD$KafkaRDDIterator.close NPE on local cluster(Client Mode)
Nov 01, 2022
java
hadoop
apache-spark
apache-kafka
spark-streaming
parquet.io.ParquetDecodingException: Can not read value at 0 in block -1 in file
Apr 21, 2022
java
hadoop
apache-spark
hive
Why does format("kafka") fail with "Failed to find data source: kafka." (even with uber-jar)?
May 08, 2022
apache-spark
apache-spark-sql
spark-structured-streaming
uberjar
DataFrame error: "overloaded method value filter with alternatives"
Aug 28, 2021
scala
apache-spark
dataframe
ERROR Utils: Uncaught exception in thread SparkListenerBus
Aug 22, 2021
scala
apache-spark
« Newer Entries
Older Entries »