New posts in apache-spark
Read Kafka topic in a Spark batch job
Nov 04, 2022
scala
apache-spark
apache-kafka
spark-streaming
kafka-consumer-api
PySpark: retrieve mean and the count of values around the mean for groups within a dataframe
May 15, 2019
python
sql
apache-spark
apache-spark-sql
window-functions
Running Spark on Linux : $JAVA_HOME not set error
Sep 14, 2022
linux
apache-spark
java-home
ubuntu-16.04
Inspecting GraphX Graph Object
Feb 06, 2017
apache-spark
spark-graphx
GroupByKey with datasets in Spark 2.0 using Java
Aug 11, 2022
java
apache-spark
group-by
dataset
apache-spark-2.0
Outlier detection algorithm spark mllib
May 31, 2022
apache-spark
machine-learning
apache-spark-mllib
outliers
Hadoop Yarn: How to limit dynamic self allocation of resources with Spark?
Sep 07, 2022
hadoop
apache-spark
pyspark
hadoop-yarn
How to make Spark driver resilient to Master restarts?
Oct 27, 2022
apache-spark
apache-spark-standalone
Spark: SAXParseException while writing to Parquet on S3
Apr 26, 2022
scala
hadoop
apache-spark
amazon-s3
How to use "cube" only for specific fields on Spark dataframe?
May 05, 2021
scala
apache-spark
dataframe
apache-spark-sql
cube
Spark: GraphX API OOM errors after unpersisting useless RDDs
Apr 26, 2022
apache-spark
out-of-memory
spark-graphx
How does back pressure property work in Spark Streaming?
Aug 17, 2022
hadoop
apache-spark
spark-streaming
backpressure
Spark Shell with Yarn - Error: Yarn application has already ended! It might have been killed or unable to launch application master
May 27, 2022
hadoop
apache-spark
hadoop-yarn
How to split comma separated string and get n values in Spark Scala dataframe?
Oct 25, 2022
scala
apache-spark
dataframe
apache-spark-sql
spark-dataframe
How to connect with JMX remotely to Spark worker on Dataproc
Nov 17, 2022
apache-spark
hadoop-yarn
google-cloud-dataproc
How to write a Spark custom data source based on FileFormat
Oct 18, 2022
apache-spark
datasource
What causes "unknown resolver null" in Spark Kafka Connector?
Sep 05, 2022
java
apache-spark
apache-kafka
spark-streaming
spark-submit
Is manually managing memory with .unpersist() a good idea?
Jun 14, 2022
scala
apache-spark
garbage-collection
spark-dataframe
maxCategories not working as expected in VectorIndexer when using RandomForestClassifier in pyspark.ml
Oct 31, 2022
apache-spark
machine-learning
pyspark
random-forest
Read Zstandard-compressed file in Spark 2.3.0
Aug 20, 2022
apache-spark
hadoop2
amazon-emr
zstandard