Questions
Linux
Laravel
Mysql
Ubuntu
Git
Menu
HTML
CSS
JAVASCRIPT
SQL
PYTHON
PHP
BOOTSTRAP
JAVA
JQUERY
R
React
Kotlin
×
Linux
Laravel
Mysql
Ubuntu
Git
New posts in apache-spark
Locally change the log level for the zookeeper C client
Aug 17, 2022
logging
apache-spark
apache-zookeeper
mesos
Spark mapWithState shuffles all data to one node
Nov 06, 2022
scala
apache-spark
spark-streaming
How to give predicted and label columns in BinaryClassificationMetrics evaluation for Naive Bayes model
Dec 20, 2019
scala
apache-spark
machine-learning
apache-spark-mllib
apache-spark-ml
Not able to fetch result from hive transaction enabled table through spark-sql
Oct 20, 2022
hadoop
apache-spark
hive
apache-spark-sql
How to write dataframe (obtained from hive table) into hadoop SequenceFile and RCFile?
Oct 16, 2022
apache-spark
apache-spark-sql
spark-dataframe
How to convert RDD to DataFrame in Spark Streaming, not just Spark
Oct 18, 2022
scala
apache-spark
spark-streaming
rdd
Apache Toree and Spark Scala Not Working in Jupyter
Nov 16, 2021
scala
apache-spark
jupyter-notebook
apache-toree
Spark never finishes jobs and stages, JobProgressListener crash
Aug 07, 2021
apache-spark
The root scratch dir: /tmp/hive on HDFS should be writable. Current permissions are: rwx--------- (on Linux)
Jan 18, 2020
apache-spark
hive
apache-spark-sql
spark-dataframe
hiveql
How to implement a ScalaTest FunSuite to avoid boilerplate Spark code and import implicits
Jun 05, 2022
scala
apache-spark
scalatest
Accessing Spark Mllib Bisecting K-means tree data
Apr 15, 2019
apache-spark
apache-spark-mllib
Am I fully utilizing my EMR cluster?
Mar 08, 2022
amazon-web-services
apache-spark
pyspark
elastic-map-reduce
How to log malformed rows from Scala Spark DataFrameReader csv
Feb 05, 2020
scala
csv
logging
apache-spark
How to transform Dataset<Tuple2<String,DeviceData>> to Iterator<DeviceData>
Feb 16, 2021
java
apache-spark
apache-spark-2.0
apache-spark-dataset
Naive install of PySpark to also support S3 access
Oct 24, 2022
python
amazon-web-services
apache-spark
amazon-s3
pyspark
Broadcast a user defined class in Spark
Apr 07, 2022
python
apache-spark
pyspark
Do not discard keys with null values when converting to JSON in PySpark DataFrame
Feb 27, 2022
apache-spark
pyspark
Running Python startup code after modules are loaded
Aug 30, 2022
python
apache-spark
ipython
pyspark
How to use PySpark to load a rolling window from daily files?
May 15, 2022
csv
pandas
apache-spark
pyspark
What is the difference between tensorflow on spark with the default distributed tensorflow 1.0?
Oct 22, 2022
apache-spark
tensorflow
deep-learning
distributed
« Newer Entries
Older Entries »