Questions
Linux
Laravel
Mysql
Ubuntu
Git
Menu
HTML
CSS
JAVASCRIPT
SQL
PYTHON
PHP
BOOTSTRAP
JAVA
JQUERY
R
React
Kotlin
×
Linux
Laravel
Mysql
Ubuntu
Git
Glennie Helles Sindholt
Glennie Helles Sindholt has asked
3
questions and find answers to
67
problems.
Stats
4.9k
EtPoint
2.2k
Vote count
3
questions
67
answers
About
I have a Ph.D in computer science and my current passion is Big Data and in particular Spark :)
Glennie Helles Sindholt questions
How to control number of parquet files generated when using partitionBy
Is groupByKey ever preferred over reduceByKey
Spark code organization and best practices [closed]
Glennie Helles Sindholt answers
Spark standalone mode on AWS EMR
efficiently get joined and not joined data of a dataframe against other dataframe
When does EMR bootstrap actions run
How we can sort and group data from the Spark RDDs?
Stop hadoop/EMR/AWS creating S3 paths with _$folder$ extensions
cast schema of a data frame in Spark and Scala
Spark Dataframe distinguish columns with duplicated name
How to access elemens in Row RDD in SCALA
write a spark Dataset to json with all keys in the schema, including null columns
How to load only the data of the last partition