Questions
Linux
Laravel
Mysql
Ubuntu
Git
Menu
HTML
CSS
JAVASCRIPT
SQL
PYTHON
PHP
BOOTSTRAP
JAVA
JQUERY
R
React
Kotlin
×
Linux
Laravel
Mysql
Ubuntu
Git
New posts in amazon-emr
How to compute 'DynamoDB read throughput ratio' while setting up DataPipeline to export DynamoDB data to S3
Mar 14, 2026
amazon-s3
amazon-dynamodb
amazon-emr
amazon-data-pipeline
How to get data from s3 and do some work on it? python and boto
Mar 14, 2026
python
amazon-s3
amazon-ec2
boto
amazon-emr
How to do writeStream a dataframe in console? (Scala Spark Streaming)
Mar 14, 2026
scala
apache-spark
spark-streaming
amazon-emr
Is it possible to use a custom hadoop version with EMR?
Mar 01, 2026
amazon-web-services
apache-spark
hadoop
pyspark
amazon-emr
How to set instance role for EMR clusters launched via data pipeline?
Feb 25, 2026
elastic-map-reduce
amazon-emr
amazon-data-pipeline
What is a good number of partitions in spark as a function of number of executors and threads?
Feb 19, 2026
scala
amazon-web-services
apache-spark
scalability
amazon-emr
All executors dead MinHash LSH PySpark approxSimilarityJoin self-join on EMR cluster
Feb 20, 2026
pyspark
apache-spark-sql
garbage-collection
amazon-emr
minhash
Older Entries »