Work on big data software such as HDFS, MapReduce, Hive, Impala, Spark, Oozie, HBase, and Zookeeper. My main interest is Apache Spark running on AWS.