Software and Big Data developer with 3 years of experience.
Proficient in Apache Spark, Python, AWS, and MySQL, with a disciplined and dedicated approach.
Experienced in setting up and maintaining in-house Spark Standalone clusters sized to project requirements and data volume.
Experienced in developing Spark scripts to generate summaries and patterns of user behavior.
Experienced in anomaly detection on large data sets using machine learning algorithms.
Experienced in building web services (Flask REST framework, Django framework).
Expertise in using AWS through the Python SDK (boto3 library). Created a decider-worker model to automate jobs in Amazon EMR using Amazon SWF, EC2, and S3.
Good knowledge of libraries such as pandas, NumPy, and SQLAlchemy.