I am creating a data pipeline to export a DynamoDB table to an S3 bucket. I used the standard template for this in the Data Pipeline console. I have verified that the runsOn field is set to the name of the EMR cluster to be started. However, the EMR activity status is still Waiting_For_Runner. Any ideas why this is so?
Thanks!!!
The pipeline status is simply an overview of a pipeline; to see more information, view the status of individual pipeline components. You can do this by clicking through a pipeline in the console or retrieving pipeline component details using the CLI.
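As a rough sketch of the CLI/API route, the snippet below pulls the `@status` field out of pipeline objects in the shape returned by the Data Pipeline `DescribeObjects` API. The helper names, the sample objects, and their IDs are made up for illustration. `fetch_statuses` assumes boto3 is installed and AWS credentials are configured; it is not exercised here.

```python
def component_statuses(pipeline_objects):
    """Map each pipeline object's id to its '@status' field (None if absent)."""
    statuses = {}
    for obj in pipeline_objects:
        fields = {f["key"]: f.get("stringValue") for f in obj.get("fields", [])}
        statuses[obj["id"]] = fields.get("@status")
    return statuses


def fetch_statuses(pipeline_id, region):
    """Look up live instance objects. Assumes boto3 and valid AWS credentials."""
    import boto3  # imported lazily so component_statuses works without boto3

    client = boto3.client("datapipeline", region_name=region)
    found = client.query_objects(pipelineId=pipeline_id, sphere="INSTANCE")
    ids = found.get("ids", [])[:25]  # describe_objects takes a limited batch of ids
    if not ids:
        return {}
    resp = client.describe_objects(pipelineId=pipeline_id, objectIds=ids)
    return component_statuses(resp["pipelineObjects"])


# Hypothetical sample data in the DescribeObjects response shape.
sample_objects = [
    {"id": "@EmrActivity_example", "name": "EmrActivity",
     "fields": [{"key": "@status", "stringValue": "WAITING_FOR_RUNNER"}]},
    {"id": "@EmrCluster_example", "name": "EmrCluster",
     "fields": [{"key": "@status", "stringValue": "RUNNING"}]},
]
print(component_statuses(sample_objects))
```

Looking at the status of the cluster resource next to the activity helps tell "the cluster never came up" apart from "the cluster is up but nothing is polling for work".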
The --workerGroup option specifies the name of your worker group, which must be the same value as specified in your pipeline for tasks to be processed. The --region option specifies the service region from which to pull tasks to execute.
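To make the matching requirement concrete, here is a minimal sketch that builds a Task Runner command line from those two options and refuses to build it when the worker group differs from the one in the pipeline definition. The jar file name, the `java -jar` invocation, and the helper name are assumptions for illustration; a real launch normally needs further options (such as credentials) that are left out here.

```python
def task_runner_argv(worker_group, region, pipeline_worker_group,
                     jar="TaskRunner-1.0.jar"):  # jar name is an assumption
    """Build an argv list for Task Runner using only the two options above."""
    if worker_group != pipeline_worker_group:
        # Tasks are only handed to runners whose worker group matches exactly.
        raise ValueError(
            f"worker group {worker_group!r} does not match "
            f"pipeline value {pipeline_worker_group!r}"
        )
    return [
        "java", "-jar", jar,
        f"--workerGroup={worker_group}",
        f"--region={region}",
    ]


# Hypothetical values, for illustration only.
argv = task_runner_argv("wg-example", "us-east-1", "wg-example")
print(" ".join(argv))
```

The comparison is deliberately exact and case-sensitive, since the text above says the value must be the same as the one specified in the pipeline.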
AWS Data Pipeline is a web service that helps you reliably process and move data between different AWS compute and storage services, as well as on-premises data sources, at specified intervals.
Waiting_For_Runner means Data Pipeline is waiting for the Task Runner on the EMR cluster to pick up the work, i.e. it has not yet been able to connect to EMR.
A few things you can check:
$ ps -ef | grep workerGroup
(Master Instance)
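The `ps` check can also be scripted if you want to run it repeatedly on the master instance. This is only a sketch: the function names and the sample `ps` output are made up, and the filter simply keeps lines that mention `workerGroup` while dropping any line containing `grep`.

```python
import subprocess


def find_worker_group_processes(ps_output, needle="workerGroup"):
    """Return ps lines that mention the needle, skipping lines that contain 'grep'."""
    return [
        line for line in ps_output.splitlines()
        if needle in line and "grep" not in line
    ]


def running_task_runners():
    """Run `ps -ef` on this machine and filter its output."""
    out = subprocess.run(
        ["ps", "-ef"], capture_output=True, text=True, check=True
    ).stdout
    return find_worker_group_processes(out)


# Hypothetical ps output, for illustration only.
sample_ps = (
    "UID        PID  PPID  C STIME TTY          TIME CMD\n"
    "hadoop    4321     1  0 10:00 ?        00:00:05 java -jar TaskRunner-1.0.jar "
    "--workerGroup=df-example --region=us-east-1\n"
    "hadoop    4400  4300  0 10:05 pts/0    00:00:00 grep workerGroup\n"
)
matches = find_worker_group_processes(sample_ps)
print(len(matches))
```

An empty result from `running_task_runners()` would suggest that no runner process with a `workerGroup` argument is alive on that machine.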