We are designing a big data solution for one of our dashboard applications and are seriously considering AWS Glue for our initial ETL. Glue currently supports JDBC and S3 as targets, but our downstream services and components will work better with DynamoDB. We are wondering what the best approach is to eventually move the records from Glue to DynamoDB.
Should we write to S3 first and then run Lambdas to insert the data into DynamoDB? Is that the best practice? Or should we use a third-party JDBC wrapper for DynamoDB and have Glue write to DynamoDB directly (not sure if this is even possible, and it sounds a bit scary)? Or should we do something else?
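For context, the S3-first option we have in mind would be something like the Lambda sketch below. The bucket, table name, and JSON-lines file format are placeholder assumptions; nothing is settled yet.

import json
import boto3

s3 = boto3.client('s3')
table = boto3.resource('dynamodb').Table('dashboard-records')  # placeholder table

def handler(event, context):
    # Triggered by S3 object-created events; each object is assumed to be JSON lines.
    for record in event['Records']:
        bucket = record['s3']['bucket']['name']
        key = record['s3']['object']['key']
        body = s3.get_object(Bucket=bucket, Key=key)['Body'].read()
        for line in body.decode('utf-8').splitlines():
            table.put_item(Item=json.loads(line))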
Any help is greatly appreciated. Thanks!
You can now crawl your Amazon DynamoDB tables, extract associated metadata, and add it to the AWS Glue Data Catalog.
AWS Glue supports writing data into another AWS account's DynamoDB table.
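For example, once a crawler has registered the DynamoDB table in the Data Catalog, an ETL job can read it back through the catalog entry. A minimal sketch, assuming placeholder database and table names:

from pyspark.context import SparkContext
from awsglue.context import GlueContext

glueContext = GlueContext(SparkContext.getOrCreate())

# Read the crawled DynamoDB table via its Data Catalog entry.
dyf = glueContext.create_dynamic_frame.from_catalog(
    database="my_catalog_db",   # placeholder catalog database
    table_name="pceg_ae_test"   # placeholder catalog table
)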
You can add the following lines to your Glue ETL script:
from awsglue.dynamicframe import DynamicFrame

glueContext.write_dynamic_frame.from_options(
    frame=DynamicFrame.fromDF(df, glueContext, "final_df"),
    connection_type="dynamodb",
    connection_options={"tableName": "pceg_ae_test"}
)
Here df is a Spark DataFrame; DynamicFrame.fromDF converts it into the DynamicFrame that write_dynamic_frame expects.
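If the job throttles the target table, the DynamoDB connection options also let you cap how much write capacity it consumes. A sketch with the same placeholder table name:

glueContext.write_dynamic_frame.from_options(
    frame=DynamicFrame.fromDF(df, glueContext, "final_df"),
    connection_type="dynamodb",
    connection_options={
        "tableName": "pceg_ae_test",
        # Fraction of the table's write capacity the job may consume.
        "dynamodb.throughput.write.percent": "0.5"
    }
)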
I am able to write using boto3... It's definitely not the best approach for a bulk load, but it works. :)
import boto3

dynamodb = boto3.resource('dynamodb', region_name='us-east-1')
table = dynamodb.Table('BULK_DELIVERY')

print("Start testing")
for row in df1.rdd.collect():
    var1 = row.sourceCid
    print(var1)
    table.put_item(Item={'SOURCECID': "{}".format(var1)})
print("End testing")