Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

S3 Batch: time to copy 1m objects between buckets in the same region?

I need to copy 92 million objects from bucket A to bucket B in the same AWS region. I know AWS takes 48 hours to generate an S3 Inventory report, so ... I'm wondering how long it takes to read a manifest of 92 million objects and copy them to another bucket. My object files have an average size of 512KB.

like image 200
juanc4milo Avatar asked Dec 05 '25 05:12

juanc4milo


1 Answers

One option is to use S3DistCp - Amazon EMR, which fires off many parallel copy commands from a Hadoop cluster.

This involves a fair bit of overhead (using an Amazon EMR cluster), but once going it can copy the files quite rapidly by generating parallel API requests to copy the objects.

If you are going to instigate the copy yourself, then you could try something similar to generate many parallel copy requests, rather than simply looping through the list sequentially.

like image 190
John Rotenstein Avatar answered Dec 07 '25 21:12

John Rotenstein



Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!