What is the simplest way to get a list of all items within an S3 bucket using Java? <pre class="prettyprint"><code>List<S3ObjectSummary> s3objects = s3.listObjects(bucketName,prefix).getObjectSummaries(); </code></pre> This example only returns 1000 items.

It might be a workaround but this solved my problem: <pre class="prettyprint"><code>ObjectListing listing = s3.listObjects( bucketName, prefix ); List<S3ObjectSummary> summaries = listing.getObjectSummaries(); while (listing.isTruncated()) { listing = s3.listNextBatchOfObjects (listing); summaries.addAll (listing.getObjectSummaries()); } </code></pre>

How to list all AWS S3 objects in a bucket using Java

Tags:

java

amazon-web-services

amazon-s3

What is the simplest way to get a list of all items within an S3 bucket using Java?

List<S3ObjectSummary> s3objects = s3.listObjects(bucketName,prefix).getObjectSummaries();

This example only returns 1000 items.

803

asked Nov 06 '11 13:11

Ron D.

2 Answers

It might be a workaround but this solved my problem:

ObjectListing listing = s3.listObjects( bucketName, prefix ); List<S3ObjectSummary> summaries = listing.getObjectSummaries();  while (listing.isTruncated()) {    listing = s3.listNextBatchOfObjects (listing);    summaries.addAll (listing.getObjectSummaries()); }

answered Sep 21 '22 06:09

Ron D.

For those, who are reading this in 2018+. There are two new pagination-hassle-free APIs available: one in AWS SDK for Java 1.x and another one in 2.x.

1.x

There is a new API in Java SDK that allows you to iterate through objects in S3 bucket without dealing with pagination:

AmazonS3 s3 = AmazonS3ClientBuilder.standard().build();  S3Objects.inBucket(s3, "the-bucket").forEach((S3ObjectSummary objectSummary) -> {     // TODO: Consume `objectSummary` the way you need     System.out.println(objectSummary.key); });

This iteration is lazy:

The list of S3ObjectSummarys will be fetched lazily, a page at a time, as they are needed. The size of the page can be controlled with the withBatchSize(int) method.

2.x

The API changed, so here is an SDK 2.x version:

S3Client client = S3Client.builder().region(Region.US_EAST_1).build(); ListObjectsV2Request request = ListObjectsV2Request.builder().bucket("the-bucket").prefix("the-prefix").build(); ListObjectsV2Iterable response = client.listObjectsV2Paginator(request);  for (ListObjectsV2Response page : response) {     page.contents().forEach((S3Object object) -> {         // TODO: Consume `object` the way you need         System.out.println(object.key());     }); }

ListObjectsV2Iterable is lazy as well:

When the operation is called, an instance of this class is returned. At this point, no service calls are made yet and so there is no guarantee that the request is valid. As you iterate through the iterable, SDK will start lazily loading response pages by making service calls until there are no pages left or your iteration stops. If there are errors in your request, you will see the failures only after you start iterating through the iterable.

answered Sep 22 '22 06:09

madhead - StandWithUkraine

Related questions
                            
                                Redirect console output to string in Java
                            
                                Why should wait() always be called inside a loop
                            
                                How to run .jar file by double click on Windows 7 64-bit?
                            
                                Test class with a new() call in it with Mockito
                            
                                How to use SortedMap interface in Java?
                            
                                Extracting jar to specified directory
                            
                                What is Keystore?
                            
                                'finally block does not complete normally' Eclipse warning
                            
                                NoSuchFieldException when field exists
                            
                                What do the return values of Comparable.compareTo mean in Java?
                            
                                How to find index position of an element in a list when contains returns true
                            
                                Copy entire directory contents to another directory? [duplicate]
                            
                                Max limit of MultipartFile in Spring Boot
                            
                                Fastest way to set all values of an array?
                            
                                Java constructor style: check parameters aren't null
                            
                                Show soft keyboard for dialog
                            
                                InetAddress.getLocalHost() slow to run (30+ seconds)
                            
                                JPA mapping: "QuerySyntaxException: foobar is not mapped..."
                            
                                Java Reflection: How can I get the all getter methods of a java class and invoke them
                            
                                Reading and writing java.util.Date from Parcelable class

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to list all AWS S3 objects in a bucket using Java

Tags:

java

amazon-web-services

amazon-s3

Ron D.

People also ask

2 Answers

Ron D.

1.x

2.x

madhead - StandWithUkraine

Recent Activity

Donate For Us