 

Read only particular JSON files from S3 buckets across multiple folders

I am trying to loop over all buckets in S3, check whether a prefix matches, and if it does, go into those folders and read the JSON files.

I have tried to get the folders that contain the prefix, but I am failing to enter them.

Code:

import boto3

bucket = ['test-eob', 'test-eob-images']
client = boto3.client('s3')
for i in bucket:
    result = client.list_objects(Bucket=i, Prefix='PROCESSED_BY/FILE_JSON', Delimiter='/')
    print(result)

Using this I get the results for buckets that have the prefix, but it fails when a bucket doesn't have that prefix.

The structure of test-eob is test-eob/PROCESSED_BY/FILE_JSON/*.json. I have to read the JSON only if my prefix matches; otherwise, move on to the next bucket.

Can anyone help me out here?

asked Jun 02 '20 by pylearner




1 Answer

Try to catch the error (is it a KeyError?) when the bucket does not contain the prefix.

For example:

for i in bucket:
    try:
        result = client.list_objects(Bucket=i, Prefix='PROCESSED_BY/FILE_JSON', Delimiter='/')
        print(result)
    except KeyError:
        # skip this bucket if the lookup raises a KeyError
        pass
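Note that with Delimiter='/', keys living under PROCESSED_BY/FILE_JSON/ typically come back grouped under CommonPrefixes rather than Contents. A minimal exception-free sketch, assuming the same client and bucket list, and dropping the Delimiter so the matching objects themselves are listed:

for i in bucket:
    # without a Delimiter, every key under the prefix is returned in 'Contents'
    result = client.list_objects(Bucket=i, Prefix='PROCESSED_BY/FILE_JSON')
    if 'Contents' in result:  # present only when at least one key matched
        for obj in result['Contents']:
            print(obj['Key'])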

To read the JSON, there are several ways; for example, with json.loads() from the json module.

So for each object in the bucket:

import json

s3 = boto3.resource('s3')  # the resource API, not the client used earlier
content_object = s3.Object(bucket_name, file_name)
file_content = content_object.get()['Body'].read().decode('utf-8')
json_content = json.loads(file_content)
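
Putting the two parts together for the original question, a rough end-to-end sketch (assuming the bucket names and prefix from the question, dropping the Delimiter so the objects themselves are listed, and filtering keys on the .json suffix) could look like this:

import json
import boto3

buckets = ['test-eob', 'test-eob-images']
prefix = 'PROCESSED_BY/FILE_JSON/'

client = boto3.client('s3')
s3 = boto3.resource('s3')

for name in buckets:
    result = client.list_objects(Bucket=name, Prefix=prefix)
    # 'Contents' is missing when a bucket has nothing under the prefix
    for obj in result.get('Contents', []):
        if not obj['Key'].endswith('.json'):
            continue
        body = s3.Object(name, obj['Key']).get()['Body'].read().decode('utf-8')
        print(name, obj['Key'], json.loads(body))

Note that list_objects returns at most 1,000 keys per call; for larger folders you would want list_objects_v2 with a paginator.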
answered Nov 14 '22 by Adi Dembak