IOError in Boto3 download_file

Tags:

amazon-s3

python-2.7

aws-lambda

boto3

Background

I am using the following Boto3 code to download a file from S3.

for record in event['Records']:
    bucket = record['s3']['bucket']['name']
    key = record['s3']['object']['key']
    print(key)
    if key.find('/') < 0:
        # File is uploaded outside any folder
        if len(key) > 4 and key[-5:].lower() == '.json':
            download_path = '/tmp/{}{}'.format(uuid.uuid4(), key)
    else:
        # File is uploaded inside a folder
        download_path = '/tmp/{}/{}'.format(uuid.uuid4(), key)

When a new file is uploaded to the S3 bucket, this code is triggered and downloads the newly uploaded file.

This works fine when the file is uploaded outside any folder.

However, when I upload a file inside a directory, an IOError occurs. Here is the error I am encountering:

[Errno 2] No such file or directory: /tmp/316bbe85-fa21-463b-b965-9c12b0327f5d/test1/customer1.json.586ea9b8: IOError

test1 is the directory inside my S3 bucket where customer1.json is uploaded.

Query

Any thoughts on how to resolve this error?

asked Sep 19 '16 09:09

Rohan

2 Answers

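The error is raised because you are trying to save the downloaded file into a local directory that does not exist. Create the directory before downloading the file. Because the key contains a folder prefix (test1/), creating the UUID directory alone is not enough: os.mkdir creates only a single level, whereas os.makedirs creates every missing level of the path.

# ...
else:
    item_uuid = str(uuid.uuid4())
    download_path = '/tmp/{}/{}'.format(item_uuid, key)  # File is uploaded inside a folder
    os.makedirs(os.path.dirname(download_path))  # creates /tmp/<uuid>/test1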

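Note: It is better to use os.path.join() when working with file system paths. Note that it takes the path components as separate arguments, not as a list. The code above could be rewritten as:

# ...
else:
    item_uuid = str(uuid.uuid4())
    download_path = os.path.join('/tmp', item_uuid, key)
    os.makedirs(os.path.dirname(download_path))  # also creates the test1 level taken from the key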
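
As a minimal sketch of the directory-creation step, the helper below builds a unique local path for a key and creates any missing parent directories. The names `build_download_path` and `base_dir` are hypothetical and only serve this illustration; the demo uses a temporary directory instead of /tmp so it can run anywhere.

```python
import os
import tempfile
import uuid


def build_download_path(key, base_dir='/tmp'):
    """Return a unique local path for an S3 key and create its parent directories.

    Hypothetical helper for illustration; not part of Boto3.
    """
    # A fresh UUID directory keeps downloads of same-named keys from colliding.
    download_path = os.path.join(base_dir, str(uuid.uuid4()), *key.split('/'))
    parent_dir = os.path.dirname(download_path)
    # os.makedirs creates every missing level, e.g. <base_dir>/<uuid>/test1.
    # The isdir check keeps this compatible with Python 2.7 (no exist_ok).
    if not os.path.isdir(parent_dir):
        os.makedirs(parent_dir)
    return download_path


# Demo: the parent directory exists afterwards, the file itself does not yet.
demo_base = tempfile.mkdtemp()
path = build_download_path('test1/customer1.json', base_dir=demo_base)
print(os.path.isdir(os.path.dirname(path)))
```

The returned path can then be passed as the local file name when downloading, since the directory it points into already exists.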

The error may also be raised if you include '/tmp/' in the key used for the S3 object. Do not include the tmp folder there, since it most likely does not exist in the bucket. The following articles may help you check your approach:

Amazon S3 upload and download using Python/Django
Python s3 examples
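
To make the distinction between the S3 key and the local path concrete, here is a hedged sketch of handling one event record. The function name `download_record` and the `base_dir` parameter are hypothetical; `s3_client` is assumed to be a Boto3 S3 client (for example `boto3.client('s3')`), whose `download_file` method takes the bucket, the key, and the local file name in that order.

```python
import os
import uuid


def download_record(s3_client, record, base_dir='/tmp'):
    """Download the object named in one S3 event record and return the local path.

    Hypothetical helper for illustration; s3_client is assumed to be a Boto3 S3 client.
    """
    bucket = record['s3']['bucket']['name']
    key = record['s3']['object']['key']  # name inside the bucket, e.g. 'test1/customer1.json'

    # Only the local path lives under base_dir; the key is passed to S3 unchanged.
    local_path = os.path.join(base_dir, str(uuid.uuid4()), *key.split('/'))
    parent_dir = os.path.dirname(local_path)
    if not os.path.isdir(parent_dir):
        os.makedirs(parent_dir)  # creates <base_dir>/<uuid>/test1 as needed

    s3_client.download_file(bucket, key, local_path)
    return local_path
```

Inside the Lambda handler this would be called once per entry in event['Records'].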

answered Oct 13 '22 18:10

Andriy Ivaneyko

I faced the same issue, and the error message caused a lot of confusion because of the random string appended to the file name. In my case it was caused by a directory in the download path that did not exist.

answered Oct 13 '22 20:10

Yankee
