I'm trying to write a pandas DataFrame as a pickle file to an S3 bucket in AWS. I know that I can write the DataFrame new_df
as a CSV to an S3 bucket as follows:
from io import StringIO

import boto3

bucket = 'mybucket'
key = 'path'

csv_buffer = StringIO()
s3_resource = boto3.resource('s3')
new_df.to_csv(csv_buffer, index=False)
s3_resource.Object(bucket, key).put(Body=csv_buffer.getvalue())
I've tried using the same code as above with to_pickle(), but with no success.
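Roughly what I tried (a sketch, with a toy frame standing in for my real data): reusing the CSV recipe with a text buffer. It fails with a TypeError, because pickle produces bytes while StringIO only accepts str.

from io import StringIO

import pandas as pd

new_df = pd.DataFrame({'a': [1, 2, 3]})  # stand-in for my actual DataFrame

pickle_buffer = StringIO()  # wrong buffer type for binary data
new_df.to_pickle(pickle_buffer)  # raises TypeError: pickle writes bytes, not str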
Further to your answer, you don't need to convert to CSV: the pickle.dumps method returns a bytes object. See here: https://docs.python.org/3/library/pickle.html
import pickle

import boto3

bucket = 'your_bucket_name'
key = 'your_pickle_filename.pkl'

pickle_byte_obj = pickle.dumps([var1, var2, ..., varn])

s3_resource = boto3.resource('s3')
s3_resource.Object(bucket, key).put(Body=pickle_byte_obj)
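A sketch of the reverse direction, assuming the same bucket and key as above: get() returns the stored object, and pickle.loads turns the body bytes back into the original value.

import pickle

import boto3

s3_resource = boto3.resource('s3')
# Body is a streaming response; read() yields the raw pickle bytes we put() above
body = s3_resource.Object(bucket, key).get()['Body'].read()
data = pickle.loads(body)  # the original [var1, var2, ..., varn] list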
I've found the solution: you need to use BytesIO for the buffer with pickle files instead of StringIO (which is for text formats like CSV).
import io

import boto3

# A binary buffer is required here, since pickle output is bytes, not text
pickle_buffer = io.BytesIO()
s3_resource = boto3.resource('s3')

# to_pickle accepts a file-like object in recent pandas versions
new_df.to_pickle(pickle_buffer)
s3_resource.Object(bucket, key).put(Body=pickle_buffer.getvalue())
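For completeness, a sketch of loading the DataFrame back out of S3, assuming the same bucket and key (pd.read_pickle accepts a file-like object in recent pandas versions):

import io

import boto3
import pandas as pd

s3_resource = boto3.resource('s3')
# Download the pickle bytes and hand them to pandas as a binary buffer
body = s3_resource.Object(bucket, key).get()['Body'].read()
df = pd.read_pickle(io.BytesIO(body))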