I'm working on a social media sentiment analysis project for a class. I have collected all of the tweets about the Kentucky Derby over a two-month period and saved them into .pkl files.
My question is: how do I load all of these pickle dump files into a single DataFrame?
Here is my code:
import sklearn as sk
import pandas as pd
import got3
import pickle as pkl
from datetime import date, timedelta

def daterange(start_date, end_date):
    for n in range(int((end_date - start_date).days)):
        yield start_date + timedelta(n)

start_date = date(2016, 3, 31)
end_date = date(2016, 6, 1)

dates = []
for single_date in daterange(start_date, end_date):
    dates.append(single_date.strftime("%Y-%m-%d"))

for i in range(len(dates) - 1):
    this_date = dates[i]
    tomorrow_date = dates[i + 1]
    print("Getting tweets for " + tomorrow_date)
    tweetCriteria = got3.manager.TweetCriteria()
    # Combine both terms in one query string; calling setQuerySearch twice
    # just overwrites the first term.
    tweetCriteria.setQuerySearch("Kentucky Derby OR KYDerby")
    tweetCriteria.setSince(this_date)
    tweetCriteria.setUntil(tomorrow_date)
    Kentucky_Derby_tweets = got3.manager.TweetManager.getTweets(tweetCriteria)
    pkl.dump(Kentucky_Derby_tweets, open(tomorrow_date + ".pkl", "wb"))
To retrieve pickled data, use pickle.load(). Its primary argument is a file object, which you get by opening the file in read-binary ('rb') mode: the r stands for read mode and the b for binary mode, mirroring the 'wb' mode you used when dumping. As a side note, reading a pickle back is typically much faster than parsing an equivalent CSV; compressing the pickle adds noticeable read/write overhead and does not always save much disk space.
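For example, a minimal sketch of reading one daily file back (the file name here is hypothetical, assuming the YYYY-MM-DD.pkl pattern used in your script):

import pickle as pkl

# "2016-05-07.pkl" is just an illustrative name following the pattern above
with open("2016-05-07.pkl", "rb") as f:   # rb = read binary
    tweets = pkl.load(f)                  # returns whatever object was dumped
print(len(tweets))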
You can use
pd.read_pickle(filename)
pd.concat(thelist)
to load each file and stitch the pieces together. Since each of your files holds a list of got3 tweet objects rather than a DataFrame, convert each list to a DataFrame first and then concatenate the results.
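A sketch of the whole loading step, assuming every daily .pkl file sits in the working directory and holds a list of plain got3 Tweet objects (vars() is used so the attribute names don't have to be hard-coded):

import glob
import pandas as pd

frames = []
for filename in sorted(glob.glob("*.pkl")):
    tweets = pd.read_pickle(filename)      # the list that was dumped for that day
    # vars(t) turns each Tweet object's attributes into a dict,
    # giving one DataFrame per daily file
    frames.append(pd.DataFrame([vars(t) for t in tweets]))

all_tweets = pd.concat(frames, ignore_index=True)
print(all_tweets.shape)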