I have the following timeseries list in python: <pre class="prettyprint"><code>list = [(datetime.datetime(2008, 7, 15, 15, 0), 0.134), (datetime.datetime(2008, 7, 15, 16, 0), 0.0), (datetime.datetime(2008, 7, 15, 17, 0), 0.0), (datetime.datetime(2008, 7, 15, 18, 0), 0.0), (datetime.datetime(2008, 7, 15, 19, 0), 0.0), (datetime.datetime(2008, 7, 15, 20, 0), 0.0), (datetime.datetime(2008, 7, 15, 21, 0), 0.0), (datetime.datetime(2008, 7, 15, 22, 0), 0.0), (datetime.datetime(2008, 7, 15, 23, 0), 0.0), (datetime.datetime(2008, 7, 16, 0, 0), 0.0)] </code></pre> This list is a key value pair where key is datetime and value is the one after that separated by comma. I want to create pandas series from keys (datetime) and values (decimal value). Anyone can help me to split the above list of time series value into two list (list1 and list2) so I can creare the pandas Series object for further analysis from the following code? <pre class="prettyprint"><code>import pandas as pd ts = pd.Series(list1, list2) </code></pre>

<pre class="prettyprint"><code>In [34]: pd.Series(*zip(*((b,a) for a,b in data))) Out[34]: 2008-07-15 15:00:00 0.134 2008-07-15 16:00:00 0.000 2008-07-15 17:00:00 0.000 2008-07-15 18:00:00 0.000 2008-07-15 19:00:00 0.000 2008-07-15 20:00:00 0.000 2008-07-15 21:00:00 0.000 2008-07-15 22:00:00 0.000 2008-07-15 23:00:00 0.000 2008-07-16 00:00:00 0.000 dtype: float64 </code></pre> Or, eschewing the insane desire to make one-liners: <pre class="prettyprint"><code>dates, vals = zip(*data) s = pd.Series(vals, index=dates) </code></pre> If the data is extremely long, you can avoid creating the intermediate tuples by using itertools.izip: <pre class="prettyprint"><code>import itertools as IT dates, vals = IT.izip(*data) s = pd.Series(vals, index=dates) </code></pre>

python key value list to panda series

Tags:

python

pandas

numpy

data-analysis

time-series

I have the following timeseries list in python:

Click to copy

list = [(datetime.datetime(2008, 7, 15, 15, 0), 0.134),
    (datetime.datetime(2008, 7, 15, 16, 0), 0.0),
    (datetime.datetime(2008, 7, 15, 17, 0), 0.0),
    (datetime.datetime(2008, 7, 15, 18, 0), 0.0),
    (datetime.datetime(2008, 7, 15, 19, 0), 0.0),
    (datetime.datetime(2008, 7, 15, 20, 0), 0.0),
    (datetime.datetime(2008, 7, 15, 21, 0), 0.0),
    (datetime.datetime(2008, 7, 15, 22, 0), 0.0),
    (datetime.datetime(2008, 7, 15, 23, 0), 0.0),
    (datetime.datetime(2008, 7, 16, 0, 0), 0.0)]

This list is a key value pair where key is datetime and value is the one after that separated by comma. I want to create pandas series from keys (datetime) and values (decimal value). Anyone can help me to split the above list of time series value into two list (list1 and list2) so I can creare the pandas Series object for further analysis from the following code?

Click to copy

import pandas as pd
ts = pd.Series(list1, list2)

969

asked Apr 30 '14 13:04

Adds

2 Answers

Click to copy

In [34]: pd.Series(*zip(*((b,a) for a,b in data)))
Out[34]: 
2008-07-15 15:00:00    0.134
2008-07-15 16:00:00    0.000
2008-07-15 17:00:00    0.000
2008-07-15 18:00:00    0.000
2008-07-15 19:00:00    0.000
2008-07-15 20:00:00    0.000
2008-07-15 21:00:00    0.000
2008-07-15 22:00:00    0.000
2008-07-15 23:00:00    0.000
2008-07-16 00:00:00    0.000
dtype: float64

Or, eschewing the insane desire to make one-liners:

Click to copy

dates, vals = zip(*data)
s = pd.Series(vals, index=dates)

If the data is extremely long, you can avoid creating the intermediate tuples by using itertools.izip:

Click to copy

import itertools as IT
dates, vals = IT.izip(*data)
s = pd.Series(vals, index=dates)

answered Oct 12 '22 03:10

unutbu

You can use zip and splat to unpack your arguments as below.

Click to copy

import pandas as pd

my_list = [(datetime.datetime(2008, 7, 15, 15, 0), 0.134), 
        (datetime.datetime(2008, 7, 15, 16, 0), 0.0), 
        (datetime.datetime(2008, 7, 15, 17, 0), 0.0), 
        (datetime.datetime(2008, 7, 15, 18, 0), 0.0), 
        (datetime.datetime(2008, 7, 15, 19, 0), 0.0), 
        (datetime.datetime(2008, 7, 15, 20, 0), 0.0), 
        (datetime.datetime(2008, 7, 15, 21, 0), 0.0), 
        (datetime.datetime(2008, 7, 15, 22, 0), 0.0), 
        (datetime.datetime(2008, 7, 15, 23, 0), 0.0), 
        (datetime.datetime(2008, 7, 16, 0, 0), 0.0)]

ts = pd.Series(zip(*my_list))

zip(*my_list) effectively creates two tuples out of your data, one is a tuple of your datetime objects, one is your values. These two are then passed as the arguments to pd.Series.

answered Oct 12 '22 02:10

Ffisegydd

Related questions
                            
                                split a string at a certain index
                            
                                SQLAlchemy: how to perform regexp_replace on column values
                            
                                Most efficient way (time and space wise) to send binary data in response
                            
                                How to send Autobahn/Twisted WAMP message from outside of protocol?
                            
                                Serving resource to QWebView of PyQT5
                            
                                PySerial: how to understand that the timeout occured while reading from serial port?
                            
                                wrapping subsections of text with tags in BeautifulSoup
                            
                                Choosing elements from python list based on probability
                            
                                Renaming a table in pandas hdfstore
                            
                                Metaprogramming in Python - adding an object method
                            
                                how to extract the designated div table data in lxml?
                            
                                merge two integer variables in a single float in python
                            
                                Configure module logger to flask app logger
                            
                                Mongoengine: How to sort Embedded Document list by Embedded document field
                            
                                sqlalchemy event on column update
                            
                                Adding or removing specific rows or columns in an h5py dataset
                            
                                How to remove single pixels on the borders of a blob?
                            
                                make python @property handle +=, -= etc
                            
                                Django's escapejs filter and XSS
                            
                                Saving in memory file object with pillow

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

python key value list to panda series

Tags:

python

pandas

numpy

data-analysis

time-series

Adds

People also ask

2 Answers

unutbu

Ffisegydd

Recent Activity

Donate For Us