How do I take multiple lists and put them as different columns in a python dataframe? I tried this solution but had some trouble. Attempt 1: <ul> <li>Have three lists, and zip them together and use that <code>res = zip(lst1,lst2,lst3)</code> </li> <li>Yields just one column</li> </ul> Attempt 2: <pre class="prettyprint"><code>percentile_list = pd.DataFrame({'lst1Tite' : [lst1], 'lst2Tite' : [lst2], 'lst3Tite' : [lst3] }, columns=['lst1Tite','lst1Tite', 'lst1Tite']) </code></pre> <ul> <li>yields either one row by 3 columns (the way above) or if I transpose it is 3 rows and 1 column</li> </ul> How do I get a 100 row (length of each independent list) by 3 column (three lists) pandas dataframe?

Adding one more scalable solution. <pre class="prettyprint"><code>lists = [lst1, lst2, lst3, lst4] df = pd.concat([pd.Series(x) for x in lists], axis=1) </code></pre>

Just adding that using the first approach it can be done as - <pre class="prettyprint"><code>pd.DataFrame(list(map(list, zip(lst1,lst2,lst3)))) </code></pre>

There are several ways to create a dataframe from multiple lists. <pre class="prettyprint"><code>list1=[1,2,3,4] list2=[5,6,7,8] list3=[9,10,11,12] </code></pre> <ol> <li> <code>pd.DataFrame({'list1':list1, 'list2':list2, 'list3'=list3})</code> </li> <li> <code>pd.DataFrame(data=zip(list1,list2,list3),columns=['list1','list2','list3'])</code> </li> </ol>

Adding to above answers, we can create on the fly <pre class="prettyprint"><code>df= pd.DataFrame() list1 = list(range(10)) list2 = list(range(10,20)) df['list1'] = list1 df['list2'] = list2 print(df) </code></pre> hope it helps !

Take multiple lists into dataframe

Tags:

python

pandas

numpy

How do I take multiple lists and put them as different columns in a python dataframe? I tried this solution but had some trouble.

Attempt 1:

Have three lists, and zip them together and use that res = zip(lst1,lst2,lst3)
Yields just one column

Attempt 2:

percentile_list = pd.DataFrame({'lst1Tite' : [lst1],
                                'lst2Tite' : [lst2],
                                'lst3Tite' : [lst3] }, 
                                columns=['lst1Tite','lst1Tite', 'lst1Tite'])

yields either one row by 3 columns (the way above) or if I transpose it is 3 rows and 1 column

How do I get a 100 row (length of each independent list) by 3 column (three lists) pandas dataframe?

377

asked May 29 '15 06:05

jfalkson

8 Answers

I think you're almost there, try removing the extra square brackets around the lst's (Also you don't need to specify the column names when you're creating a dataframe from a dict like this):

import pandas as pd
lst1 = range(100)
lst2 = range(100)
lst3 = range(100)
percentile_list = pd.DataFrame(
    {'lst1Title': lst1,
     'lst2Title': lst2,
     'lst3Title': lst3
    })

percentile_list
    lst1Title  lst2Title  lst3Title
0          0         0         0
1          1         1         1
2          2         2         2
3          3         3         3
4          4         4         4
5          5         5         5
6          6         6         6
...

If you need a more performant solution you can use np.column_stack rather than zip as in your first attempt, this has around a 2x speedup on the example here, however comes at bit of a cost of readability in my opinion:

import numpy as np
percentile_list = pd.DataFrame(np.column_stack([lst1, lst2, lst3]), 
                               columns=['lst1Title', 'lst2Title', 'lst3Title'])

answered Oct 08 '22 11:10

maxymoo

Adding to Aditya Guru's answer here. There is no need of using map. You can do it simply by:

pd.DataFrame(list(zip(lst1, lst2, lst3)))

This will set the column's names as 0,1,2. To set your own column names, you can pass the keyword argument columns to the method above.

pd.DataFrame(list(zip(lst1, lst2, lst3)),
              columns=['lst1_title','lst2_title', 'lst3_title'])

answered Oct 08 '22 10:10

Abhinav Gupta

Adding one more scalable solution.

lists = [lst1, lst2, lst3, lst4]
df = pd.concat([pd.Series(x) for x in lists], axis=1)

answered Oct 08 '22 11:10

oopsi

Just adding that using the first approach it can be done as -

pd.DataFrame(list(map(list, zip(lst1,lst2,lst3))))

answered Oct 08 '22 12:10

Aditya Guru

There are several ways to create a dataframe from multiple lists.

list1=[1,2,3,4]
list2=[5,6,7,8]
list3=[9,10,11,12]

pd.DataFrame({'list1':list1, 'list2':list2, 'list3'=list3})
pd.DataFrame(data=zip(list1,list2,list3),columns=['list1','list2','list3'])

answered Oct 08 '22 10:10

Reetesh Kumar

Adding to above answers, we can create on the fly

df= pd.DataFrame()
list1 = list(range(10))
list2 = list(range(10,20))
df['list1'] = list1
df['list2'] = list2
print(df)

hope it helps !

answered Oct 08 '22 10:10

Wickkiey

@oopsi used pd.concat() but didn't include the column names. You could do the following, which, unlike the first solution in the accepted answer, gives you control over the column order (avoids dicts, which are unordered):

import pandas as pd
lst1 = range(100)
lst2 = range(100)
lst3 = range(100)

s1=pd.Series(lst1,name='lst1Title')
s2=pd.Series(lst2,name='lst2Title')
s3=pd.Series(lst3 ,name='lst3Title')
percentile_list = pd.concat([s1,s2,s3], axis=1)

percentile_list
Out[2]: 
    lst1Title  lst2Title  lst3Title
0           0          0          0
1           1          1          1
2           2          2          2
3           3          3          3
4           4          4          4
5           5          5          5
6           6          6          6
7           7          7          7
8           8          8          8
...

answered Oct 08 '22 11:10

dabru

you can simple use this following code

train_data['labels']= train_data[["LABEL1","LABEL1","LABEL2","LABEL3","LABEL4","LABEL5","LABEL6","LABEL7"]].values.tolist()
train_df = pd.DataFrame(train_data, columns=['text','labels'])

answered Oct 08 '22 10:10

Shaina Raza

Related questions
                            
                                How do you decode Base64 data in Python?
                            
                                How to display pandas DataFrame of floats using a format string for columns?
                            
                                Converting integer to binary in python
                            
                                Passing HTML to template using Flask/Jinja2
                            
                                Is there any way to do HTTP PUT in python
                            
                                How to print a percentage value in python?
                            
                                Simpler way to create dictionary of separate variables?
                            
                                assertEquals vs. assertEqual in python
                            
                                Using property() on classmethods
                            
                                Class method decorator with self arguments?
                            
                                How can I check whether a numpy array is empty or not?
                            
                                Python Sets vs Lists
                            
                                Pythonic way to avoid "if x: return x" statements
                            
                                How are Pipfile and Pipfile.lock used?
                            
                                A clean, lightweight alternative to Python's twisted? [closed]
                            
                                np.mean() vs np.average() in Python NumPy?
                            
                                What are some good Python ORM solutions? [closed]
                            
                                Is there any difference between "foo is None" and "foo == None"?
                            
                                Java "Virtual Machine" vs. Python "Interpreter" parlance?
                            
                                How can I list the contents of a directory in Python?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Take multiple lists into dataframe

Tags:

python

pandas

numpy

jfalkson

People also ask

8 Answers

maxymoo

Abhinav Gupta

oopsi

Aditya Guru

Reetesh Kumar

Wickkiey

dabru

Shaina Raza

Recent Activity

Donate For Us