Creating a Pandas DataFrame from a Numpy array: How do I specify the index column and column headers?

I have a NumPy array built from a list of lists, representing a two-dimensional array with row labels and column names, as shown below:

import numpy as np
data = np.array([['','Col1','Col2'],['Row1',1,2],['Row2',3,4]])

I'd like the resulting DataFrame to have Row1 and Row2 as index values, and Col1 and Col2 as header values.

I can specify the index as follows:

df = pd.DataFrame(data,index=data[:,0])

However, I am unsure how best to assign the column headers.

842

asked Dec 24 '13 15:12

user3132783

1 Answer

You need to pass data, index and columns to the DataFrame constructor, as in:

>>> pd.DataFrame(data=data[1:,1:],    # values
...              index=data[1:,0],    # 1st column as index
...              columns=data[0,1:])  # 1st row as the column names

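Edit: as @joris notes in the comments, you may need to change the data argument above to np.int_(data[1:,1:]) to get the correct data type, because an array that mixes strings and integers stores every element as a string.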
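For completeness, here is a minimal end-to-end sketch of the approach above. It rebuilds the array from the question and uses .astype(int) for the type conversion, which is a substitution on my part for np.int_ and not something taken from the answer or the comment. The names values and df are only illustrative.

```python
import numpy as np
import pandas as pd

# The array from the question. Because it mixes strings and integers,
# NumPy stores every element as a string.
data = np.array([['', 'Col1', 'Col2'],
                 ['Row1', 1, 2],
                 ['Row2', 3, 4]])

# Slice off the label row and label column, then convert the remaining
# string values to integers (assumed alternative to np.int_).
values = data[1:, 1:].astype(int)

df = pd.DataFrame(data=values,          # numeric block
                  index=data[1:, 0],    # 1st column as index
                  columns=data[0, 1:])  # 1st row as the column names

print(df)
print(df.dtypes)
```

Without the conversion step, the DataFrame would hold the numbers as strings, so arithmetic on the columns would not behave as expected.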

100

answered Sep 22 '22 08:09

behzad.nouri

Tags:

python

pandas

numpy
