Python Pandas ValueError Arrays Must be All Same Length

Tags:

python

pandas

My script iterates over a big list of .mp3 links, reads the metadata tags, and saves them to an Excel file. It fails with the error below. I'd appreciate any help. Thanks.

    #print is_connected();

    # Create a Pandas dataframe from the data.
    df = pd.DataFrame({'Links' : lines ,'Titles' : titles , 'Singers': finalsingers , 'Albums':finalalbums , 'Years' : years})

    # Create a Pandas Excel writer using XlsxWriter as the engine.
    writer = pd.ExcelWriter(xlspath, engine='xlsxwriter')

    # Convert the dataframe to an XlsxWriter Excel object.
    df.to_excel(writer, sheet_name='Sheet1')
    #df.to_excel(writer, sheet_name='Sheet1')

    # Close the Pandas Excel writer and output the Excel file.
    writer.save()

    Traceback (most recent call last):
      File "mp.py", line 87, in <module>
        df = pd.DataFrame({'Links' : lines ,'Titles' : titles , 'Singers': finalsingers , 'Albums':finalalbums , 'Years' : years})
      File "C:\Python27\lib\site-packages\pandas\core\frame.py", line 266, in __init__
        mgr = self._init_dict(data, index, columns, dtype=dtype)
      File "C:\Python27\lib\site-packages\pandas\core\frame.py", line 402, in _init_dict
        return _arrays_to_mgr(arrays, data_names, index, columns, dtype=dtype)
      File "C:\Python27\lib\site-packages\pandas\core\frame.py", line 5409, in _arrays_to_mgr
        index = extract_index(arrays)
      File "C:\Python27\lib\site-packages\pandas\core\frame.py", line 5457, in extract_index
        raise ValueError('arrays must all be same length')
    ValueError: arrays must all be same length

306

asked Nov 05 '16 18:11

Blue Island

2 Answers

You can avoid the error by building the DataFrame like this:

    a = {'Links' : lines ,'Titles' : titles , 'Singers': finalsingers , 'Albums':finalalbums , 'Years' : years}
    df = pd.DataFrame.from_dict(a, orient='index')
    df = df.transpose()

Explanation:

This builds the DataFrame with each key (e.g. 'Links') as a row. Laid out this way, a shorter list is just a row with some missing column values, which is no problem for pandas (only columns of unequal length lead to a ValueError during creation). After that you transpose the DataFrame (flip the axes) so the rows become columns, which gives you the DataFrame you initially wanted.
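To make the effect concrete, here is a minimal sketch of the same approach. The sample lists are made up for illustration (they are not the asker's data), and 'Years' is deliberately one item short:

```python
import pandas as pd

# Hypothetical sample data standing in for the question's lists;
# 'Years' is deliberately one item shorter than the others.
data = {
    'Links': ['a.mp3', 'b.mp3', 'c.mp3'],
    'Titles': ['Song A', 'Song B', 'Song C'],
    'Years': ['2001', '2002'],
}

# Each key becomes a row; shorter rows are padded with missing values.
df = pd.DataFrame.from_dict(data, orient='index')

# Flip the axes so that each key becomes a column again.
df = df.transpose()

print(df)
```

Note that the padding goes at the end of the shorter list. If a tag was actually missing for a link in the middle of the list, the values after it will sit one row too high, so this silences the error but does not by itself realign the data.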

194

answered Oct 10 '22 23:10

Vivek Srinivasan

It's telling you that the arrays (lines, titles, finalsingers, etc.) are not all the same length. You can check this with:

    print(len(lines), len(titles), len(finalsingers)) # Print all of them out here

This will show you which list is malformed. You'll then need to investigate why it has the wrong length and decide on the right way to correct it.
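If there are many lists, the same check can be done in one pass over a dict. This is only a sketch with made-up sample data, and it assumes the longest list has the expected length, which may not hold for your data:

```python
# Hypothetical sample data standing in for the question's lists.
columns = {
    'Links': ['a.mp3', 'b.mp3', 'c.mp3'],
    'Titles': ['Song A', 'Song B', 'Song C'],
    'Years': ['2001', '2002'],
}

# Map each column name to the length of its list.
lengths = {name: len(values) for name, values in columns.items()}

# Assumption: the longest list has the expected length.
expected = max(lengths.values())

# Keep only the columns whose length differs from the expected one.
mismatched = {name: n for name, n in lengths.items() if n != expected}

print(lengths)
print(mismatched)
```

Any name that shows up in `mismatched` points at the list to investigate before building the DataFrame.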

answered Oct 10 '22 22:10

kypalmer
