Some joker made a Lotus database/applet thingy for tracking engineering issues in our company. The joke is that the key piece of information was named with a special character... a number sign (hash tag, pound sign, \u0023). abbreviated sample: <pre class="prettyprint"><code>KA# Issue Date Current Position 27144 1/9/2014 Accounting 27194 12/20/2012 Engineering 32474 4/21/2008 Engineering 32623-HOLD 4/25/2016 Engineering 32745 11/13/2012 SEPE 32812 10/30/2013 Engineering 32817 12/7/2012 Purchasing 32839 1/8/2013 SEPE </code></pre> I output this table (4K rows, 15 columns) to a csv file and process in python3 as a pandas dataframe. I generate various outputs. If I use something like: <pre class="prettyprint"><code>df.iloc[:,[0,3,1,8,9,10]] </code></pre> I get appropriate output and the key column shows up as <code>"KA#"</code>. (When I say "key column", I mean "most important"... NOT "index". I keep a serial index) Unfortunately, people sometimes mess with the column order in Lotus between my exports to csv so I can not guarantee that <code>"KA#"</code> will be any particular column number. I would like to use column names: <pre class="prettyprint"><code>df.loc[:,["KA#","Issue Date","Current Position"]] </code></pre> But the <code>"KA#"</code> column is filled with NaN's. Thanks for any help you can offer. Finally, if I try to rename <code>"KA#"</code> to simply <code>"KA"</code>: <pre class="prettyprint"><code>df['KA#'].name = 'KA' </code></pre> throws a KeyError and <pre class="prettyprint"><code>df = df.rename(columns={"KA#": "ka"}) </code></pre> is completely ignored. The column shows up as <code>"KA#"</code>. Can anyone think of a way to get rid of or handle that symbol? I'd even settle for a regex at this point.

use str.replace: <code>df.columns=df.columns.str.replace('#','')</code> You can check this in the documentation.

pandas dataframe column name: remove special character

Tags:

Some joker made a Lotus database/applet thingy for tracking engineering issues in our company. The joke is that the key piece of information was named with a special character... a number sign (hash tag, pound sign, \u0023).

abbreviated sample:

KA#         Issue Date      Current Position
27144       1/9/2014        Accounting
27194       12/20/2012      Engineering
32474       4/21/2008       Engineering
32623-HOLD  4/25/2016       Engineering
32745       11/13/2012      SEPE
32812       10/30/2013      Engineering
32817       12/7/2012       Purchasing
32839       1/8/2013        SEPE

I output this table (4K rows, 15 columns) to a csv file and process in python3 as a pandas dataframe.

I generate various outputs. If I use something like:

df.iloc[:,[0,3,1,8,9,10]]

I get appropriate output and the key column shows up as "KA#". (When I say "key column", I mean "most important"... NOT "index". I keep a serial index)

Unfortunately, people sometimes mess with the column order in Lotus between my exports to csv so I can not guarantee that "KA#" will be any particular column number. I would like to use column names:

df.loc[:,["KA#","Issue Date","Current Position"]]

But the "KA#" column is filled with NaN's.

Thanks for any help you can offer.

Finally, if I try to rename "KA#" to simply "KA":

df['KA#'].name = 'KA'

throws a KeyError and

df = df.rename(columns={"KA#": "ka"})

is completely ignored. The column shows up as "KA#".

Can anyone think of a way to get rid of or handle that symbol? I'd even settle for a regex at this point.

568

asked Jun 21 '16 19:06

Paul Podbielski

1 Answers

use str.replace:
df.columns=df.columns.str.replace('#','')

You can check this in the documentation.

answered Sep 27 '22 23:09

shivsn

Related questions
                            
                                Add column to a sparse matrix
                            
                                Pandas DataFrame: set_index with inplace=True returns a NoneType, why?
                            
                                Type annotation for boto3 resources like DynamoDB.Table
                            
                                Convert 1d array to lower triangular matrix
                            
                                Dlib installation error? [duplicate]
                            
                                $PYTHONSTARTUP with python 2.7 and python 3.2
                            
                                What are some rules of thumb for deciding between __get__, __getattr__, and __getattribute__?
                            
                                Sorting by absolute value without changing to absolute value
                            
                                Post unicode string to web service using Python Requests library
                            
                                Non-blocking file read
                            
                                Get length of Queue in Python's multiprocessing library
                            
                                How to split a DataFrame in pandas in predefined percentages?
                            
                                Permission System for Discord.py Bot
                            
                                concurrent.futures.ThreadPoolExecutor swallowing exceptions (Python 3.6)
                            
                                Pandas drop duplicates on elements made of lists
                            
                                TF2.0: Translation model: Error when restoring the saved model: Unresolved object in checkpoint (root).optimizer.iter: attributes
                            
                                How can I add python to cmd in windows [closed]
                            
                                Py3k: What's more pythonic - one import with commas or many imports?
                            
                                python 3 in emacs
                            
                                Cx-Freeze Error - Python 34

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

pandas dataframe column name: remove special character

Tags:

python-3.x

pandas

special-characters

Paul Podbielski

People also ask

1 Answers

shivsn

Recent Activity

Donate For Us