Overlapping keys in dictionary when Using .replace() method on pandas dataframe

Tags:

python

pandas

I want to replace some values in a column of a dataframe using a dictionary that maps the old codes to the new codes.

di = dict( { "myVar": {11:0, 204:11} } )
mydata.replace( to_replace = di, inplace = True )

But some of the new codes and old codes overlap. When using the .replace method of the dataframe I encounter the error 'Replacement not allowed with overlapping keys and values'

My current workaround is to replace the offending keys manually and then apply the dictionary to the remaining non-overlapping cases.

mydata.loc[ mydata.myVar == 11, "myVar" ] = 0 
di = dict( { "myVar": {204:11} } )
mydata.replace( to_replace = di, inplace = True )

Is there a more compact way to do this?

asked Feb 23 '17 20:02

Nirvan

1 Answer

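I found an answer here that uses the .map method on a series in conjunction with a dictionary. Because map looks up each original value exactly once, overlapping keys and values are not a problem. Here is an example with a recoding dictionary whose keys and values overlap.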

>>> import pandas as pd
>>> df = pd.DataFrame( [1,2,3,4,1], columns = ['Var'] )
>>> df
   Var
0    1
1    2
2    3
3    4
4    1
>>> recode = {1:2, 2:3, 3:1, 4:3}  # named recode to avoid shadowing the built-in dict
>>> df.Var.map( recode )
0    2
1    3
2    1
3    3
4    2
Name: Var, dtype: int64

UPDATE:

With map, every value in the original series must have an entry in the dictionary. Values missing from the dictionary are mapped to NaN, which also upcasts the integer column to float64.

>>> df = pd.DataFrame( [1,2,3,4,1], columns = ['Var'] )
>>> recode = {1:2, 2:3, 3:1}  # 4 is deliberately left out of the mapping
>>> df.Var.map( recode )
0    2.0
1    3.0
2    1.0
3    NaN
4    2.0
Name: Var, dtype: float64
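
If the goal is to recode only some values and leave the rest untouched, as in the question, one possible approach is to fall back to the original values wherever map returns NaN. The following is only a sketch: the sample values in mydata are made up for illustration, and it assumes the column has no genuine missing values (otherwise the cast back to the integer dtype would fail).

```python
import pandas as pd

# Hypothetical data modelled on the question: only 11 and 204 should be recoded.
mydata = pd.DataFrame({"myVar": [11, 204, 5, 11, 7]})
recode = {11: 0, 204: 11}

original = mydata["myVar"]

# map() looks up each original value once, so 204 -> 11 is not recoded again to 0.
mapped = original.map(recode)

# Values missing from the dictionary come back as NaN; fill them with the
# original values, then restore the integer dtype lost to the NaN upcast.
mydata["myVar"] = mapped.fillna(original).astype(original.dtype)

print(mydata["myVar"].tolist())

# Alternative with the same result: let the dictionary lookup default to the value itself.
alternative = original.map(lambda v: recode.get(v, v))
print(alternative.tolist())
```

The fillna version stays vectorised, while the lambda version avoids the detour through float64 at the cost of a Python-level call per element.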

answered Oct 04 '22 14:10

Nirvan
