pandas map column data based on value from another column using if to determine which dict to use

Tags:

pandas

I have the following dataframe:

df = pd.DataFrame([['Person1', 'CT', 2017],
               ['Person2', 'FL', 2017],
               ['Person3', 'TX', 2017],
              ['Person1', 'TX', 2016]], columns=['Name', 'State', 'Year'])

And two mapping tables below:

state_map = {'CT': 'Connecticut', 'FL': 'Florida', 'TX':'Texas'}
state_map2 = {'CT': 'ABC-CT', 'FL': 'BBC-Florida', 'TX':'CDA-TX'}

Here is what the data looks like:

    Name    State   Year
0   Person1   CT    2017
1   Person2   FL    2017
2   Person3   TX    2017
3   Person1   TX    2016

I would like to find a way to add a new column with values mapped using an if condition that determines whether to use values mapped from state_map or state_map2. So if df[df['Name']=='Person1'] then use state_map else use state_map2.

The final output should look like this:

    Name    State   Year   New_State_Name
0   Person1   CT    2017   Connecticut
1   Person2   FL    2017   BBC-Florida
2   Person3   TX    2017   CDA-TX
3   Person1   TX    2016   Texas

I tried the following code but it didn't work.

df['New_State_Name'] = [state_map[x] if df[df['Name'] == 'Person1'] else 
state_map2[x] for x in df['State']]

I got an error that says:

ValueError: The truth value of a DataFrame is ambiguous. Use a.empty, 
a.bool(), a.item(), a.any() or a.all().

326

asked Jul 26 '17 17:07

Tony

1 Answers

Use np.where:

df['New_State_Name'] = np.where(df['Name']=='Person1',df['State'].map(state_map),df['State'].map(state_map2))

Output:

      Name State  Year New_State_Name
0  Person1    CT  2017    Connecticut
1  Person2    FL  2017    BBC-Florida
2  Person3    TX  2017         CDA-TX
3  Person1    TX  2016          Texas

answered Oct 21 '22 22:10

Scott Boston

Related questions
                            
                                Cannot find a file in my tempfile.TemporaryDirectory() for Python3
                            
                                Collecting results from python coroutines before loop finishes
                            
                                One line solution for editing a numpy array of counts? (python)
                            
                                Fetch data from form and display in template
                            
                                Python: How to update a value in Google BigQuery in less than 40 seconds?
                            
                                Python .loc confusion
                            
                                Maxvalue in cv2.minMaxLoc()?
                            
                                Handle 1000 concurrent requests for Flask/Gunicorn web service
                            
                                Iterating over all notes in Music21
                            
                                Fill a matrix from a matrix of indices
                            
                                Python define function inside if block or vice versa
                            
                                Python: interpolating in a triangular mesh
                            
                                Formatting an entire pandas dataframe as a string, row by row
                            
                                python pandas pivot: How to do a proper tidyr-like spread?
                            
                                How to pipe Picamera video to FFMPEG with subprocess (Python)
                            
                                Intersection of sets as columns in pandas
                            
                                Flask Unit Testing and not understanding my fix for "TypeError: a bytes-like object is required, not 'str'"
                            
                                Merge two lists of dicts of different lengths using a single key in Python
                            
                                Tkinter Scale slider with float values doesn't work with locale of language that uses comma for floats
                            
                                What are noisy samples in Scikit's DBSCAN clustering algorithm?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With