Alternative to nested np.where in Pandas DataFrame

Tags: python, python-3.x, pandas, dataframe, numpy

I have this code (which works): a set of nested conditional statements that sets the value in the 'paragenesis1' row of a dataframe (myOxides['cpx']), depending on the values in various other rows of the frame.

I'm very new to Python and to programming in general. I think I should write a function to do this, but how would I then apply that function elementwise? Nesting np.where is the only way I have found to avoid the 'truth value of a Series is ambiguous' error.

Any help greatly appreciated!

myOxides['cpx'].loc['paragenesis1'] = np.where(
    (cpxCrOx >= 0.5) & (cpxAlOx <= 4),
    "GtPeridA",
    np.where(
        (cpxCrOx >= 2.25) & (cpxAlOx <= 5),
        "GtPeridB",
        np.where(
            ((cpxCrOx >= 0.5) & (cpxCrOx <= 2.25)) &
            ((cpxAlOx >= 4) & (cpxAlOx <= 6)),
            "SpLhzA",
            np.where(
                ((cpxCrOx >= 0.5) &
                 (cpxCrOx <= (5.53125 - 0.546875 * cpxAlOx))) &
                ((cpxAlOx >= 4) &
                 (cpxAlOx <= ((cpxCrOx - 5.53125) / -0.546875))),
                "SpLhzB",
                "Eclogite, Megacryst, Cognate"))))

Or, in general form:

df.loc['a'] = np.where(
    (some_condition),
    "value",
    np.where(
        ((condition_1) & (condition_2)),
        "some_value",
        np.where(
            ((condition_3) & (condition_4)),
            "some_other_value",
            np.where(
                (condition_5),
                "another_value",
                "other_value"))))

asked Mar 13 '18 09:03

K. Mather

1 Answer

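One possible solution is to use numpy.select, which takes a list of boolean masks and a matching list of values and, for each element, picks the value belonging to the first mask that is True: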

import numpy as np

# One boolean mask per category, in priority order
m1 = (cpxCrOx >= 0.5) & (cpxAlOx <= 4)
m2 = (cpxCrOx >= 2.25) & (cpxAlOx <= 5)
m3 = ((cpxCrOx >= 0.5) & (cpxCrOx <= 2.25)) & ((cpxAlOx >= 4) & (cpxAlOx <= 6))
m4 = (((cpxCrOx >= 0.5) & (cpxCrOx <= (5.53125 - 0.546875 * cpxAlOx))) &
      ((cpxAlOx >= 4) & (cpxAlOx <= ((cpxCrOx - 5.53125) / -0.546875))))

vals = ["GtPeridA", "GtPeridB", "SpLhzA", "SpLhzB"]
default = 'Eclogite, Megacryst, Cognate'

# Masks are checked in order; elements matching no mask get the default
myOxides['paragenesis1'] = np.select([m1, m2, m3, m4], vals, default=default)
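
To try this without the original data, here is a minimal self-contained sketch. The column names `Cr2O3` and `Al2O3` and all sample values are made up for illustration (they are assumptions, not the asker's data), chosen so that each category is hit at least once. It also checks that np.select returns the same labels as the nested np.where, since both take the first condition that matches.

```python
import numpy as np
import pandas as pd

# Hypothetical sample values; column names and numbers are illustrative only.
df = pd.DataFrame({
    "Cr2O3": [1.0, 3.0, 1.0, 0.6, 0.1, 3.0],
    "Al2O3": [3.0, 4.5, 5.0, 7.0, 2.0, 3.0],
})
cr = df["Cr2O3"]
al = df["Al2O3"]

# Boolean masks in priority order (same thresholds as in the question).
m1 = (cr >= 0.5) & (al <= 4)
m2 = (cr >= 2.25) & (al <= 5)
m3 = (cr >= 0.5) & (cr <= 2.25) & (al >= 4) & (al <= 6)
m4 = ((cr >= 0.5) & (cr <= 5.53125 - 0.546875 * al) &
      (al >= 4) & (al <= (cr - 5.53125) / -0.546875))

vals = ["GtPeridA", "GtPeridB", "SpLhzA", "SpLhzB"]
default = "Eclogite, Megacryst, Cognate"

# np.select: the first True mask decides the label, otherwise the default.
df["paragenesis1"] = np.select([m1, m2, m3, m4], vals, default=default)

# Equivalent nested np.where, kept only to compare the two results.
nested = np.where(m1, vals[0],
         np.where(m2, vals[1],
         np.where(m3, vals[2],
         np.where(m4, vals[3], default))))

print(df)
print("same as nested np.where:", (nested == df["paragenesis1"].to_numpy()).all())
```

The last sample row satisfies both m1 and m2 and is labelled "GtPeridA", which shows that the order of the masks in the list matters.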

answered Sep 19 '22 06:09

jezrael
