Highlight a column value based off another column value in pandas

Tags:

python

pandas

I have a function like this:

def highlight_otls(df):
    return ['background-color: yellow']

And a DataFrame like this:

price   outlier 
1.99       F,C
1.49       L,C
1.99         F
1.39         N

What I want to do is highlight one column in my df based on this condition on another column:

data['outlier'].str.split(',').str.len() >= 2

So if an entry in df['outlier'] contains 2 or more comma-separated values, I want to highlight the corresponding entry in df['price']. (So the first 2 prices should be highlighted in my dataframe above.)

I attempted to do this by doing the following which gives me an error:

data['price'].apply(lambda x: highlight_otls(x) if (x['outlier'].str.split(',').str.len()) >= 2, axis=1)

Any idea on how to do this the proper way?


asked Jun 06 '18 15:06

Hana

1 Answer

Use Styler.apply. (To write the result in xlsx format, use the Styler's to_excel method.)

Suppose one's dataset is

  other  price outlier
0     X   1.99     F,C
1     X   1.49     L,C
2     X   1.99       F
3     X   1.39       N

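def hightlight_price(row):
    # Start with an empty style string for every column in the row.
    ret = ["" for _ in row.index]
    # Two or more comma-separated outlier codes: highlight the price cell.
    if len(row.outlier.split(",")) >= 2:
        ret[row.index.get_loc("price")] = "background-color: yellow"
    return ret

# Apply the function row by row, then write the styled frame to Excel.
df.style.\
    apply(hightlight_price, axis=1).\
    to_excel('styled.xlsx', engine='openpyxl')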

From the documentation, "DataFrame.style attribute is a property that returns a Styler object."

We pass our styling function, hightlight_price, into Styler.apply and set axis=1 so that it is applied row by row. (Recall that we want to color the price cell in each row based on the outlier information in the same row.)

Our function hightlight_price generates the visual styling for each row. For each row, we first set the styling for the other, price, and outlier columns to ["", "", ""]. We then look up the position of the price entry in that list with row.index.get_loc("price") and modify only that entry, as in

ret[row.index.get_loc("price")] = "background-color: yellow"
# ret becomes ["", "background-color: yellow", ""]
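To see what the function hands back to Styler.apply, it can be called on each row by hand. The following is a minimal sketch that assumes the example dataset above; the names price_pos and row_styles are introduced here for illustration only and are not part of the original answer.

```python
import pandas as pd

# Rebuild the example dataset from the answer.
df = pd.DataFrame({
    "other": ["X", "X", "X", "X"],
    "price": [1.99, 1.49, 1.99, 1.39],
    "outlier": ["F,C", "L,C", "F", "N"],
})

def hightlight_price(row):
    # One style string per column; only the price entry may be filled in.
    ret = ["" for _ in row.index]
    if len(row.outlier.split(",")) >= 2:
        ret[row.index.get_loc("price")] = "background-color: yellow"
    return ret

# Position of "price" among the columns (illustrative helper variable).
price_pos = df.columns.get_loc("price")

# Call the styling function on every row, as Styler.apply does with axis=1.
row_styles = [hightlight_price(row) for _, row in df.iterrows()]
for label, styles in zip(df.index, row_styles):
    print(label, styles)
```

Each list has one entry per column, and only rows with two or more outlier codes carry a non-empty style at the price position.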

Results (screenshot of the styled output): https://i.stack.imgur.com/ICp19m.png
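As an alternative to the row-wise function, the vectorized condition from the question can be reused by styling the whole table at once with axis=None. This is only a sketch under the same example data, not the approach of the answer above; highlight_price_table and style_table are hypothetical names introduced here. With axis=None, Styler.apply expects the function to return a DataFrame of style strings with the same index and columns as the input.

```python
import pandas as pd

# Rebuild the example dataset from the answer.
df = pd.DataFrame({
    "other": ["X", "X", "X", "X"],
    "price": [1.99, 1.49, 1.99, 1.39],
    "outlier": ["F,C", "L,C", "F", "N"],
})

def highlight_price_table(data):
    # Table of empty style strings with the same labels as the data.
    styles = pd.DataFrame("", index=data.index, columns=data.columns)
    # The question's condition: two or more comma-separated outlier codes.
    mask = data["outlier"].str.split(",").str.len() >= 2
    styles.loc[mask, "price"] = "background-color: yellow"
    return styles

def style_table(frame):
    # Accessing .style needs the optional jinja2 dependency.
    return frame.style.apply(highlight_price_table, axis=None)

# Inspect the style table directly, without rendering anything.
styles = highlight_price_table(df)
print(styles)
```

Calling style_table(df).to_excel('styled.xlsx', engine='openpyxl') should then write the highlighted sheet in the same way as the row-wise version.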


answered Sep 18 '22 16:09

Tai
