Python Beautifulsoup Find_all except

Tags:

I'm struggling to find a simple to solve this problem and hope you might be able to help.

I've been using Beautifulsoup's find all and trying some regex to find all the items except the 'emptyLine' line in the html below:

<div class="product_item0 ">...</div>
<div class="product_item1 ">...</div>
<div class="product_item2 ">...</div>
<div class="product_item0 ">...</div>
<div class="product_item1 ">...</div>
<div class="product_item2 ">...</div>
<div class="product_item0 ">...</div>
<div class="product_item1 last">...</div>
<div class="product_item2 emptyItem">...</div>

Is there a simple way to find all the items except one including the 'emptyItem'?

974

asked Jan 31 '16 15:01

blountdj

1 Answers

Just skip elements containing the emptyItem class. Working sample:

from bs4 import BeautifulSoup

data = """
<div>
    <div class="product_item0">test0</div>
    <div class="product_item1">test1</div>
    <div class="product_item2">test2</div>
    <div class="product_item2 emptyItem">empty</div>
</div>
"""

soup = BeautifulSoup(data, "html.parser")

for elm in soup.select("div[class^=product_item]"):
    if "emptyItem" in elm["class"]:  # skip elements having emptyItem class
        continue

    print(elm.get_text())

Prints:

test0
test1
test2

Note that the div[class^=product_item] is a CSS selector that would match all div elements with a class starting with product_item.

128

answered Sep 28 '22 03:09

alecxe

Related questions
                            
                                Using scipy.integrate.complex_ode instead of scipy.integrate.ode
                            
                                conda and pip not working at all
                            
                                Python image library - font positioning
                            
                                Can't seem to import scikit-learn's MLPRegressor
                            
                                nltk NgramModel error
                            
                                Calling python function from C as a callback. What is the right way to handle the GIL?
                            
                                Convert timestamp to rfc 3339 in Python [duplicate]
                            
                                Neural network generating incorrect results that are around the average of outputs
                            
                                How to reshape a vector to TensorFlow's filters?
                            
                                get subsection of df based on multiple conditions
                            
                                SQLAlchemy events are not working
                            
                                subclass str, and make new method with same effect as +=
                            
                                Python inheritance old style type in a new style class
                            
                                Python string.format() : formatting nans as 'some text'?
                            
                                Is there a complete list of key event names used by turtle-graphics?
                            
                                Having trouble removing headers when using pd.read_csv
                            
                                Instantiating object automatically adds to SQLAlchemy Session. Why?
                            
                                Numpy 3D array transposed when indexed in single step vs two steps
                            
                                How to use full-text search in sqlite3 database in django?
                            
                                How do I add buttons that are dynamically created in pure python to a kivy layout that is Written in Kivy Language?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Python Beautifulsoup Find_all except

Tags:

python

python-3.x

html-parsing

beautifulsoup

blountdj

People also ask

1 Answers

alecxe

Recent Activity

Donate For Us