I am using BeautifulSoup to look for user-entered strings on a specific page. For example, I want to see if the string 'Python' is located on the page: http://python.org When I used: <code>find_string = soup.body.findAll(text='Python')</code>, <code>find_string</code> returned <code>[]</code> But when I used: <code>find_string = soup.body.findAll(text=re.compile('Python'), limit=1)</code>, <code>find_string</code> returned <code>[u'Python Jobs']</code> as expected What is the difference between these two statements that makes the second statement work when there are more than one instances of the word to be searched?

The following line is looking for the exact NavigableString 'Python': <pre class="prettyprint"><code>>>> soup.body.findAll(text='Python') [] </code></pre> Note that the following NavigableString is found: <pre class="prettyprint"><code>>>> soup.body.findAll(text='Python Jobs') [u'Python Jobs'] </code></pre> Note this behaviour: <pre class="prettyprint"><code>>>> import re >>> soup.body.findAll(text=re.compile('^Python$')) [] </code></pre> So your regexp is looking for an occurrence of 'Python' not the exact match to the NavigableString 'Python'.

Using BeautifulSoup to search HTML for string

1 Answers

The following line is looking for the exact NavigableString 'Python':

>>> soup.body.findAll(text='Python') []

Note that the following NavigableString is found:

>>> soup.body.findAll(text='Python Jobs')  [u'Python Jobs']

Note this behaviour:

>>> import re >>> soup.body.findAll(text=re.compile('^Python$')) []

So your regexp is looking for an occurrence of 'Python' not the exact match to the NavigableString 'Python'.

189

answered Oct 10 '22 08:10

sgallen

Related questions
                            
                                Nested f-strings
                            
                                Rearrange columns of numpy 2D array
                            
                                Set legend symbol opacity with matplotlib?
                            
                                Python Sound ("Bell")
                            
                                Send log messages from all celery tasks to a single file
                            
                                python copy files by wildcards
                            
                                How to add if condition in a TensorFlow graph?
                            
                                logging remove / inspect / modify handlers configured by fileConfig()
                            
                                How do I use subprocess.Popen to connect multiple processes by pipes?
                            
                                How to decorate a method inside a class?
                            
                                Python - calendar.timegm() vs. time.mktime()
                            
                                Ansible creating a virtualenv
                            
                                How to evaluate environment variables into a string in Python?
                            
                                How to pickle a namedtuple instance correctly
                            
                                Continuous Integration System for a Python Codebase
                            
                                Binary buffer in Python
                            
                                What is the fastest way to parse large XML docs in Python?
                            
                                Anyone know of a good Python based web crawler that I could use?
                            
                                Private Variables and Methods in Python [duplicate]
                            
                                How to inherit and extend a list object in Python?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Using BeautifulSoup to search HTML for string

Tags:

python

beautifulsoup

kachilous

People also ask

1 Answers

sgallen

Recent Activity

Donate For Us