beautiful soup regex

Tags:

I just ran the following code in Python to take all of the certain emails out of an IMAP folder. The extraction part works fine and the BeautifulSoup part works okay, but the output has a lot of '\r' and '\n' within.

I tried to remove these with REGEX sub function but it's not working...not even giving an error message. Any idea what is wrong? I am attaching the code...please note (this is not complete code but everything above the code I'm posting works okay. It still prints the output, it's "prettified", but the \r and \n are still there. Have tried with find_all() but that doesn't work either.

Click to copy

mail.list()  # Lists all labels in GMail
mail.select('INBOX/Personal')  # Connected to inbox.

resp, items = mail.search(None, '(SEEN)')

items = items[0].split()  # getting the mails id        
for emailid in items:
    # getting the mail content
    resp, data = mail.fetch(emailid, '(UID BODY[TEXT])')
    text = str(data[0])  # [1] don't forget to add this back
    soup = bs(text, 'html.parser')
    soup = soup.prettify()
    soup = re.sub('\\r\\n', '', soup)

print(soup)

746

asked Apr 11 '18 08:04

Obie

2 Answers

You can use this for one line regex statement:

Click to copy

soup = re.sub('\\r*n*', '', soup)

or you can use this:

Click to copy

soup = re.sub('\\r', '', soup)
soup = re.sub('\\n', '', soup)

https://regexr.com/3nnp1

answered Nov 18 '22 13:11

MasOOd.KamYab

What about replace command directly? Since it is not regex, it should be faster.

Click to copy

soup.replace("\n","").replace("\r","")

answered Nov 18 '22 12:11

silgon

Related questions
                            
                                discord.py embed with locally saved images
                            
                                How to install a wheel-style package using setup.py
                            
                                Keras : Why does Sequential and Model give different outputs?
                            
                                Odd TypeError from the airflow scheduler -- has usage of @once for scheduler interval changed in v1.9?
                            
                                How do I copy the contents of a word document?
                            
                                How to get stdout and stderr from a tmux session?
                            
                                Sort python dictionary keys based on sub-dictionary keys by defining sorting order
                            
                                Converting Tensor to np.array using K.eval() in Keras returns InvalidArgumentError
                            
                                Time complexity of min, max on sets
                            
                                Q Learning Applied To a Two Player Game
                            
                                Keras ConvLSTM2D: ValueError on output layer
                            
                                ModuleNotFoundError issue for pytest
                            
                                Cryptacular is broken
                            
                                matplotlib 1.3.1 has requirement numpy>=1.5, but you'll have numpy 1.8.0rc1 which is incompatible
                            
                                Python: Remove duplicates for a specific item from list
                            
                                Why can a subprocess still write to stdout after it's been closed?
                            
                                python requests.get gets stuck
                            
                                Is tf.contrib.layers.fully_connected() behavior change between tensorflow 1.3 and 1.4 an issue?
                            
                                Updating an OpenCV tracker with a bounding box in python
                            
                                How to serialize numpy arrays?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

beautiful soup regex

Tags:

python

regex

beautifulsoup

Obie

People also ask

2 Answers

MasOOd.KamYab

silgon

Recent Activity

Donate For Us