Hello I am new into regex and I'm starting out with python. I'm stuck at extracting all words from an English sentence. So far I have: <pre class="prettyprint"><code>import re shop="hello seattle what have you got" regex = r'(\w*) ' list1=re.findall(regex,shop) print list1 </code></pre> This gives output: <blockquote> ['hello', 'seattle', 'what', 'have', 'you'] </blockquote> If I replace regex by <pre class="prettyprint"><code>regex = r'(\w*)\W*' </code></pre> then output: <blockquote> ['hello', 'seattle', 'what', 'have', 'you', 'got', ''] </blockquote> whereas I want this output <blockquote> ['hello', 'seattle', 'what', 'have', 'you', 'got'] </blockquote> Please point me where I am going wrong.

Use word boundary <code>\b</code> <pre class="prettyprint"><code>import re shop="hello seattle what have you got" regex = r'\b\w+\b' list1=re.findall(regex,shop) print list1 OP : ['hello', 'seattle', 'what', 'have', 'you', 'got'] </code></pre> or simply <code>\w+</code> is enough <pre class="prettyprint"><code>import re shop="hello seattle what have you got" regex = r'\w+' list1=re.findall(regex,shop) print list1 OP : ['hello', 'seattle', 'what', 'have', 'you', 'got'] </code></pre>

Python regex for finding all words in a string [duplicate]

import re

shop="hello seattle what have you got"
regex = r'(\w*) '
list1=re.findall(regex,shop)
print list1

This gives output:

['hello', 'seattle', 'what', 'have', 'you']

If I replace regex by

regex = r'(\w*)\W*'

then output:

['hello', 'seattle', 'what', 'have', 'you', 'got', '']

whereas I want this output

['hello', 'seattle', 'what', 'have', 'you', 'got']

Please point me where I am going wrong.

994

asked May 31 '16 10:05

TNT

1 Answers

Use word boundary \b

import re

shop="hello seattle what have you got"
regex = r'\b\w+\b'
list1=re.findall(regex,shop)
print list1

OP : ['hello', 'seattle', 'what', 'have', 'you', 'got']

or simply \w+ is enough

import re

shop="hello seattle what have you got"
regex = r'\w+'
list1=re.findall(regex,shop)
print list1

OP : ['hello', 'seattle', 'what', 'have', 'you', 'got']

157

answered Oct 11 '22 02:10

Pranav C Balan

Related questions
                            
                                replace all characters in a string with asterisks
                            
                                Get the diagonal of a matrix in TensorFlow
                            
                                Valid syntax in both Python 2.x and 3.x for raising exception?
                            
                                psutil in Apache Spark
                            
                                PEP0484 Type Hinting: Annotating argument of given class, not instance
                            
                                How is an ICMP packet constructed in python
                            
                                Pandas dataframe with MultiIndex: check if string is contained in index level
                            
                                Tornado - What is the difference between RequestHandler's get_argument(), get_query_argument() and get_body_argument()?
                            
                                Run pip in python idle
                            
                                Access next sibling <li> element with BeautifulSoup
                            
                                Using datetime timedelta with a series in a pandas DF
                            
                                Bulk update in Pymongo using multiple ObjectId
                            
                                start node app from python script
                            
                                Apply multiple functions with map
                            
                                double curly brace {{
                            
                                Extract the text from `p` within `div` with BeautifulSoup
                            
                                Django - The current URL, , didn't match any of these
                            
                                Convert a column in pandas dataframe from String to Float
                            
                                faster geometric average on ASCII
                            
                                toctree nested drop down

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Python regex for finding all words in a string [duplicate]

Tags:

python

regex

words

sentence

TNT

People also ask

1 Answers

Pranav C Balan

Recent Activity

Donate For Us