I am running through lines in a text file using a <code>python</code> script. I want to search for an <code>img</code> tag within the text document and return the tag as text. When I run the regex <code>re.match(line)</code> it returns a <code>_sre.SRE_MATCH</code> object. How do I get it to return a string? <pre class="prettyprint"><code>import sys import string import re f = open("sample.txt", 'r' ) l = open('writetest.txt', 'w') count = 1 for line in f: line = line.rstrip() imgtag = re.match(r'<img.*?>',line) print("yo it's a {}".format(imgtag)) </code></pre> When run it prints: <pre class="prettyprint"><code>yo it's a None yo it's a None yo it's a None yo it's a <_sre.SRE_Match object at 0x7fd4ea90e578> yo it's a None yo it's a <_sre.SRE_Match object at 0x7fd4ea90e578> yo it's a None yo it's a <_sre.SRE_Match object at 0x7fd4ea90e578> yo it's a <_sre.SRE_Match object at 0x7fd4ea90e5e0> yo it's a None yo it's a None </code></pre>

You should use <code>re.MatchObject.group(0)</code>. Like <pre class="prettyprint"><code>imtag = re.match(r'<img.*?>', line).group(0) </code></pre> Edit: You also might be better off doing something like <pre class="prettyprint"><code>imgtag = re.match(r'<img.*?>',line) if imtag: print("yo it's a {}".format(imgtag.group(0))) </code></pre> to eliminate all the <code>None</code>s.

How do I return a string from a regex match in python? [duplicate]

Tags:

python

regex

I am running through lines in a text file using a python script. I want to search for an img tag within the text document and return the tag as text.

When I run the regex re.match(line) it returns a _sre.SRE_MATCH object. How do I get it to return a string?

import sys import string import re  f = open("sample.txt", 'r' ) l = open('writetest.txt', 'w')  count = 1  for line in f:     line = line.rstrip()     imgtag  = re.match(r'<img.*?>',line)     print("yo it's a {}".format(imgtag))

When run it prints:

yo it's a None yo it's a None yo it's a None yo it's a <_sre.SRE_Match object at 0x7fd4ea90e578> yo it's a None yo it's a <_sre.SRE_Match object at 0x7fd4ea90e578> yo it's a None yo it's a <_sre.SRE_Match object at 0x7fd4ea90e578> yo it's a <_sre.SRE_Match object at 0x7fd4ea90e5e0> yo it's a None yo it's a None

213

asked Aug 28 '13 16:08

Jack Dalton

1 Answers

You should use re.MatchObject.group(0). Like

imtag = re.match(r'<img.*?>', line).group(0)

Edit:

You also might be better off doing something like

imgtag  = re.match(r'<img.*?>',line) if imtag:     print("yo it's a {}".format(imgtag.group(0)))

to eliminate all the Nones.

120

answered Sep 20 '22 04:09

wflynny

Related questions
                            
                                Pandas timeseries plot setting x-axis major and minor ticks and labels
                            
                                how to convert 2d list to 2d numpy array?
                            
                                Mocking Functions Using Python Mock
                            
                                Is 'file' a keyword in python?
                            
                                regexes: How to access multiple matches of a group? [duplicate]
                            
                                Pandas unstack problems: ValueError: Index contains duplicate entries, cannot reshape
                            
                                Python "private" function coding convention
                            
                                Pandas: group by and Pivot table difference
                            
                                Import Script from a Parent Directory
                            
                                Private Constructor in Python
                            
                                How can I print a Python file's docstring when executing it?
                            
                                Divide multiple columns by another column in pandas
                            
                                Continuing in Python's unittest when an assertion fails
                            
                                Django - How to make a variable available to all templates?
                            
                                Jinja2: Change the value of a variable inside a loop
                            
                                Validate SSL certificates with Python
                            
                                ValueError: numpy.dtype has the wrong size, try recompiling
                            
                                Python - manually install package using virtualenv
                            
                                PathLib recursively remove directory?
                            
                                can't compare datetime.datetime to datetime.date

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With