re.search Multiple lines Python

re.search with \s or '\n' is not finding the multi-line text I'm trying to search for.

Portion of Source:

Date/Time:
2013-08-27 17:05:36 

----- BEGIN SEARCH -----

GENERAL DATA:
NAME:   AB12
SECTOR: 
999,999
CONTROLLED BY:  Player
ALLIANCE:   Aliance
ONLINE: 1 seconds ago
SIZE:   Large
HOMEWORLD:  NO
APPROVAL RATING:    100%
PRODUCTION RATE:    100%

RESOURCE DATA:
POWER:  0 / 0
BUILDINGS:  0 / 20
ORE:    80,000 / 80,000
CRYSTAL:    80,000 / 80,000
POPULATION: 40,000 / 40,000

BUILDING DATA:
N/A

UNIT DATA:
WYVERN(S):  100

----- END SEARCH -----

Looking at it in Notepad++ I see "BUILDING DATA:(LF)"

Full Code

lines = open('scan.txt','r').readlines()
for a in lines:
    if re.search(r"\A\d", a):
        digits = a
        if re.search(r"2013", digits):
            date.append(digits[:19])
            count +=1
        elif re.search(r",", digits):
            clean = digits.rstrip()
            sector = clean.split(',')
            x.append(sector[0])
            y.append(sector[1])
    elif re.search(r"CONTROLLED BY:", a):
        player.append(a[15:].rstrip())
    elif re.search(r"ALLIANCE:", a):
        alliance.append(a[10:].rstrip())
    elif re.search(r"SIZE:", a):
        size.append(a[6:].rstrip())
    elif re.findall('BUILDING DATA:\sN/A', a, re.M):
        def_grid = ''
        print "Didn't find it"
        defense.append(def_grid)
        defense_count +=1
    elif re.search(r"DEFENSE GRID", a):
        def_grid = a[16:].rstrip()
        print "defense found"
        defense_count +=1

But nothing is returned.

I need to insert an empty spacer when "DEFENSE GRID" doesn't appear after "BUILDING DATA:".

I know I'm missing something. I've tried reading up on re.search, but I can't find any thorough examples that explain how multi-line matching works.

asked Aug 29 '13 21:08

Xariec

2 Answers

re.findall("BUILDING DATA:\nN/A", a, re.MULTILINE)

answered Sep 25 '22 11:09

Goontracker

You can do just what you did, but using re.findall instead of re.search:

re.findall('BUILDING DATA:\nN/A', a, re.M)
#['BUILDING DATA:\nN/A']
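
A side note on the flag, as a small sketch: re.M (re.MULTILINE) only changes where ^ and $ match. It has no bearing on whether a pattern can cross a line break, because both \n and \s already match a line feed on their own. The sample string below is an assumption modeled on the scan in the question, and the variable names are illustrative only.

```python
import re

# Assumed sample: two adjacent lines from the scan, joined by a line feed.
text = "BUILDING DATA:\nN/A\n"

# Both \n and \s match the line feed; no flag is required for that.
with_newline = re.findall(r"BUILDING DATA:\nN/A", text)
with_space_class = re.findall(r"BUILDING DATA:\sN/A", text)

# re.M only matters once the pattern uses the ^ or $ anchors.
anchored_plain = re.findall(r"^N/A$", text)            # ^ matches only at the very start
anchored_multiline = re.findall(r"^N/A$", text, re.M)  # ^ and $ match at each line
```

Here the first two calls find the two-line text with or without the flag, while the anchored pattern finds "N/A" only when re.M is passed.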

EDIT:

The problem is that you are reading the file line by line, so each string you pass to the regex holds only one line. To detect a pattern that spans two or more lines, you have to search the text as a whole, for example by doing:

s = ''.join(lines)

which is fine as long as the file is not too large. Then use s to perform your multi-line searches.
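
To make this concrete, here is a minimal sketch rather than a drop-in fix. It assumes the layout shown in the question and that a "DEFENSE GRID" line, when present, directly follows "BUILDING DATA:". The question never shows such a line, so the second sample record is invented for illustration, and the helper name defense_entries is hypothetical. The code avoids print so that it runs unchanged on Python 2.7 and Python 3.

```python
import re

# Shortened sample input. The DEFENSE GRID line is invented, since the
# question does not show what one looks like.
lines = [
    "BUILDING DATA:\n",
    "N/A\n",
    "\n",
    "UNIT DATA:\n",
    "WYVERN(S):  100\n",
    "BUILDING DATA:\n",
    "DEFENSE GRID:   5\n",
]

# Line by line, no single string holds both lines, so nothing matches.
per_line_hits = [a for a in lines if re.search(r"BUILDING DATA:\sN/A", a)]

# Joined into one string, the same pattern can cross the line break.
s = "".join(lines)
whole_text_hits = re.findall(r"BUILDING DATA:\sN/A", s)


def defense_entries(text):
    """Return one entry per BUILDING DATA block: '' for N/A, else the value."""
    entries = []
    # Capture the line that follows each "BUILDING DATA:" header.
    for following in re.findall(r"BUILDING DATA:\n(.*)", text):
        if following.startswith("DEFENSE GRID"):
            entries.append(following.split(":", 1)[1].strip())
        else:
            entries.append("")  # empty spacer, as the question asks for
    return entries


defense = defense_entries(s)
```

With this input, per_line_hits stays empty while whole_text_hits contains the two-line match, and defense ends up with one entry per record: an empty string for the "N/A" block and the captured value for the other.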

answered Sep 25 '22 11:09

Saullo G. P. Castro

Tags: python, string, regex, python-2.7
