Replace multiple newlines with single newlines during reading file

Tags:

I have the next code which reads from multiple files, parses obtained lines and prints the result:

import os
import re

files=[]
pars=[]

for i in os.listdir('path_to_dir_with_files'):
    files.append(i)

for f in files:
    with open('path_to_dir_with_files'+str(f), 'r') as a:
       pars.append(re.sub('someword=|\,.*|\#.*','',a.read()))

for k in pars:
   print k

But I have problem with multiple new lines in output:

test1


test2

Instead of it I want to obtain the next result without empty lines in output:

 test1
 test2

and so on.

I tried playing with regexp:

pars.append(re.sub('someword=|\,.*|\#.*|^\n$','',a.read()))

But it doesn't work. Also I tried using strip() and rstrip() including replace. It also doesn't work.

380

asked Mar 06 '17 15:03

user54

1 Answers

You could use a second regex to replace multiple new lines with a single new line and use strip to get rid of the last new line.

import os
import re

files=[]
pars=[]

for i in os.listdir('path_to_dir_with_files'):
    files.append(i)

for f in files:
    with open('path_to_dir_with_files/'+str(f), 'r') as a:
        word = re.sub(r'someword=|\,.*|\#.*','', a.read())
        word = re.sub(r'\n+', '\n', word).strip()
        pars.append(word)

for k in pars:
   print k

144

answered Oct 18 '22 13:10

Kris

Related questions
                            
                                ImportError: No module named cv2
                            
                                Write Large Pandas DataFrames to SQL Server database
                            
                                How to use Ansible 2.0 Python API to run a Playbook?
                            
                                What is the use of returning self in the __iter__ method?
                            
                                Python Tkinter Entry get()
                            
                                python abstract base classes, difference between mixin & abstract method
                            
                                Call column in dataframe by column index instead of column name - pandas
                            
                                Python: How can I tell if my python has SSL?
                            
                                Difference between reverse and [::-1]
                            
                                python decorate function call
                            
                                Zen of Python: Errors should never pass silently. Why does zip work the way it does?
                            
                                Represent infinity as an integer in Python 2.7
                            
                                Sort a sublist of elements in a list leaving the rest in place
                            
                                Why is print("text" + str(var1) + "more text" + str(var2)) described as "disapproved"?
                            
                                Sort a list of tuples in consecutive order
                            
                                Multiple 'for' loops in dictionary generator
                            
                                Rotate minor ticks in matplotlib
                            
                                Python: How to NOT wait for a thread to finish to carry on?
                            
                                Can't run binary from within python aws lambda function
                            
                                Is it possible to show multiple plots in separate windows using matplotlib?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Replace multiple newlines with single newlines during reading file

Tags:

python

regex

file

user54

People also ask

1 Answers

Kris

Recent Activity

Donate For Us