My aim is to read line from the file , strip the blank spaces at the end of it and write back into the same file. I have tried the following code:
with open(filename, 'r+') as f:
for i in f:
f.write(i.rstrip()+"\n")
This seems to write at the end of the file, keeping initial data in the file intact . I know that using f.seek(0)
would take the pointer back to start of the file , which I am assuming would be somehow required for this solution.
Can you please advise if there is different approach for this or am I on the right patch just need to add more logic into the code?
'r+' opens the file for both reading and writing. On Windows, 'b' appended to the mode opens the file in binary mode, so there are also modes like 'rb', 'wb', and 'r+b'. Also reading then writing works equally well using 'r+b' mode, but you have to use f.
To write to a text file in Python, you can use the built-in open function, specifying a mode of w or wt . You can then use the write method on the file object you get back to write to that file. It's best to use a with block when you're opening a file to write to it.
During the actual reading and writing, yes. But multiple processes can open the same file at the same time, then write back.
Use a temporary file. Python provides facilities for creating temporary files in a secure manner. Call example below with: python modify.py target_filename
import tempfile
import sys
def modify_file(filename):
#Create temporary file read/write
t = tempfile.NamedTemporaryFile(mode="r+")
#Open input file read-only
i = open(filename, 'r')
#Copy input file to temporary file, modifying as we go
for line in i:
t.write(line.rstrip()+"\n")
i.close() #Close input file
t.seek(0) #Rewind temporary file to beginning
o = open(filename, "w") #Reopen input file writable
#Overwriting original file with temporary file contents
for line in t:
o.write(line)
t.close() #Close temporary file, will cause it to be deleted
if __name__ == "__main__":
modify_file(sys.argv[1])
References here: http://docs.python.org/2/library/tempfile.html
The problem with your approach is that you need both an input stream and an output stream, which can point at different places in the same file. If you want to use f.seek()
then you will need to store the position using f.tell()
after each read and write. For example:
f = open(filename, 'r+')
while True:
i = f.readline()
if i == '': break
in = f.tell()
f.seek(out)
f.write(i.rstrip()+"\n")
out = f.tell()
f.seek(in)
But that's confusing and prone to errors. If the file isn't too big, why not read it all into memory and then write it back out again?
in = open(filename, 'r')
lines = in.read()
in.close()
out = open(filename, 'w')
out.write([line.rstrip()+'\n' for line in lines.split('\n')])
out.close()
If the file is too large to fit into memory, then write the lines to a temporary file and then rename the file when you are done:
out = open(filename+'.tmp', 'w')
with open(filename, 'r') as f:
for i in f:
out.write(i.rstrip()+"\n")
out.close()
os.rename(filename+'.tmp', filename)
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With