I have a file, myfile.csv, with rows like:
first, second, third
1, 2, 3
a, b, c
1, 2, 3
and so on.
I don't understand how to remove the duplicate rows in myfile.csv.
One condition: we can't save to a new file; myfile.csv itself must be updated.
After running the script, myfile.csv should look like:
first, second, third
a, b, c
1, 2, 3
So the new data must not be saved to a new file; myfile.csv needs to be updated in place.
Thank you very much.
You can read all the rows, keep only the first occurrence of each one, and write the result back to the same file:
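import csv

# Read every row into memory; newline='' lets the csv module handle line endings.
with open('myfile.csv', newline='') as f:
    data = list(csv.reader(f))

# Keep a row only if it has not appeared earlier (the first occurrence wins).
new_data = [a for i, a in enumerate(data) if a not in data[:i]]

# Overwrite the same file with the de-duplicated rows.
with open('myfile.csv', 'w', newline='') as t:
    write = csv.writer(t)
    write.writerows(new_data)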
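The slice check `a not in data[:i]` rescans all earlier rows for every row, so it can get slow on large files. A minimal sketch of the same idea using a set of already-seen rows is shown here; the function name `dedupe_csv_in_place` is hypothetical and not part of the original answer, and it assumes the whole file fits in memory.

```python
import csv


def dedupe_csv_in_place(path):
    """Remove duplicate rows from the CSV at `path`, keeping first occurrences.

    Hypothetical helper for illustration; assumes the file fits in memory.
    """
    # Read everything first so the same file can then be safely overwritten.
    with open(path, newline='') as f:
        rows = list(csv.reader(f))

    seen = set()
    unique_rows = []
    for row in rows:
        key = tuple(row)  # lists are unhashable, tuples can be stored in a set
        if key not in seen:
            seen.add(key)
            unique_rows.append(row)

    # Overwrite the original file; no second file is created.
    with open(path, 'w', newline='') as f:
        csv.writer(f).writerows(unique_rows)
    return unique_rows
```

Calling `dedupe_csv_in_place('myfile.csv')` rewrites the file in place. Rows are compared as exact strings, so `1, 2, 3` and `1,2,3` count as different rows, and the order of first occurrences is preserved.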