Remove Commas from Large CSV (1GB)

Tags:

I have a large CSV file (1GB) that I would like to remove commas from. The data are all positive integers. Methods I have tried include dlmwrite with a space as the delimiter, but the output is then output in decimal format. I have also tried to use the fprintf command, but then I lose the shape of the matrix (i.e. all data appear in one line or column).

Thus,

Is there a simple way to read in from a CSV (input.txt):

1, 2, 3, 4, 5
2, 3, 4, 5, 6

and then output to a text file (output.txt) in the form:

1 2 3 4 5
2 3 4 5 6

302

asked Jul 31 '12 15:07

user1566235

2 Answers

In Python, if the format is really that simple (and there already is a space after each comma):

with open("infile.csv") as infile, open("outfile.csv", "w") as outfile:
    for line in infile:
        outfile.write(line.replace(",", ""))

If you can't be sure about whitespace:

import re
with open("infile.csv") as infile, open("outfile.csv", "w") as outfile:
    for line in infile:
        outfile.write(re.sub(r"\s*,\s*", " ", line))

124

answered Oct 05 '22 17:10

Tim Pietzcker

Personally, I like to use sed, a command line program that replaces strings.

This application is available on linux and via a cygwin install also in windows.

Using

sed -i 's/,/ /g' filename

all the commas in the file are replaced by spaces.

answered Oct 05 '22 17:10

Hugo van den Brand

Related questions
                            
                                Python raw socket listening for UDP packets; only half of the packets received
                            
                                Python Subprocess returns non-zero exit status only in cron
                            
                                Is greedy "or" group in regex exists?
                            
                                Database storage: Why is Pipeline better than Feed Export?
                            
                                How to get a window title and scan it every 100ms use python?
                            
                                Line buffered serial input
                            
                                How do I encrypt in Python and decrypt in Java?
                            
                                Python WX - Returning user input from wx Dialog
                            
                                Update request.POST or request.GET using a view decorator
                            
                                Load image from memory in Kivy
                            
                                Defining __getattr__ and __getitem__ on a function has no effect
                            
                                assigning two variables to one list slice
                            
                                Inverse function for monotonically increasing function, OverflowError for log10()
                            
                                uWSGI Server log…permission denied to read file...which file?
                            
                                error in deploying a project using scrapyd
                            
                                HEX decoding in Python 3.2
                            
                                Python hadoop streaming : Setting a job name
                            
                                How do I create a Python socket server that listens on a file descriptor?
                            
                                import next() python 2.5
                            
                                Python Shared Memory Array, no attribute get_obj()

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Remove Commas from Large CSV (1GB)

Tags:

python

csv

matlab

comma

user1566235

People also ask

2 Answers

Tim Pietzcker

Hugo van den Brand

Recent Activity

Donate For Us