Using numpy to filter out multiple comment symbols

Question

I am looking for a way to pull data from a file that has multiple comment symbols. The input file looks similar to:

# filename: sample.txt
# Comment 1
# Comment 2
$ Comment 3
1,10
2,20
3,30
4,40
# Comment 4

I can only seem to remove one comment type with the following code and can't find any documentation on how I might remove both.

import numpy as np
data = np.loadtxt('sample.txt',comments="#") # I need to also filter out '$'

Are there any alternative methods I could use to accomplish this?

Vladas O. · Accepted Answer

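Simply pass a list of symbols as the comments argument. Since the sample data is comma-separated, set the delimiter as well, for example:

data = np.loadtxt('sample.txt', comments=['#', '$', '@'], delimiter=',')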
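
To try this without creating a file, here is a minimal sketch that feeds the sample from the question through an in-memory buffer. The io.StringIO stand-in and the variable names are assumptions for illustration, and it presumes a NumPy version whose loadtxt accepts a sequence for comments:

```python
import io
import numpy as np

# In-memory stand-in for sample.txt from the question (an assumption for the demo).
sample = io.StringIO(
    "# filename: sample.txt\n"
    "# Comment 1\n"
    "# Comment 2\n"
    "$ Comment 3\n"
    "1,10\n"
    "2,20\n"
    "3,30\n"
    "4,40\n"
    "# Comment 4\n"
)

# Lines starting with either symbol are skipped; the rest are split on commas.
data = np.loadtxt(sample, comments=['#', '$'], delimiter=',')
print(data.shape)
```

loadtxt returns floats by default; pass dtype=int if integer values are wanted.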

Saullo G. P. Castro · Answer

I would create a generator that will ignore the comments and then pass it to np.genfromtxt():

with open('sample.txt') as f:
    # Consume the lazy generator while the file is still open
    gen = (r for r in f if not r.startswith(('$', '#')))
    a = np.genfromtxt(gen, delimiter=',')
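
The same idea can be wrapped in a small reusable generator. This is only a sketch: skip_comment_lines is a hypothetical helper name, not part of NumPy, and the in-memory buffer stands in for sample.txt. Unlike the one-liner, it also tolerates blank lines and leading whitespace before a comment symbol:

```python
import io
import numpy as np

def skip_comment_lines(lines, symbols=('#', '$')):
    """Yield only non-blank lines that do not start with a comment symbol.

    Hypothetical helper for illustration; `symbols` must be a tuple
    because str.startswith accepts a str or a tuple of str.
    """
    for line in lines:
        stripped = line.lstrip()
        if stripped and not stripped.startswith(symbols):
            yield line

# In-memory stand-in for sample.txt (an assumption for the demo).
sample = io.StringIO(
    "# Comment 1\n"
    "$ Comment 3\n"
    "1,10\n"
    "2,20\n"
    "\n"
    "   # indented comment\n"
    "3,30\n"
    "4,40\n"
)

a = np.genfromtxt(skip_comment_lines(sample), delimiter=',')
print(a.shape)
```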

Fredrik Pihl · Answer

For this case, you can fall back to a plain Python loop over the input, e.g. something like this:

data = []
with open("sample.txt") as fd:
    for line in fd:
        # Skip lines that begin with either comment symbol
        if line.startswith(('#', '$')):
            continue
        # Build a real list; in Python 3, map() alone returns a lazy iterator
        data.append([int(x) for x in line.strip().split(',')])

print(data)

Output:

[[1, 10], [2, 20], [3, 30], [4, 40]]
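
If a NumPy array is needed afterwards, the filtered rows can be converted directly. A short sketch, assuming the lines have already been read into a list (the lines and rows names here are illustrative):

```python
import numpy as np

# Stand-in for the lines of sample.txt (an assumption for the demo).
lines = ["# Comment 1\n", "$ Comment 3\n", "1,10\n", "2,20\n", "3,30\n", "4,40\n", "# Comment 4\n"]

# Same filtering as the loop above, written as a comprehension.
rows = [
    [int(x) for x in line.strip().split(',')]
    for line in lines
    if not line.startswith(('#', '$'))
]

# All rows have the same length, so this produces a 2-D integer array.
arr = np.array(rows, dtype=int)
print(arr.shape)
```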
