How to break out of only one nested loop

I have two tab-delimited files, and I need to test every row in the first file against all the rows in the other file. For instance,

file1:

    row1    c1    36    345   A
    row2    c3    36    9949  B
    row3    c4    36    858   C

file2:

    row1    c1    3455  3800
    row2    c3    6784  7843
    row3    c3    10564 99302
    row4    c5    1405  1563

Let's say I would like to output all the rows in file1 for which col[3] of file1 is smaller than any (not every) col[2] of file2, given that col[1] is the same in both rows.

Expected output:

    row1    c1    36    345   A
    row2    c3    36    9949  B

Since I am working in Ubuntu, I would like the input command to look like this:
    python code.py [file1] [file2] > [output]

I wrote the following code:

    import sys

    filename1 = sys.argv[1]
    filename2 = sys.argv[2]

    file1 = open(filename1, 'r')
    file2 = open(filename2, 'r')

    done = False

    for x in file1.readlines():
        col = x.strip().split()
        for y in file2.readlines():
            col2 = y.strip().split()
            if col[1] == col2[1] and col[3] < col2[2]:
                done = True
                break
            else: continue
    print x

However, the output looks like this:

    row2    c3    36    9949  B

This is more evident with larger datasets, but basically I always get only the last row for which the condition in the nested loop was true. I suspect that "break" is breaking me out of both loops. I would like to know (1) how to break out of only one of the for loops, and (2) whether this is the only problem I've got here.

asked Sep 01 '13 07:09

biohazard

1 Answer

break and continue apply only to the innermost enclosing loop, so break is not taking you out of both loops here.
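To see this in isolation, here is a minimal sketch (Python 3 syntax; the function name pairs_until_match is made up for illustration) that records every pair the two loops visit. The outer loop keeps going after the inner break:

```python
def pairs_until_match(outer, inner):
    """Record each (a, b) pair visited; break the inner loop when a == b."""
    visited = []
    for a in outer:
        for b in inner:
            visited.append((a, b))
            if a == b:
                break  # leaves only the inner loop
        # execution resumes here and the outer loop moves on to the next a
    return visited


print(pairs_until_match([1, 2], [1, 2, 3]))
# [(1, 1), (2, 1), (2, 2)]
```

The pairs starting with 2 show that the outer loop continued after the first break.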

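The issue is that you open the second file only once, so it is read only once. The first call to file2.readlines() consumes the whole file and leaves the file position at the end, so when for y in file2.readlines(): runs a second time, file2.readlines() returns an empty list and the inner loop body never executes.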
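The effect can be reproduced without any files on disk. This sketch uses io.StringIO as an in-memory stand-in for the opened file2 (that substitution is an assumption made to keep the example self-contained; a real file object opened with open() behaves the same way here):

```python
import io

# In-memory stand-in for an opened file2 (assumption, see above).
file2 = io.StringIO("row1\tc1\t3455\t3800\nrow2\tc3\t6784\t7843\n")

first_pass = file2.readlines()   # reads both lines; position is now at end of file
second_pass = file2.readlines()  # nothing left to read
file2.seek(0)                    # rewind to the start
third_pass = file2.readlines()   # both lines are available again

print(len(first_pass), len(second_pass), len(third_pass))
# 2 0 2
```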

Either move file2 = open(filename2, 'r') into the outer loop, or call file2.seek(0) before each inner loop to rewind to the beginning of file2.
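A third option, not mentioned above, is to read file2 into a list once and reuse that list for every row of file1. The sketch below does that in Python 3 (the question uses Python 2's print statement). The names matching_rows and main are hypothetical. It also converts the compared fields with int(), which is an assumption about the intent: the question's code compares them as strings, and as strings '9949' < '10564' is False.

```python
import sys


def matching_rows(rows1, rows2):
    """Return rows of file1 whose col[3] is below some col[2] of file2 with the same col[1]."""
    parsed2 = [line.split() for line in rows2 if line.strip()]
    kept = []
    for line in rows1:
        col = line.split()
        if not col:
            continue  # skip blank lines
        # int() is an assumption: the question's code compares the fields as strings
        if any(col[1] == col2[1] and int(col[3]) < int(col2[2]) for col2 in parsed2):
            kept.append(line.rstrip("\n"))
    return kept


def main(argv):
    with open(argv[1]) as f1, open(argv[2]) as f2:
        rows2 = f2.readlines()  # read file2 once, reuse it for every row of file1
        for row in matching_rows(f1, rows2):
            print(row)


if __name__ == "__main__" and len(sys.argv) == 3:
    main(sys.argv)
```

Saved as code.py, this can be run as python code.py [file1] [file2] > [output]. Each matching row is collected inside the outer loop, so every match is kept rather than only the last row.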

answered Oct 13 '22 02:10

NPE
