Counting differences between two strings

Tags:

python

I'm trying to count the number of differences between two imported strings (seq1 and seq2, import code not listed), but am getting no result when running the program. I want the output to read something like "2 differences." Not sure where I'm going wrong...

def difference (seq1, seq2):    
    count = 0
    for i in seq1:
        if seq1[i] != seq2[i]:
            count += 1
        return (count)
    print (count, "differences")

830

asked Feb 10 '15 03:02

Ryan Scott

3 Answers

You could do this pretty flatly with a generator expression

count = sum(1 for a, b in zip(seq1, seq2) if a != b)

If the sequences are of a different length, then you may consider the difference in length to be difference in content (I would). In that case, tag on an extra piece to account for it

count = sum(1 for a, b in zip(seq1, seq2) if a != b) + abs(len(seq1) - len(seq2))

Another weirdish way to write that which takes advantage of True being 1 and False being 0 is:

sum(a != b for a, b in zip(seq1, seq2))+ abs(len(seq1) - len(seq2))

zip is a python builtin that allows you to iterate over two sequences at once. It will also terminate on the shortest sequence, observe:

>>> seq1 = 'hi'
>>> seq2 = 'world'
>>> for a, b in zip(seq1, seq2):
...     print('a =', a, '| b =', b)
... 
a = h | b = w
a = i | b = o

This will evaluate similar to sum([1, 1, 1]) where each 1 represents a difference between the two sequences. The if a != b filter causes the generator to only produce a value when a and b differ.

133

answered Oct 22 '22 16:10

Ryan Haining

When you say for i in seq1 you are iterating over the characters, not the indexes. You can use enumerate by saying for i, ch in enumerate(seq1) instead.

Or even better, use the standard function zip to go through both sequences at once.

You also have a problem because you return before you print. Probably your return needs to be moved down and unindented.

answered Oct 22 '22 18:10

John Zwinck

in your script there are to mistakes

"i" should be integer, not char
"return" should be in function the same level as print, not in cycle "for"
try not to use "print" in such way in functions

here is working version:

def difference (seq1, seq2):    
    count = 0
    for i in range(len(seq1)):
        if seq1[i] != seq2[i]:
            count += 1
    return (count)

answered Oct 22 '22 18:10

breezin

Related questions
                            
                                how to set global const variables in python
                            
                                Animate points with labels with matplotlib
                            
                                Is there a faster way to test if two lists have the exact same elements than Pythons built in == operator?
                            
                                breaking up long path names
                            
                                how to serve downloadable zip file in django
                            
                                Python float to ratio
                            
                                How to redirect the raw_input to stderr and not stdout?
                            
                                Get (column, row) index from NumPy array that meets a boolean condition
                            
                                How to access the next element in for loop in current index/iteration?
                            
                                Django login form
                            
                                Can't run MySql Utilities
                            
                                Forming numpy array from array buffer from shared memory (multiprocessing) fails
                            
                                Breaking ties in Python sort
                            
                                Python-peewee get last saved row
                            
                                Python Convert time to UTC format
                            
                                Python: How do I format numbers for a fixed width?
                            
                                Convert a list of bytes to a byte string
                            
                                Pydub concatenate mp3 in a directory
                            
                                Two's complement of Hex number in Python
                            
                                pip install error: cannot import name 'unpack_url'

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With