How can a Python 2 doctest fail and yet have no difference in the values in the failure message?

Tags: python, unicode, doctest

I'm using Python 2.7.9 on Windows.

I have a UTF-8-encoded Python script file with the following contents:

# coding=utf-8

def test_func():
    u"""
    >>> test_func()
    u'☃'
    """
    return u'☃'

I get a curious failure when I run the doctest:

Failed example:
    test_func()
Expected:
    u'\u2603'
Got:
    u'\u2603'

I see this same failure output whether I launch the doctests through the IDE I usually use (IntelliJ IDEA) or from the command line:

> x:\my_virtualenv\Scripts\python.exe -m doctest -v hello.py

I copied the lines under Expected and Got into WinMerge to rule out some subtle difference in the characters I couldn't spot; it told me they were identical.

However, if I redo the command line run, but redirect the output to a text file, like so:

> x:\my_virtualenv\Scripts\python.exe -m doctest -v hello.py > out.txt

the test still fails, but the resulting failure output is a bit different:

Failed example:
    test_func()
Expected:
    u'☃'
Got:
    u'\u2603'

If I put the escaped unicode literal in my doctest:

# coding=utf-8

def test_func():
    u"""
    >>> test_func()
    u'\\u2603'
    """
    return u'☃'

the test passes. But as far as I can tell, u'\u2603' and u'☃' should evaluate to the same thing.

Really I have two questions about the failing case:

Is one of the representations that the doctester is giving (under Expected or Got) incorrect for the value that the doctester has for that case? (i.e. x != eval(repr(x)))
If not, why does the test fail?
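
The first question can be checked at the value level. The following is a minimal sketch, not part of the original script: the names `escaped`, `literal`, `same_value` and `round_trips` are made up for illustration, and the literal snowman is built from its UTF-8 bytes so that the snippet itself stays ASCII-only.

```python
import unicodedata

escaped = u'\u2603'                          # the escaped spelling of the character
literal = b'\xe2\x98\x83'.decode('utf-8')    # the bytes a UTF-8 source file stores for the snowman

# Both spellings produce the same one-character unicode value.
same_value = (escaped == literal)

# repr() followed by eval() gives the value back, so the repr itself is not wrong.
round_trips = (eval(repr(escaped)) == escaped)

print(same_value)                   # True
print(round_trips)                  # True
print(unicodedata.name(escaped))    # SNOWMAN
```

If this holds on your interpreter as well, the values are equal and the round trip is sound, which suggests that the failure comes from comparing the text of the expected output with the text of the repr rather than from comparing the values.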

692

asked May 16 '15 03:05

rakslice

1 Answer

The doctest module compares the actual output with the expected output as strings, and it can use difflib to render the differences between them. For example:

>>> import difflib
>>> variation = difflib.unified_diff('x', 'x')
>>> list(variation)
[]
>>> variation = difflib.unified_diff('x', 'y')
>>> list(variation)
['--- \n', '+++ \n', '@@ -1 +1 @@\n', '-x', '+y']

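Under the hood, the doctest module formats the actual and expected output several times before printing them. Your problem seems to be a misreading caused by string encodings. The comparison is made on the original strings: the expected text contains a literal ☃, while the actual text is the repr, which contains the six characters \u2603, so the two really do differ. What gets printed to the console, however, has already been formatted (using %s) and, as far as I can tell from the Python 2.7 source, encoded for the console with the backslashreplace error handler. A ☃ that the console encoding cannot represent is therefore written as \u2603, which removes the visible difference and makes Expected and Got look identical. When the output is redirected to a file, a different encoding is used (apparently UTF-8), so the ☃ survives and the difference shows up.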
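
The effect can be reproduced outside doctest. The following is a minimal sketch, not doctest's own code: `want` and `got` are assumed stand-ins for the two strings being compared, `as_displayed` is a hypothetical helper, and cp437 is only an assumed console code page (yours may differ).

```python
import difflib

# Assumed stand-ins for the two strings compared in the failing case.
want = u"u'\u2603'\n"     # docstring text: contains the snowman character itself
got = u"u'\\u2603'\n"     # Python 2 repr output: backslash, 'u', '2', '6', '0', '3'


def as_displayed(text, encoding):
    """Hypothetical helper: encode for an output stream, escaping what it cannot hold."""
    return text.encode(encoding, "backslashreplace").decode(encoding)


# The raw strings differ, so a diff of them is not empty.
raw_diff = list(difflib.unified_diff([want], [got]))
print(want == got)           # False
print(len(raw_diff) > 0)     # True

# Encoded for a console that cannot represent the snowman, they become identical.
print(as_displayed(want, "cp437") == as_displayed(got, "cp437"))    # True

# Encoded as UTF-8 (as when redirecting to a file), the difference survives.
print(as_displayed(want, "utf-8") == as_displayed(got, "utf-8"))    # False
```

Under these assumptions the console view and the file view disagree in the same way as in the question: identical-looking lines on the console, and a visible ☃ under Expected once the output is written as UTF-8.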

107

answered Sep 30 '22 16:09

Zach Gates
