Testing in Python using doctest

Python Testing Cookbook

Coding a test harness for doctest

The doctest module supports creating objects, invoking methods, and checking results. With this recipe, we will explore this in more detail.

An important aspect of doctest is that it finds individual instances of docstrings, and runs them in a local context. Variables declared in one docstring cannot be used in another docstring.

The reader can benefit from the previous article on Python: Using doctest for Documentation.

How to do it...

Create a new file called recipe19.py to contain the code from this recipe.

Write a simple shopping cart application.
class ShoppingCart(object):
def __init__(self):
self.items = []
def add(self, item, price):
self.items.append(Item(item, price))
return self
def item(self, index):
return self.items[index-1].item
def price(self, index):
return self.items[index-1].price
def total(self, sales_tax):
sum_price = sum([item.price for item in self.items])
return sum_price*(1.0 + sales_tax/100.0)
def __len__(self):
return len(self.items)

class Item(object):
def __init__(self, item, price):
self.item = item
self.price = price

Insert a docstring at the top of the module, before the ShoppingCart class declaration.
```
"""
This is documentation for the this entire recipe.
With it, we can demonstrate usage of the code.
```
>>> cart = ShoppingCart().add("tuna sandwich", 15.0)
>>> len(cart)
1
>>> cart.item(1)
'tuna sandwich'
>>> cart.price(1)
15.0
>>> print round(cart.total(9.25), 2)
16.39
"""
class ShoppingCart(object):
...

Run the recipe using -m doctest and -v for verbosity.

Copy all the code we just wrote from recipe19.py into a new file called recipe19b.py.

Inside recipe19b.py add another docstring to item, which attempts to re-use the cart variable defined at the top of the module.
def item(self, index):
"""
>>> cart.item(1)
'tuna sandwich'
"""
return self.items[index-1].item

Run this variant of the recipe. Why does it fail? Wasn't cart declared in the earlier docstring?

How it works...

The doctest module looks for every docstring. For each docstring it finds, it creates a shallow copy of the module's global variables and then runs the code and checks results. Apart from that, every variable created is locally scoped and then cleaned up when the test is complete. This means that our second docstring that was added later cannot see the cart that was created in our first docstring. That is why the second run failed.

There is no equivalent to a setUp method as we used with some of the unittest recipes. If there is no setUp option with doctest, then what value is this recipe? It highlights a key limitation of doctest that all developers must understand before using it.

There's more...

The doctest module provides an incredibly convenient way to add testability to our documentation. But this is not a substitute for a full-fledged testing framework, like unittest. As noted earlier, there is no equivalent to a setUp. There is also no syntax checking of the Python code embedded in the docstrings.

Mixing the right level of doctests with unittest (or other testing framework we pick) is a matter of judgment.

Filtering out test noise

Various options help doctest ignore noise, such as whitespace, in test cases. This can be useful, because it allows us to structure the expected outcome in a better way, to ease reading for the users.

We can also flag some tests that can be skipped. This can be used where we want to document known issues, but haven't yet patched the system.

Both of these situations can easily be construed as noise, when we are trying to run comprehensive testing, but are focused on other parts of the system. In this recipe, we will dig in to ease the strict checking done by doctest. We will also look at how to ignore entire tests, whether it's on a temporary or permanent basis.

How to do it...

With the following steps, we will experiment with filtering out test results and easing certain restrictions of doctest.

Create a new file called recipe20.py to contain the code from this recipe.

Create a recursive function that converts base10 numbers into other bases.
def convert_to_basen(value, base):
import math
def _convert(remaining_value, base, exp):
def stringify(value):
if value > 9:
return chr(value + ord('a')-10)
else:
return str(value)

if remaining_value >= 0 and exp >= 0:
factor = int(math.pow(base, exp))
if factor <= remaining_value:
multiple = remaining_value / factor
return stringify(multiple) +
_convert(remaining_value-multiple*factor,
base, exp-1)
else:
return "0" +
_convert(remaining_value, base, exp-1)
else:
return ""
return "%s/%s" % (_convert(value, base,
int(math.log(value, base))), base)

Add a docstring that includes a test to exercise a range of values as well as documenting a future feature that is not yet implemented.
def convert_to_basen(value, base):
"""Convert a base10 number to basen.
>>> [convert_to_basen(i, 16) for i in range(1,16)] #doctest:
+NORMALIZE_WHITESPACE
['1/16', '2/16', '3/16', '4/16', '5/16', '6/16', '7/16',
'8/16',
'9/16', 'a/16', 'b/16', 'c/16', 'd/16', 'e/16', 'f/16']
FUTURE: Binary may support 2's complement in the future, but
not now.
>>> convert_to_basen(-10, 2) #doctest: +SKIP
'0110/2'
"""
import math

Add a test runner.
if __name__ == "__main__":
import doctest
doctest.testmod()

Run the test case in verbose mode.

Copy the code from recipe20.py into a new file called recipe20b.py.

Edit recipe20b.py by updating the docstring to include another test exposing that our function doesn't convert 0.
def convert_to_basen(value, base):
"""Convert a base10 number to basen.
>>> [convert_to_basen(i, 16) for i in range(1,16)] #doctest:
+NORMALIZE_WHITESPACE
['1/16', '2/16', '3/16', '4/16', '5/16', '6/16', '7/16',
'8/16',
'9/16', 'a/16', 'b/16', 'c/16', 'd/16', 'e/16', 'f/16']
FUTURE: Binary may support 2's complement in the future, but
not now.
>>> convert_to_basen(-10, 2) #doctest: +SKIP
'0110/2'
BUG: Discovered that this algorithm doesn't handle 0. Need to
patch it.
TODO: Renable this when patched.
>>> convert_to_basen(0, 2)
'0/2'
"""
import math

Run the test case. Notice what is different about this version of the recipe; and why does it fail?

Copy the code from recipe20b.py into a new file called recipe20c.py.

Edit recipe20c.py and update the docstring indicating that we will skip the test for now.
def convert_to_basen(value, base):
"""Convert a base10 number to basen.
>>> [convert_to_basen(i, 16) for i in range(1,16)] #doctest:
+NORMALIZE_WHITESPACE
['1/16', '2/16', '3/16', '4/16', '5/16', '6/16', '7/16',
'8/16',
'9/16', 'a/16', 'b/16', 'c/16', 'd/16', 'e/16', 'f/16']
FUTURE: Binary may support 2's complement in the future, but
not now.
>>> convert_to_basen(-10, 2) #doctest: +SKIP
'0110/2'
BUG: Discovered that this algorithm doesn't handle 0. Need to
patch it.
TODO: Renable this when patched.
>>> convert_to_basen(0, 2) #doctest: +SKIP
'0/2'
"""
import math

Run the test case.

How it works...

In this recipe, we revisit the function for converting from base-10 to any base numbers. The first test shows it being run over a range. Normally, Python would fit this array of results on one line. To make it more readable, we spread the output across two lines. We also put some arbitrary spaces between the values to make the columns line up better.

This is something that doctest definitely would not support, due to its strict pattern matching nature. By using #doctest: +NORMALIZE_WHITESPACE, we are able to ask doctest to ease this restriction. There are still constraints. For example, the first value in the expected array cannot have any whitespace in front of it. But wrapping the array to the next line no longer breaks the test.

We also have a test case that is really meant as documentation only. It indicates a future requirement that shows how our function would handle negative binary values. By adding #doctest: +SKIP, we are able to command doctest to skip this particular instance.

Finally, we see the scenario where we discover that our code doesn't handle 0. As the algorithm gets the highest exponent by taking a logarithm, there is a math problem. We capture this edge case with a test. We then confirm that the code fails in classic test driven design (TDD) fashion. The final step would be to fix the code to handle this edge case. But we decide, in a somewhat contrived fashion, that we don't have enough time in the current sprint to fix the code. To avoid breaking our continuous integration (CI) server, we mark the test with a TO-DO statement and add #doctest: +SKIP.

There's more...

Both the situations that we have marked up with #doctest: +SKIP, are cases where eventually we will want to remove the SKIP tag and have them run. There may be other situations where we will never remove SKIP. Demonstrations of code that have big fluctuations may not be readily testable without making them unreadable. For example, functions that return dictionaries are harder to test, because the order of results varies. We can bend it to pass a test, but we may lose the value of documentation to make it presentable to the reader.