Testing Python code

pytest

Pytest is the most popular testing library for Python. It is not included with the Python standard library so it must be installed with pip. While it does not include a declaration library, it is robust enough to handle most scenarios having a rich and expressive set of constructs and decorators that let you declare what your tests should do, under what conditions they should run, and how they should interact with the rest of your system.

Using pytest

  • Pytest will automatically detect test files so long as they are named appropriately. E.g. for a module called lorem, it will detzect the unit test files lorem_test.py and test_lorem.py.
  • In order to detect tests it should be run from a directory level above them

Examples

Here is a basic example of using pytest for a local module callled palindrome:

import palindrome

def test_is_palindrome():
    assert palindrome.is_palindrome('soros')
    assert palindrome.is_palindrome('torot')
    assert not palindrome.is_palindrome('chair')

Mocking

patch() and Mock enable us to mock objects whilst testing (classes, functions, methods and properties belonging). They are used in combination.

The @patch decorator temporarily replaces a specified object in your code with a mock object and restores the original object after the test is complete

A Mock object simulates the object it replaces so that the object behaves as expected during testing. For example, if your code calls some_function.some_method(), and some_method is mocked, calling some_method will not execute real logic but will interact with the Mock object instead. Mock objects record details about how they have been used, like what methods have been called, with what arguments, etc., allowing you to make assertions about how they have been used.

@patch and Mock work together because a patch is used to replace an object or attribute with a Mock object. Mock handles the simulated functionality, and @patch designates the real value we are replacing with the mock.

Example case

I will use the following example from one my projects:

# get_articles.py
def get_articles(article_type: str) -> Optional[Dict[str, Any]]:
    """Retrieve articles from Pocket API"""

    if POCKET_LAMBDA_ENDPOINT is None:
        logging.error("Error: POCKET_LAMBDA_ENDPOINT envinronment variable is not set")
        return None
    else:
        # Interpolate the article_type into the Pocket request URL
        endpoint = POCKET_LAMBDA_ENDPOINT.format(article_type=article_type)

    try:
        response = requests.get(endpoint)
        response.raise_for_status()
        return response.json()

    except RequestException as e:
        print(f"An error occurred: {e}")
        return None

This function: sources a URL from an environment variable, interpolates a query string into the URL (which comes in as a parameter), makes a request to the URL, and returns the response as JSON.

It has some safeguards in place:

  • It checks that the environment variable is set
  • It checks that the request was successful

In the example we could use a Mock object to simulate the response from the Pocket API. This would allow us to test the function without having to make a real request to the API:

def test_successful_request():
    # Replace the requests.get function with a Mock object (mock_get)
    with patch("requests.get") as mock_get:
        # Specify the return value of the mock_get object)
        mock_get.return_value = Mock(ok=true)
        mock_get.return_value.json.return_value = {"value": "test"}

        # Call the function under test
        result = get_articles("gaby")

    # Assert expected outcomes
    mock_get.assert_called_once_with(endpoint)
    assert result == mock_json_response

The example above follows the Arrange, Act, Assert pattern:

StageAction
ArrangeReplace the requests.get function with patch and set properties with Mock
ActCall the function under test
AssertAssert that the function under test behaved as expected

Alternative mock syntax

The with patch(...) as mock_name syntax is fine for small-scale mocking but can become cumbersome when you are mocking several dependencies.

There is another syntax (which I actually find clearer). Say we have a function with three dependencies: update_worksheet, process_articles, get_articles. We could mock like so:

@patch("app.update_worksheet")
@patch("app.process_articles")
@patch("app.get_articles")
def test_success(
    mock_get_articles, mock_process_articles, mock_update_worksheet
):
    mock_get_articles.return_value = [1, 2, 3]

Here the patching is done by the decorator and the mocks are defined as parameters to the test function (always in reverse order)

Mock assertion lexicon

return_value

State what the mock should return

my_mocked_function.return_value = ['one', 'two', 'three']

call_count

Test how many times a dependent function is called

assert my_mocked_function.call_count = 3

assert_any_call()

Test that a given mock is called at least once during the execution of the function under test

my_mocked_function.assert_any_call(some_mocked_return_value)

When the output of one function is used as a parameter to another, and we don’t particularly care about the details of what is concerned we can just pass the executed function, e.g:

my_mocked_function.assert_any_call(preceding_function())

call_args_list

Get a list of all the arguments that a mock object was called with during the test.

call_args_list is useful when you want to check the arguments that a mock object was called with during the test, especially if the mock object was called multiple times with different arguments. You can use it to inspect the arguments of each call and make assertions based on them.

second_my_mocked_function_call = my_mocked_function.call_args_list[1]

# check the first argument of the second call:

assert second_my_mocked_function_call[0][0] == "expected arg"

side_effect

Use to trigger a side effect when returning a value from a mock. Most useful for mocking exceptions.

my_mocked_function.side_effect = Exception("Some exception raised")

Testing exceptions with raises

Testing exceptions is quite straightforward. You can use the raises helper provided by pytest, and combine this with excinfo (“exception info”) to inspect the exception message.

if POCKET_LAMBDA_ENDPOINT is None:
    raise ValueError(
        "Error: POCKET_LAMBDA_ENDPOINT environment variable is not set"
    )

Then to test this, we would use pytest’s excinfo fixture along with raises:

 with pytest.raises(ValueError) as excinfo:  # Watch for the ValueError
    get_articles("some_type")

    assert "Error: POCKET_LAMBDA_ENDPOINT environment variable is not set" in str(
        excinfo.value
    )

We could actually simplify the above test by using the match parameter with raise. This way we do not need the separate assertion:

with pytest.raises(ValueError, match="Error: POCKET_LAMBDA_ENDPOINT environment variable is not set"):
    get_articles("some_type")

Note that excinfo is best used for testing the exception text that you the developer explicitly raise. For exceptions tha may occur naturaly in the code you are testing, you should use caplog or capsys (see below).

Before-each and after-each

When testing functions, we achieve this in Python using setup_function and teardown_function methods. These methods are called before and after each test method respectively.

To apply a “before each” to every test just put your setup function and/or teardown function at the top level of your test module.

For example, below we set and remove an env var before and after each test:

@pytest.fixture(scope="function") # specify that this fixture should be run before each function test
def setup_function():
    print("Setting up test environment...")
    os.environ["POCKET_LAMBDA_ENDPOINT"] = "https://some_endpoint.com/{article_type}"


def teardown_function():
    print("Tearing down test environment...")
    del os.environ["POCKET_LAMBDA_ENDPOINT"]

If the setup/teardown should only be applied to a subset of tests, just pass the name of the fixture as a parameter to the test function:

def some_function(setup_function):
    # setup_function will be run before this test

You don’t need to use the names setup_function and teardown_function so long as you are passing the fixture as a parameter.

You can also use yield to combine the setup and teardown into a single function:

@pytest.fixture(scope="function")
def setup_function():
    os.environ["POCKET_LAMBDA_ENDPOINT"] = "https://some_endpoint.com/{article_type}"
    yield
    del os.environ["POCKET_LAMBDA_ENDPOINT"]

Another example:

The following test suite uses the same three mocked functions in every test. The following set-up assigns the mocks before each test and resets after each individual test has run:

@pytest.fixture(scope="function")
def setup_function():
    with patch("app.get_articles") as mock_get_articles, patch(
        "app.process_articles"
    ) as mock_process_articles, patch("app.update_worksheet") as mock_update_worksheet:
        yield mock_get_articles, mock_process_articles, mock_update_worksheet

Then to use:

def individual_test(setup_function):
    mock_get_articles, mock_process_articles, mock_update_worksheet = setup_function
    # Now each mock can be referenced using the vars above

Parameterized tests

For a sequence of tests that are repetitive, to avoid repeating the same code over and over again, we can use parameterized tests. This is where we pass in a list of parameters to the test function and the test function is run once for each parameter.

For example, in the function below I am handling numerous possible Exceptions that could be raised by the requests.get method:

    try:
        response = requests.get(endpoint)
        response.raise_for_status()
        return response.json()

    except HTTPError as http_err:
        logging.error(f"HTTP Error occurred: {http_err}")

    except ConnectionError as conn_err:
        logging.error(f"Connection Error occurred: {conn_err}")

    except Timeout as timeout_err:
        logging.error(f"Timeout Error occurred: {timeout_err}")

    except RequestException as e:
        logging.error(f"Request Exception occurred: {e}")

    return None

Instead of writing something like the following for each of the four exceptions:


 def test_exception_generic(caplog):
    with patch("requests.get", side_effect=RequestException("Some error")):
        result = get_articles("some_type")

    assert "Request Exception occurred" in caplog.text
    assert result is None

I could parameterize like so:

@pytest.mark.parametrize(
    "exception_type, log_message",
    [
        (RequestException, "Request Exception occurred: "),
        (HTTPError, "HTTP Error occurred: "),
        (Timeout, "Timeout Error occurred: "),
        (ConnectionError, "Connection Error occurred: "),
    ],
)
def test_exceptions(caplog, exception_type, log_message):
    with patch("requests.get", side_effect=exception_type("Some error")):
        result = get_articles("some_type")

    assert log_message in caplog.text
    assert result is None

Caplog, syslog, excinfo

caplog and capsys are built-in pytest fixtures. caplog lets you test log messages. capsys lets you test stdout and stderr. As such they are very useful when testing that error messages are logged correctly.

caplog

In our example, if the endpoing environment is not set, we log an error message. We can test that this message is logged correctly using caplog:

def test_no_endpoint_env_var(caplog):
    os.environ.pop("POCKET_LAMBDA_ENDPOINT", None)  # Remove env variable if it exists

    with caplog.at_level(logging.ERROR):
        result = get_articles("some_type")

    assert (
        "Error: POCKET_LAMBDA_ENDPOINT environment variable is not set" in caplog.text
    )
    assert result is None

Note tha we pass in caplog as a parameter to the test function. This is how pytest knows to use it as a fixture.

capsys

In our example, if the request is unsuccessful, we log an error message with print rather than logging. We can test that this message is printed correctly using capsys to check the stdout:

def test_http_error(capsys):
    with patch("requests.get") as mock_get:
        mock_get.return_value = Mock(ok=False, status_code=404)

        # Raise an HTTP error when raise_for_status is called
        mock_get.return_value.raise_for_status.side_effect = RequestException()

        result = get_articles("some_type")

    captured = capsys.readouterr()
    assert "An error occurred" in captured.out

excinfo