`gap_` Keywords¶

This post discusses the effect of each gap_ keyword and how they can be used.

`gap_` Keyword Listing¶

We list the possible gap_ keywords below.

gap_expect: The expected output of the test case.
gap_expect_stdout: The expected stdout of the test case.
gap_hidden: Whether the test case is hidden.
gap_name: The name of the test case.
gap_extra_points: The extra credit of the test case.
gap_override_check: The custom equality check function.
gap_easy_context: Whether to use context directly in gap override tests.
gap_override_test: The custom test function.
gap_pre_hooks: The custom functions run before tests.
gap_post_hooks: The custom functions run after tests.
gap_description: The description of the test case.
gap_is_pipeline: Whether the test case is a pipeline.
gap_max_score: The max score of the test case. This and gap_weight cannot be specified as the same time. 
gap_weight: The weight of the test case. This and gap_max_score cannot be specified as the same time. .

How To Specify Them In `@test_case()` And `@test_cases`¶

In @test_case(), gap_ keywords are specified as ordinary keyword arguments. For example, @test_case(1, 2, gap_expect=3, gap_name="secret test", gap_max_score=5).
In @test_cases(), gap_ keywords are also specified as keyword arguments but accept one single value or a Sequence of values. For example, @test_cases.params([1,2], [3,4], gap_max_score=2, gap_name=["test 1", "test 2"]). When a single value is passed, it will be duplicated to every test cases. When a sequence is passed, the length of the sequence has to be the same as the number of test cases, and will be applied to each test case in order.

`gap_expect`¶

This serves as sanity check to the correctness of the defined problem. When specified in a test case, it's value equals the expected outcome of executing the test case. For example,

# add_num.py
@test_case(2, 2, gap_expect=3)
@test_case(1, 2, gap_expect=3)
@problem()
def add(a: int, b: int) -> int:
    return a + b

Using the command line, we invoke gapper check add_num.py ad we will see

❯ gapper check add_num.py
Test (1, 2) passed: True
Test (2, 2) passed: False
  result: 4
  expected result: 3
  output: None
  expected output: None

`gap_expect_stdout`¶

It is similar to gap_expect except it expects output from the stdout.

`gap_hidden`¶

It accepts True or False and indicates if the test case can be seen by student. For example,

@test_cases.param_iter(
    ([random.randint(100, 10000)] for _ in range(3)),
    gap_max_score=2,
    gap_hidden=True,
    gap_name="random hidden tests",
)

will make the entries not shown to students at all (but still visible from instructor/TA's panel)

`gap_name`¶

A custom name of the autograder beside showing the argument passed to the test. For example,

@test_cases.param_iter(
    ([random.randint(100, 10000)] for _ in range(3)),
    gap_max_score=2,
    gap_hidden=True,
    gap_name="random hidden tests",
)
@test_cases.param_iter(
    ([random.randint(100, 10000)] for _ in range(3)),
    gap_max_score=0,
    gap_name="random tests",
)

will produce

`gap_description`¶

Description of the test that will be shown to the students. For example, the following code will produce the result below.

@test_case(
    1, gap_max_score=1, gap_description="this is the test in the assignment handout"
)
@test_case(1234, gap_max_score=1, gap_description=["this test is ", "slightly longer"])

`gap_max_score`¶

The max score this test is worth. This cannot coexist with gap_weight.

`gap_weight`¶

The weight of the max score this test is worth. This cannot coexist with gap_max_score. The calculation of the max score is

max_score_of_the_test = gap_weight * (totoal_score_of_the_assignment - sum(gap_max_score in all tests)) / sum(gap_weight in all tests)

If both gap_weight and gap_max_score are not set, the test case will be assigned with a default weight of 1.

For example, suppose we create 4 test cases worth total 4 points.

@test_case(1, gap_description="this is the test in the assignment handout")
@test_case(1234, gap_weight=2, gap_description=["this test is ", "slightly longer"])
@test_case(3731, gap_weight=4)
@test_case(7)

will produce the following max score assignment

because, for the test case of (1,) and case of (7,), the score is calculated as

1 * (4 - 0) / (1 + 2 + 4 + 1) = 4 / 8 = 0.5

and similarly, the score assigned for the case of (1234,) is from

2 * (4 - 0) / (1 + 2 + 4 + 1) = 8 / 8 = 1

`gap_extra_points`¶

Extra points specified in a test is a number and will be granted to student if they passed to test. For example,

@test_case(random.randint(100000, 1000000), gap_max_score=0, gap_extra_points=5)

will result in 5 extra points when the student passes the test, shown as following:

`gap_override_check`¶

You can override tests' equality checks by passing a comparator function to gap_override_check keyword. The function should raise an AssertionError if the two values are not equal.

For example, suppose that you want to compare answers from students' submissions with the solution but do not care about ordering, you can pass gap_override_check=set_equality to @test_case() where set_equality is pre-defined in your script as

from gapper.core.types import CustomEqualityTestData


def set_equality(data: CustomEqualityTestData) -> None:
    assert set(data.expected) == set(data.actual)

`gap_override_test`¶

You can override entire test by passing a custom function to gap_override_test parameter, similar to override equality checks. For example, you not only want to check the answers, but also ensure the function is recursive. You can define custom_test as the following and pass it as gap_override_test=custom_test in your @test_case(). Note that you have to run the test and equality check by yourself, for the entire test process is overridden.

import ast
import inspect
from gapper import problem, test_case
from gapper.core.types import CustomTestData

from pytest import approx


def check_recursive_ast(fn):
    tree = ast.parse(inspect.getsource(fn))
    for node in ast.walk(tree):
        if isinstance(node, ast.Call):
            if isinstance(node.func, ast.Name):
                if node.func.id == fn.__name__:
                    return True
    return False


def custom_test(data: CustomTestData) -> None:
    soln_ans = data.solution(*data.args, **data.kwargs)
    subm_ans = data.submission(*data.args, **data.kwargs)

    data.case.assertEqual(soln_ans, subm_ans)  # equivalent to `assert soln_ans == subm_ans`

    # param.assertTrue(check_recursive_ast(submission))
    # equivalent to `assert check_recursive_ast(submission)`

    # you can even use pytest helper functions
    # the following line is dumb but just for demonstration
    assert soln_ans == approx(subm_ans, rel=1e-3)

    if not check_recursive_ast(data.submission):
        data.result_proxy.set_score(data.result_proxy.max_score // 2)


@test_case(10, gap_override_test=custom_test)
@problem()
def fib(n: int) -> int:
    ...

A overriding function show have the following positional parameter signature

from gapper.core.types import CustomTestData

class CustomTestFn(Protocol):
    def __call__[T](self, data: CustomTestData[T]) -> None:
        ...

`gap_pre_hooks`¶

gap_post_hooks are function(s) run before running the the student's answer and the solution.

You can setup the environment or use it to modify the test case before. For example,

from gapper import problem, test_case
from gapper.core.types import PreHookData
from tempfile import NamedTemporaryFile


def preparation(data: PreHookData) -> None:
    lines = data.args[0]
    # put lines into a temporary file
    with NamedTemporaryFile("w", delete=False) as infile:
        infile.write("\n".join(lines))

    # set the args to the filename passed the student's function
    data.param.args = (infile.name,)


def gen_lines(num_of_lines: int) -> list[str]:
    return [str(i) for i in range(num_of_lines)]


@test_case(gen_lines(10), gap_pre_hooks=preparation)
@problem()
def print_lines(filename: str) -> int:
    with open(filename, "r") as f:
        return sum(map(lambda line: int(line.strip()), f.readlines()), 0)

Note that, the preparation function should have a tear down phase to clean up the temporary file created, since the code above sets the automatic deletion to False. The hooks support yield syntax, which nicely separate a hook into two phases, setup and tear down. For example, the function preparation can be rewritten as

from typing import Generator
from tempfile import NamedTemporaryFile
from gapper.core.types import PreHookData


def preparation(data: PreHookData) -> Generator[None, None, None]:
    lines = data.args[0]
    # put lines into a temporary file
    with NamedTemporaryFile("w") as infile:
        infile.write("\n".join(lines))
        data.param.args = (infile.name,)

        # everything above yield will be run before the test case is tested
        yield 
        # everything below yield will be run after all tests of the test case are done
        # this means the code exists the with statement after testing the test case
        # on which the temporary file can be safely deleted

A pre hook function has to follow the following positional parameter signature

from gapper.core.types import PreHookData

class PostHookFn(Protocol):
    def __call__[T](self, data: PreHookData[T]) -> None:
        ...

`gap_post_hooks`¶

gap_post_hooks are function(s) run after running the student's answer and the solution, and comparing the results and stdout of the two.

Consider the situation in which you'd like to provide extra checks but not override the whole test. You can write custom check functions and pass it into gap_post_checks. For example, you'd like to check if the students' solutions are recursive, you can write

from gapper import problem, test_case
from gapper.core.types import PostHookData

import ast 
import inspect


def check_recursive_ast(fn):
    tree = ast.parse(inspect.getsource(fn))
    for node in ast.walk(tree):
        if isinstance(node, ast.Call):
            if isinstance(node.func, ast.Name):
                if node.func.id == fn.__name__:
                    return True
    return False


def recursive_check(data: PostHookData) -> None:
    if not check_recursive_ast(data.submission):
        data.result_proxy.set_score(data.result_proxy.max_score // 2)
        data.result_proxy.set_pass_status("failed")
        data.result_proxy.add_description(
            "Failed because recursive call not found in submission."
        )


@test_case(10, gap_post_hooks=recursive_check)
@problem()
def fib(n: int) -> int:
    ...

A post hook function has to follow the following positional parameter signature

from gapper.core.types import PostHookData

class PostHookFn(Protocol):
    def __call__[T](self, data: PostHookData[T]) -> None:
        ...

Note that post test hooks also support the yield syntax as stated in the gap_pre_hooks section.

`gap_pipeline`¶

The gap_pipeline keyword is invented to simulated a sequence of actions acting on an object. The object going into the pipeline might remain the same, be modified, or be swapped. It comes handy when testing classes and their instances' behaviors. For example, given a Car class,

class Car:
    def __init__(self, x: int, y: int, tank_size: float) -> None:
        self.x = x
        self.y = y
        self.tank_size = tank_size
        self.fuel = tank_size

    def drive_to(self, x: int, y: int) -> bool:
        dis = abs(x - self.x) + abs(y - self.y)
        if dis > self.fuel:
            return False
        else:
            self.fuel -= dis
            self.x = x
            self.y = y
            return True

    def refill(self) -> None:
        self.fuel = self.tank_size

    def get_fuel(self) -> float:
        return self.fuel

Using pipeline, we can (1) test creating instances with different parameters, (2) running some functions of the instances and check if their outputs match, (3) and checking if attributes and states match (but we recommend requiring student to create uniform interfaces (functions) and not check the properties directly).

from gapper import problem, test_case
from gapper.core.pipeline_support import Constructor, Function, Property

init = Constructor()
drive_to = Function("drive_to")
refill = Function("refill")
get_fuel = Function("get_fuel")
x = Property("x")
y = Property("y")
tank_size = Property("tank_size")


@test_case.pipeline(   # using `@test_case.pipeline` is equivalent to `@test_case(gap_pipeline=True)`
    init(0, 0, 100),
    tank_size,         # we recommend creating a uniform interface such as `get_tank_size()`
    x,                 # and not to check the attributes directly 
    y,
    drive_to(10, 10),
    get_fuel(),
    drive_to(100, 0),
    get_fuel(),
    refill(),
    drive_to(100, 0),
    get_fuel(),
)
@problem()
class Car:
    ...

Example Script¶

from gapper import problem, test_case, test_cases
import random


@test_case(random.randint(100000, 1000000), gap_max_score=0, gap_extra_points=5)
@test_cases.param_iter(
    ([random.randint(100, 10000)] for _ in range(3)),
    gap_max_score=2,
    gap_hidden=True,
    gap_name="random hidden tests",
)
@test_cases.param_iter(
    ([random.randint(100, 10000)] for _ in range(3)),
    gap_max_score=0,
    gap_name="random tests",
)
@test_case(1, gap_description="this is the test in the assignment handout")
@test_case(1234, gap_weight=2, gap_description=["this test is ", "slightly longer"])
@test_case(3731, gap_weight=4)
@test_case(7)
@problem(check_stdout=True)
def print_digits(n: int) -> None:
    print(n % 10)
    if n >= 10:
        print_digits(n // 10)

gap_ Keywords¶

gap_ Keyword Listing¶

How To Specify Them In @test_case() And @test_cases¶

gap_expect¶

gap_expect_stdout¶

gap_hidden¶

gap_name¶

gap_description¶

gap_max_score¶

gap_weight¶

gap_extra_points¶

gap_override_check¶

gap_override_test¶

gap_pre_hooks¶

gap_post_hooks¶

gap_pipeline¶