Exams

bocoel.core.exams

The exams module provides functionality for creating and managing exams. Here, an exam measures how well the corpus or the model performs on a given task.

The module provides the following functionality:

  • Examinators are responsible for launching exams.
  • Exams are tests that take in the accumulated history of a model / corpus run and return a score.
  • Managers are responsible for managing results across runs.
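
The sketch below ties these pieces together on a toy history. It assumes that index is an already-built bocoel Index (for example corpus.index); the preset accumulation exams only inspect the result values.

from collections import OrderedDict

from bocoel import Examinator

# Hypothetical toy history: optimizer step -> score, in insertion order.
history = OrderedDict({41: 0.2, 7: 0.9, 13: 0.5})

examinator = Examinator.presets()
frame = examinator.examine(index=index, results=history)
# One row per step: step_idx, original, acc_min, acc_max, acc_avg.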

bocoel.Examinator

Examinator(exams: Mapping[str, Exam])

The examinator is responsible for launching exams. Examinators take in an index and the results of an optimizer run, and return a DataFrame of scores over the optimizer's accumulated history.

Source code in src/bocoel/core/exams/examinators.py
def __init__(self, exams: Mapping[str, Exam]) -> None:
    self.exams = exams

examine

examine(index: Index, results: OrderedDict[int, float]) -> DataFrame

Perform the exams on the results. This method looks up results in the index and runs the exams on the results.

Parameters:

Name Type Description Default
index Index

The index of the results.

required
results OrderedDict[int, float]

The results.

required

Returns:

Type Description
DataFrame

The scores of the exams.

TODO

Run the different exams in parallel. Currently the exams are run sequentially and can be slow.

Source code in src/bocoel/core/exams/examinators.py
def examine(self, index: Index, results: OrderedDict[int, float]) -> DataFrame:
    """
    Perform the exams on the results.
    This method looks up results in the index and runs the exams on the results.

    Parameters:
        index: The index of the results.
        results: The results.

    Returns:
        The scores of the exams.

    TODO:
        Run the different exams in parallel.
        Currently the exams are run sequentially and can be slow.
    """

    scores = {k: v.run(index, results) for k, v in self.exams.items()}
    original = {
        exams.STEP_IDX: list(range(len(results))),
        exams.ORIGINAL: list(results.values()),
    }
    return DataFrame.from_dict({**original, **scores})

presets classmethod

presets() -> Self

Returns:

Type Description
Self

The default examinator.

Source code in src/bocoel/core/exams/examinators.py
@classmethod
def presets(cls) -> Self:
    """
    Returns:
        The default examinator.
    """

    return cls(
        {
            exams.ACC_MIN: Accumulation(AccType.MIN),
            exams.ACC_MAX: Accumulation(AccType.MAX),
            exams.ACC_AVG: Accumulation(AccType.AVG),
        }
    )
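
For reference, a sketch of the equivalent explicit construction, spelled out with the literal column names documented in bocoel.core.exams.columns below:

from bocoel import AccType, Accumulation, Examinator

# Equivalent to Examinator.presets().
examinator = Examinator(
    {
        "acc_min": Accumulation(AccType.MIN),
        "acc_max": Accumulation(AccType.MAX),
        "acc_avg": Accumulation(AccType.AVG),
    }
)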

bocoel.Exam

Bases: Protocol

Exams are designed to evaluate the performance of a particular index, using a particular set of results generated by the optimizer.

run

run(index: Index, results: OrderedDict[int, float]) -> NDArray

Run the exam on the given index and results.

Parameters:

Name Type Description Default
index Index

The index to evaluate.

required
results OrderedDict[int, float]

The results generated by the optimizer.

required

Returns:

Type Description
NDArray

The scores for each entry in the index. The length must match the length of the results.

Source code in src/bocoel/core/exams/interfaces.py
def run(self, index: Index, results: OrderedDict[int, float]) -> NDArray:
    """
    Run the exam on the given index and results.

    Parameters:
        index: The index to evaluate.
        results: The results generated by the optimizer.

    Returns:
        The scores for each entry in the index. The length must match the length of the results.
    """

    outcome = self._run(index=index, results=results)

    if len(outcome) != len(results):
        raise ValueError(
            f"Length of outcome ({len(outcome)}) must be the same as "
            f"the length of results ({len(results)})"
        )

    return outcome

_run abstractmethod

_run(index: Index, results: OrderedDict[int, float]) -> NDArray

Run the exam on the given index and results.

Parameters:

Name Type Description Default
index Index

The index to evaluate.

required
results OrderedDict[int, float]

The results generated by the optimizer.

required

Returns:

Type Description
NDArray

The scores for each entry in the index. The length must match the length of the results.

Source code in src/bocoel/core/exams/interfaces.py
@abc.abstractmethod
def _run(self, index: Index, results: OrderedDict[int, float]) -> NDArray:
    """
    Run the exam on the given index and results.

    Parameters:
        index: The index to evaluate.
        results: The results generated by the optimizer.

    Returns:
        The scores for each entry in the index. The length must match the length of the results.
    """

    ...
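
A sketch of a custom exam: only _run needs to be provided, since the concrete run method adds the length check. The RunningRange class and its scoring logic are illustrative, not part of the library.

from collections import OrderedDict

import numpy as np
from numpy.typing import NDArray

from bocoel import Exam


class RunningRange(Exam):
    "Hypothetical exam: running max minus running min of the raw scores."

    def _run(self, index, results: OrderedDict[int, float]) -> NDArray:
        # index is unused here; real exams may look entries up in it.
        values = np.array(list(results.values()))
        # One score per accumulated result, as run requires.
        return np.maximum.accumulate(values) - np.minimum.accumulate(values)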

bocoel.AccType

Bases: StrEnum

Accumulation type.

MIN class-attribute instance-attribute

MIN = 'MINIMUM'

Minimum value accumulation.

MAX class-attribute instance-attribute

MAX = 'MAXIMUM'

Maximum value accumulation.

AVG class-attribute instance-attribute

AVG = 'AVERAGE'

Average value accumulation.

bocoel.Accumulation

Accumulation(typ: AccType)

Bases: Exam

Accumulation is an exam that evaluates the running min / max / avg of the history.

Source code in src/bocoel/core/exams/stats/acc.py
def __init__(self, typ: AccType) -> None:
    self._acc_func: Callable[[NDArray], NDArray]
    match typ:
        case AccType.MIN:
            self._acc_func = np.minimum.accumulate
        case AccType.MAX:
            self._acc_func = np.maximum.accumulate
        case AccType.AVG:
            self._acc_func = lambda arr: np.cumsum(arr) / np.arange(1, arr.size + 1)
        case _:
            raise ValueError(f"Unknown accumulation type {typ}")
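
A quick illustration of the three accumulation functions on a toy array:

import numpy as np

arr = np.array([0.2, 0.9, 0.5])

print(np.minimum.accumulate(arr))                   # [0.2 0.2 0.2]
print(np.maximum.accumulate(arr))                   # [0.2 0.9 0.9]
print(np.cumsum(arr) / np.arange(1, arr.size + 1))  # [0.2 0.55 0.533...]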

run

run(index: Index, results: OrderedDict[int, float]) -> NDArray

Run the exam on the given index and results.

Parameters:

Name Type Description Default
index Index

The index to evaluate.

required
results OrderedDict[int, float]

The results generated by the optimizer.

required

Returns:

Type Description
NDArray

The scores for each entry in the index. The length must match the length of the results.

Source code in src/bocoel/core/exams/interfaces.py
def run(self, index: Index, results: OrderedDict[int, float]) -> NDArray:
    """
    Run the exam on the given index and results.

    Parameters:
        index: The index to evaluate.
        results: The results generated by the optimizer.

    Returns:
        The scores for each entry in the index. The length must match the length of the results.
    """

    outcome = self._run(index=index, results=results)

    if len(outcome) != len(results):
        raise ValueError(
            f"Length of outcome ({len(outcome)}) must be the same as "
            f"the length of results ({len(results)})"
        )

    return outcome

_acc staticmethod

_acc(array: NDArray, accumulate: Callable[[NDArray], NDArray]) -> NDArray

Accumulate the array using the given function.

Parameters:

Name Type Description Default
array NDArray

The array to accumulate.

required
accumulate Callable[[NDArray], NDArray]

The accumulation function to use.

required

Returns:

Type Description
NDArray

The accumulated array.

Raises:

Type Description
ValueError

If the array is not 1D.

Source code in src/bocoel/core/exams/stats/acc.py
@staticmethod
def _acc(array: NDArray, accumulate: Callable[[NDArray], NDArray]) -> NDArray:
    """
    Accumulate the array using the given function.

    Parameters:
        array: The array to accumulate.
        accumulate: The accumulation function to use.

    Returns:
        The accumulated array.

    Raises:
        ValueError: If the array is not 1D.
    """

    _check_dim(array, 1)
    result = accumulate(array)
    _check_dim(result, 1)
    return result

bocoel.Manager

Manager(root: str | Path | None = None, skip_rerun: bool = True)

The manager for running and saving evaluations.

Parameters:

Name Type Description Default
root str | Path | None

The path to save the scores to.

None
skip_rerun bool

Whether to skip rerunning the optimizer if the scores already exist.

True

Raises:

Type Description
ValueError

If the path is not a directory.

Source code in src/bocoel/core/exams/managers.py
def __init__(self, root: str | Path | None = None, skip_rerun: bool = True) -> None:
    """
    Parameters:
        root: The path to save the scores to.
        skip_rerun: Whether to skip rerunning the optimizer if the scores already exist.

    Raises:
        ValueError: If the path is not a directory.
    """

    if root is not None:
        root = Path(root)
        if root.exists() and not root.is_dir():
            raise ValueError(f"{root} is not a directory")
        root.mkdir(parents=True, exist_ok=True)

        # Prevent data from being tracked by git.
        gitignore = root / ".gitignore"
        if not gitignore.exists():
            with open(gitignore, "w+") as f:
                f.write("# Automatically generated by BoCoEL.\n*")

    self._start = self.current()
    self._examinator = Examinator.presets()

    # Public attributes. Can be overwritten at any time.
    self.root = root
    self.skip_rerun = skip_rerun
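
A minimal construction sketch; the directory (and the .gitignore inside it) is created on construction if it does not already exist:

from bocoel import Manager

manager = Manager(root="./bocoel-scores", skip_rerun=True)

# Without a root, scores are kept in memory only and never saved.
ephemeral = Manager()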

_examinator instance-attribute

_examinator: Examinator = presets()

The examinator that performs evaluations on the results.

run

run(
    steps: int | None = None,
    *,
    optimizer: Optimizer,
    embedder: Embedder,
    corpus: Corpus,
    model: GenerativeModel | ClassifierModel,
    adaptor: Adaptor
) -> DataFrame

Runs the optimizer until the end. If the root path is set in the constructor, the scores are saved to the path.

Parameters:

Name Type Description Default
optimizer Optimizer

The optimizer to run.

required
embedder Embedder

The embedder to run the optimizer with.

required
corpus Corpus

The corpus to run the optimizer on.

required
model GenerativeModel | ClassifierModel

The model to run the optimizer with.

required
adaptor Adaptor

The adaptor to run the optimizer with.

required
steps int | None

The number of steps to run the optimizer for.

None

Returns:

Type Description
DataFrame

The scores of the exams over the accumulated history of the run.

Source code in src/bocoel/core/exams/managers.py
def run(
    self,
    steps: int | None = None,
    *,
    optimizer: Optimizer,
    embedder: Embedder,
    corpus: Corpus,
    model: GenerativeModel | ClassifierModel,
    adaptor: Adaptor,
) -> DataFrame:
    """
    Runs the optimizer until the end.
    If the root path is set in the constructor,
    the scores are saved to the path.

    Parameters:
        optimizer: The optimizer to run.
        embedder: The embedder to run the optimizer with.
        corpus: The corpus to run the optimizer on.
        model: The model to run the optimizer with.
        adaptor: The adaptor to run the optimizer with.
        steps: The number of steps to run the optimizer for.

    Returns:
        The scores of the exams over the accumulated history of the run.
    """

    md5 = self.md5(
        optimizer=optimizer,
        embedder=embedder,
        corpus=corpus,
        model=model,
        adaptor=adaptor,
    )

    if self.skip_rerun and self.root is not None and (self.root / md5).exists():
        LOGGER.warning("Previous scores found. Skip", md5=md5)
        return self.load(self.root / md5)

    # Run the optimizer and collect the results.
    LOGGER.info("Running the optimizer", steps=steps)
    results: OrderedDict[int, float] = OrderedDict()
    for res in self._launch(optimizer=optimizer, steps=steps):
        results.update(res)

    # Examine the results.
    LOGGER.info("Examing the results")
    scores = self._examinator.examine(index=corpus.index, results=results)

    self.save(
        scores=scores,
        optimizer=optimizer,
        corpus=corpus,
        model=model,
        adaptor=adaptor,
        embedder=embedder,
        md5=md5,
    )

    return scores
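
A sketch of a full run. The optimizer, embedder, corpus, model and adaptor are assumed to be pre-built bocoel components; see their respective modules for construction:

from bocoel import Manager

manager = Manager(root="./bocoel-scores")

# With skip_rerun (the default), a rerun with identical components loads
# the previously saved scores instead of running the optimizer again.
scores = manager.run(
    steps=100,
    optimizer=optimizer,
    embedder=embedder,
    corpus=corpus,
    model=model,
    adaptor=adaptor,
)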

save

save(
    *,
    scores: DataFrame,
    optimizer: Optimizer,
    corpus: Corpus,
    model: GenerativeModel | ClassifierModel,
    adaptor: Adaptor,
    embedder: Embedder,
    md5: str
) -> None

Saves the scores to the path. If the root path is not set in the constructor, the scores are not saved.

Parameters:

Name Type Description Default
scores DataFrame

The scores to save.

required
optimizer Optimizer

The optimizer used to generate the scores.

required
corpus Corpus

The corpus used to generate the scores.

required
model GenerativeModel | ClassifierModel

The model used to generate the scores.

required
adaptor Adaptor

The adaptor used to generate the scores.

required
embedder Embedder

The embedder used to generate the scores.

required
md5 str

The md5 hash of the identifier columns.

required

Raises:

Type Description
ValueError

If the path is not set.

Source code in src/bocoel/core/exams/managers.py
def save(
    self,
    *,
    scores: DataFrame,
    optimizer: Optimizer,
    corpus: Corpus,
    model: GenerativeModel | ClassifierModel,
    adaptor: Adaptor,
    embedder: Embedder,
    md5: str,
) -> None:
    """
    Saves the scores to the path.
    If the root path is not set in the constructor, the scores are not saved.

    Parameters:
        scores: The scores to save.
        optimizer: The optimizer used to generate the scores.
        corpus: The corpus used to generate the scores.
        model: The model used to generate the scores.
        adaptor: The adaptor used to generate the scores.
        embedder: The embedder used to generate the scores.
        md5: The md5 hash of the identifier columns.

    Raises:
        ValueError: If the path is not set.
    """

    if self.root is None:
        LOGGER.warning("No path set to save the scores. Skip")
        return

    scores = self.with_cols(
        scores,
        {
            columns.OPTIMIZER: optimizer,
            columns.MODEL: model,
            columns.ADAPTOR: adaptor,
            columns.INDEX: corpus.index,
            columns.STORAGE: corpus.storage,
            columns.EMBEDDER: embedder,
            columns.TIME: self._start,
            columns.MD5: md5,
        },
    )

    (self.root / md5).mkdir(exist_ok=True)
    scores.to_csv(self.root / md5 / f"{self._start}.csv", index=False)

with_cols

with_cols(df: DataFrame, columns: dict[str, Any]) -> DataFrame

Adds identifier columns to the DataFrame.

Parameters:

Name Type Description Default
df DataFrame

The DataFrame to add the columns to.

required
columns dict[str, Any]

The columns to add to the DataFrame.

required

Returns:

Type Description
DataFrame

The DataFrame with the columns added.

Source code in src/bocoel/core/exams/managers.py
def with_cols(self, df: DataFrame, columns: dict[str, Any]) -> DataFrame:
    """
    Adds identifier columns to the DataFrame.

    Parameters:
        df: The DataFrame to add the columns to.
        columns: The columns to add to the DataFrame.

    Returns:
        The DataFrame with the columns added.
    """

    df = df.copy()

    for key, value in columns.items():
        df[key] = [str(value)] * len(df)

    return df
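
Since every value is stringified and broadcast across all rows, arbitrary objects can be passed as column values. A small runnable example:

import pandas as pd

from bocoel import Manager

manager = Manager()
df = pd.DataFrame({"original": [0.5, 0.7]})

# Adds constant "model" and "md5" columns alongside "original".
tagged = manager.with_cols(df, {"model": "my-model", "md5": "abc123"})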

_launch staticmethod

_launch(
    optimizer: Optimizer, steps: int | None = None
) -> Generator[Mapping[int, float], None, None]

Launches the optimizer as a generator.

Source code in src/bocoel/core/exams/managers.py
@staticmethod
def _launch(
    optimizer: Optimizer, steps: int | None = None
) -> Generator[Mapping[int, float], None, None]:
    "Launches the optimizer as a generator."

    steps_range = range(steps) if steps is not None else itertools.count()

    for _ in ap.alive_it(steps_range, title="Running the optimizer"):
        # Raises StopIteration (converted to RuntimeError per PEP 479) if done.
        try:
            results = optimizer.step()
        except StopIteration:
            break

        yield results

load staticmethod

load(path: str | Path) -> DataFrame

Loads the scores from the path.

Parameters:

Name Type Description Default
path str | Path

The path to load the scores from.

required

Returns:

Type Description
DataFrame

The loaded scores.

Raises:

Type Description
ValueError

If the path does not exist or is not a directory.

ValueError

If no csv files are found in the path.

Source code in src/bocoel/core/exams/managers.py
@staticmethod
def load(path: str | Path) -> DataFrame:
    """
    Loads the scores from the path.

    Parameters:
        path: The path to load the scores from.

    Returns:
        The loaded scores.

    Raises:
        ValueError: If the path does not exist or is not a directory.
        ValueError: If no csv files are found in the path.
    """

    # Iterate over all csv files in the path.
    dfs = [pd.read_csv(csv) for csv in Path(path).rglob("*.csv")]

    if not dfs:
        raise ValueError(f"No csv files found in {path}")

    return pd.concat(dfs)
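
A usage sketch. The per-run directory name is the md5 hash computed by Manager.md5; the hash below is purely illustrative:

from bocoel import Manager

scores = Manager.load("./bocoel-scores/d41d8cd98f00b204e9800998ecf8427e")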

md5 staticmethod

md5(
    *,
    optimizer: Optimizer,
    embedder: Embedder,
    corpus: Corpus,
    model: GenerativeModel | ClassifierModel,
    adaptor: Adaptor
) -> str

Generates an md5 hash from the given data.

Parameters:

Name Type Description Default
optimizer Optimizer

The optimizer used to generate the scores.

required
corpus Corpus

The corpus used to generate the scores.

required
model GenerativeModel | ClassifierModel

The model used to generate the scores.

required
adaptor Adaptor

The adaptor used to generate the scores.

required
embedder Embedder

The embedder used to generate the scores.

required

Returns:

Type Description
str

The md5 hash of the given data.

Source code in src/bocoel/core/exams/managers.py
@staticmethod
def md5(
    *,
    optimizer: Optimizer,
    embedder: Embedder,
    corpus: Corpus,
    model: GenerativeModel | ClassifierModel,
    adaptor: Adaptor,
) -> str:
    """
    Generates an md5 hash from the given data.

    Parameters:
        optimizer: The optimizer used to generate the scores.
        corpus: The corpus used to generate the scores.
        model: The model used to generate the scores.
        adaptor: The adaptor used to generate the scores.
        embedder: The embedder used to generate the scores.

    Returns:
        The md5 hash of the given data.
    """

    data = [optimizer, embedder, corpus.index, corpus.storage, model, adaptor]

    return hashlib.md5(
        str.encode(" ".join([str(item) for item in data]))
    ).hexdigest()

bocoel.core.exams.columns

This module contains the column names used in the manager DataFrames, which correspond to the different components and exams of the system.

components

TIME module-attribute

TIME = 'time'

Corresponds to the time at which the evaluation was performed.

INDEX module-attribute

INDEX = 'index'

Corresponds to the index.

STORAGE module-attribute

STORAGE = 'storage'

Corresponds to the storage.

EMBEDDER module-attribute

EMBEDDER = 'embedder'

Corresponds to the embedder.

OPTIMIZER module-attribute

OPTIMIZER = 'optimizer'

Corresponds to the optimizer.

MODEL module-attribute

MODEL = 'model'

Corresponds to the model.

ADAPTOR module-attribute

ADAPTOR = 'adaptor'

Corresponds to the adaptor.

MD5 module-attribute

MD5 = 'md5'

Corresponds to the MD5 hash of the evaluation. This is a hash over the major components: optimizer, embedder, index, storage, model, and adaptor.

exams

ORIGINAL module-attribute

ORIGINAL = 'original'

Corresponds to the original evaluation results, i.e. the raw values.

STEP_IDX module-attribute

STEP_IDX = 'step_idx'

Corresponds to the step index.

ACC_MIN module-attribute

ACC_MIN = 'acc_min'

Corresponds to the accumulated minimum of the scores.

ACC_MAX module-attribute

ACC_MAX = 'acc_max'

Corresponds to the accumulated maximum of the scores.

ACC_AVG module-attribute

ACC_AVG = 'acc_avg'

Corresponds to the accumulated average of the scores.

MST_MAX_EDGE_QUERY module-attribute

MST_MAX_EDGE_QUERY = 'mst_max_edge_query'

Corresponds to the query for the maximum edge of the minimum spanning tree.

MST_MAX_EDGE_DATA module-attribute

MST_MAX_EDGE_DATA = 'mst_max_edge_data'

Corresponds to the data for the maximum edge of the minimum spanning tree.

SEGREGATION module-attribute

SEGREGATION = 'segregation'

Corresponds to the number of unique clusters.
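
A sketch of selecting columns from a manager DataFrame using these constants. The components / exams import paths below mirror this page's layout and may differ in the actual package:

from bocoel.core.exams.columns import components, exams

# scores: a DataFrame produced by Manager.run (assumed to exist).
keep = [exams.STEP_IDX, exams.ORIGINAL, exams.ACC_AVG, components.MD5]
subset = scores[keep]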