aac_metrics.functional.evaluate module¶

Evaluate candidates with multiple references with the DCASE2023 Audio Captioning metrics.

Parameters:¶

candidates: list[str]¶: The list of sentences to evaluate.
mult_references: list[list[str]]¶: The list of list of sentences used as target.
preprocess: bool | Callable[[list[str]], list[str]] = True¶: If True, the candidates and references will be passed as input to the PTB stanford tokenizer before computing metrics. defaults to True.
cache_path: str | Path | None = None¶: The path to the external code directory. defaults to the value returned by get_default_cache_path().
java_path: str | Path | None = None¶: The path to the java executable. defaults to the value returned by get_default_java_path().
tmp_path: str | Path | None = None¶: Temporary directory path. defaults to the value returned by get_default_tmp_path().
device: str | device | None = 'cuda_if_available'¶: The PyTorch device used to run FENSE and SPIDErFL models. If None, it will try to detect use cuda if available. defaults to “cuda_if_available”.
verbose: int = 0¶: The verbose level. defaults to 0.

Returns:¶

A tuple contains the corpus and sentences scores.

Evaluate candidates with multiple references with the DCASE2024 Audio Captioning metrics.

Parameters:¶

candidates: list[str]¶: The list of sentences to evaluate.
mult_references: list[list[str]]¶: The list of list of sentences used as target.
preprocess: bool | Callable[[list[str]], list[str]] = True¶: If True, the candidates and references will be passed as input to the PTB stanford tokenizer before computing metrics. defaults to True.
cache_path: str | Path | None = None¶: The path to the external code directory. defaults to the value returned by get_default_cache_path().
java_path: str | Path | None = None¶: The path to the java executable. defaults to the value returned by get_default_java_path().
tmp_path: str | Path | None = None¶: Temporary directory path. defaults to the value returned by get_default_tmp_path().
device: str | device | None = 'cuda_if_available'¶: The PyTorch device used to run FENSE and SPIDErFL models. If None, it will try to detect use cuda if available. defaults to “cuda_if_available”.
verbose: int = 0¶: The verbose level. defaults to 0.

Returns:¶

A tuple contains the corpus and sentences scores.

Evaluate candidates with multiple references with custom metrics.

Parameters:¶

candidates: list[str]¶: The list of sentences to evaluate.
mult_references: list[list[str]]¶: The list of list of sentences used as target.
preprocess: bool | Callable[[list[str]], list[str]] = True¶: If True, the candidates and references will be passed as input to the PTB stanford tokenizer before computing metrics. defaults to True.
metrics: str | Iterable[str] | Iterable[Callable[[list, list], tuple]] = 'default'¶: The name of the metric list or the explicit list of metrics to compute. defaults to “default”.
cache_path: str | Path | None = None¶: The path to the external code directory. defaults to the value returned by get_default_cache_path().
java_path: str | Path | None = None¶: The path to the java executable. defaults to the value returned by get_default_java_path().
tmp_path: str | Path | None = None¶: Temporary directory path. defaults to the value returned by get_default_tmp_path().
device: str | device | None = 'cuda_if_available'¶: The PyTorch device used to run FENSE and SPIDErFL models. If None, it will try to detect use cuda if available. defaults to “cuda_if_available”.
verbose: int = 0¶: The verbose level. defaults to 0.

Returns:¶

A tuple contains the corpus and sentences scores.

get_argnames( fn: Callable, ) → list[str][source]¶: Get arguments names of a method, function or callable object.