aac_metrics.functional.cider_d module

class CIDErDScores

Bases: dict

cider_d: Tensor
cider_d(candidates: list[str], mult_references: list[list[str]], return_all_scores: bool = True, *, n: int = 4, sigma: float = 6.0, tokenizer: ~typing.Callable[[str], list[str]] = <method 'split' of 'str' objects>, return_tfidf: bool = False, scale: float = 10.0) tuple[CIDErDScores, CIDErDScores] | Tensor[source]

Consensus-based Image Description Evaluation function.

Warning

This metric requires at least 2 candidates with 2 sets of references, otherwise it will raises a ValueError.

Parameters:
  • candidates – The list of sentences to evaluate.

  • mult_references – The list of list of sentences used as target.

  • return_all_scores – If True, returns a tuple containing the globals and locals scores. Otherwise returns a scalar tensor containing the main global score. defaults to True.

  • n – Maximal number of n-grams taken into account. defaults to 4.

  • sigma – Standard deviation parameter used for gaussian penalty. defaults to 6.0.

  • tokenizer – The fast tokenizer used to split sentences into words. defaults to str.split.

  • return_tfidf – If True, returns the list of dictionaries containing the tf-idf scores of n-grams in the sents_score output. defaults to False.

  • scale – CIDEr-D score factor. defaults to 10.0.

Returns:

A tuple of globals and locals scores or a scalar tensor with the main global score.