There's more...
Evaluating the quality of generated text works similarly to evaluating labels. The Bilingual Evaluation Understudy (BLEU) score is a popular metric for comparing a generated translation of a piece of text to a reference translation and varies from 0 to 1. The closer the generated text is to the original text, the higher the score, with 1 being the score of a perfect match. Through the BLEU score metric, the n-grams of the candidate text are compared with the n-grams of the reference translation, along with the number of matches; these matches are also position-independent. Also, the n-grams matching is modified in such a way that it does not reward any such translation that generates only a few reasonable words. This technique ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Read now
Unlock full access