BLEU Performance
BLEU has frequently been reported as correlating well with human judgement,[1][2][3] and remains a benchmark for the assessment of any new evaluation metric. A number of criticisms have, however, been voiced. It has been noted that although …