User-generated content

A Multi-task Learning Framework for Evaluating Machine Translation of Emotion-loaded User-generated Content

Machine translation (MT) of user-generated content (UGC) poses unique challenges, including handling slang, emotion, and literary devices like irony and sarcasm. Evaluating the quality of these translations is difficult, as current metrics do not …

Are Large Language Models State-of-the-art Quality Estimators for Machine Translation of User-generated Content?

This paper investigates whether large language models (LLMs) are state-of-the-art quality estimators for machine translation of user-generated content (UGC) that contains emotional expressions, without the use of reference translations. To achieve …

Evaluating Machine Translation for Emotion-loaded User Generated Content (TransEval4Emo-UGC)

This paper presents a dataset for evaluating the machine translation of emotion-loaded user-generated content. It contains human-annotated quality evaluation data and post-edited reference translations. The dataset is available at our GitHub …