quality estimation

Optimizing Large Language Models for Low-resource Quality Estimation

Large Language Models (LLMs) are positioned as generalist models often claiming superlative performance on many Natural Language Processing (NLP) tasks. However, they tend to fail at Quality Estimation (QE) of Machine Translation (MT), particularly …

Findings of the WMT25 Shared Task on Automated Translation Evaluation Systems: Linguistic Diversity is Challenging and References Still Help

The WMT25 Shared Task on Automated Translation Evaluation Systems evaluates metrics and quality estimation systems that assess the quality of language translation systems. This task unifies and consolidates the separate WMT shared tasks on Machine …

ALOPE: Adaptive Layer Optimization for Translation Quality Estimation using Large Language Models

Large Language Models (LLMs) have shown remarkable performance across a wide range of natural language processing tasks. Quality Estimation (QE) for Machine Translation (MT), which assesses the quality of a source-MT pair without relying on reference …

Reference-Less Evaluation of Machine Translation: Navigating Through the Resource-Scarce Scenarios

Reference-less evaluation of machine translation, or Quality Estimation (QE), is vital for low-resource language pairs where high-quality references are often unavailable. In this study, we investigate segment-level QE methods comparing encoder-based …

ALOPE: Adaptive Layer Optimization for Translation Quality Estimation using Large Language Models

Prompt-based Explainable Quality Estimation for English-Malayalam

The aim of this project was to curate data for the English-Malayalam language pair for the tasks of Quality Estimation (QE) and Automatic Post-Editing (APE) of Machine Translation. Whilst the primary aim of the project was to create a dataset for a …

Automatically Generating Chinese Homophone Words to Probe Machine Translation Estimation Systems

Evaluating machine translation (MT) of user-generated content (UGC) involves unique challenges such as checking whether the nuance of emotions from the source are preserved in the target text. Recent studies have proposed emotion-related datasets, …

Giving the Old a Fresh Spin: Quality Estimation-Assisted Constrained Decoding for Automatic Post-Editing

Automatic Post-Editing (APE) systems often struggle with over-correction, where unnecessary modifications are made to a translation, diverging from the principle of minimal editing. In this paper, we propose a novel technique to mitigate …

When LLMs Struggle: Reference-less Translation Evaluation for Low-resource Languages

This paper investigates the reference-less evaluation of machine translation for low-resource language pairs, known as quality estimation (QE). Segment-level QE is a challenging cross-lingual language understanding task that provides a quality score …

Optimizing Large Language Models for Low-resource Quality Estimation

Findings of the WMT25 Shared Task on Automated Translation Evaluation Systems: Linguistic Diversity is Challenging and References Still Help

ALOPE: Adaptive Layer Optimization for Translation Quality Estimation using Large Language Models

Reference-Less Evaluation of Machine Translation: Navigating Through the Resource-Scarce Scenarios

ALOPE: Adaptive Layer Optimization for Translation Quality Estimation using Large Language Models

Prompt-based Explainable Quality Estimation for English-Malayalam

Automatically Generating Chinese Homophone Words to Probe Machine Translation Estimation Systems

Giving the Old a Fresh Spin: Quality Estimation-Assisted Constrained Decoding for Automatic Post-Editing

When LLMs Struggle: Reference-less Translation Evaluation for Low-resource Languages

Are Large Language Models State-of-the-art Quality Estimators for Machine Translation of User-generated Content?

Findings of the Quality Estimation Shared Task at WMT 2024: Are LLMs Closing the Gap in QE?

Optimizing Quality Estimation for Low-Resource Language Translations: Exploring the Role of Language Relatedness