• Home
  • Browse
    • Current Issue
    • By Issue
    • By Author
    • By Subject
    • Author Index
    • Keyword Index
  • Journal Info
    • About Journal
    • Aims and Scope
    • Editorial Board
    • Publication Ethics
    • Peer Review Process
  • Guide for Authors
  • Submit Manuscript
  • Contact Us
 
  • Login
  • Register
Home Articles List Article Information
  • Save Records
  • |
  • Printable Version
  • |
  • Recommend
  • |
  • How to cite Export to
    RIS EndNote BibTeX APA MLA Harvard Vancouver
  • |
  • Share Share
    CiteULike Mendeley Facebook Google LinkedIn Twitter
CDELT Occasional Papers in the Development of English Education
arrow Articles in Press
arrow Current Issue
Journal Archive
Volume Volume 89 (2025)
Volume Volume 88 (2024)
Volume Volume 87 (2024)
Volume Volume 86 (2024)
Volume Volume 85 (2024)
Volume Volume 84 (2023)
Volume Volume 83 (2023)
Volume Volume 82 (2023)
Volume Volume 81 (2023)
Volume Volume 80 (2022)
Volume Volume 79 (2022)
Volume Volume 78 (2022)
Volume Volume 77 (2022)
Volume Volume 76 (2021)
Volume Volume 75 (2021)
Issue Issue 1
Volume Volume 74 (2021)
Volume Volume 73 (2021)
Volume Volume 72 (2020)
Volume Volume 71 (2020)
Volume Volume 70 (2020)
Volume Volume 69 (2020)
Volume Volume 68 (2019)
Volume Volume 67 (2019)
Volume Volume 66 (2019)
Volume Volume 65 (2018)
Volume Volume 64 (2018)
Volume Volume 63 (2017)
Volume Volume 62 (2016)
Volume Volume 61 (2016)
Volume Volume 60 (2015)
Volume Volume 59 (2015)
Diab, N. (2021). Out of the BLEU: An Error Analysis of Statistical and Neural Machine Translation of WikiHow Articles from English into Arabic. CDELT Occasional Papers in the Development of English Education, 75(1), 181-211. doi: 10.21608/opde.2021.208437
Nessma Diab. "Out of the BLEU: An Error Analysis of Statistical and Neural Machine Translation of WikiHow Articles from English into Arabic". CDELT Occasional Papers in the Development of English Education, 75, 1, 2021, 181-211. doi: 10.21608/opde.2021.208437
Diab, N. (2021). 'Out of the BLEU: An Error Analysis of Statistical and Neural Machine Translation of WikiHow Articles from English into Arabic', CDELT Occasional Papers in the Development of English Education, 75(1), pp. 181-211. doi: 10.21608/opde.2021.208437
Diab, N. Out of the BLEU: An Error Analysis of Statistical and Neural Machine Translation of WikiHow Articles from English into Arabic. CDELT Occasional Papers in the Development of English Education, 2021; 75(1): 181-211. doi: 10.21608/opde.2021.208437

Out of the BLEU: An Error Analysis of Statistical and Neural Machine Translation of WikiHow Articles from English into Arabic

Article 8, Volume 75, Issue 1, July 2021, Page 181-211  XML PDF (1.32 MB)
Document Type: Original Article
DOI: 10.21608/opde.2021.208437
View on SCiNiTO View on SCiNiTO
Author
Nessma Diab
Abstract
Most studies that compare the quality of Neural Machine Translation (NMT) to that of Statistical Machine Translation (SMT) rely on automatic evaluation methods, mainly the bilingual evaluation understudy (BLEU), without performing any kind of human assessment. While BLEU is a good indicator of the overall performance of MT systems, it does not offer any detailed linguistic insights into the types of errors generated by those MT models. Such insights are crucial for researchers to identify areas for improvement and for language service providers to understand how upgrading to NMT gives them better results. This paper breaks free from BLEU by conducting an error analysis that compares the performance of Google SMT and NMT engines for English-into-Arabic translation. The corpus consists of six WikiHow articles. The analysis is guided by the DQF-MQM Harmonized Error Typology which classifies translation errors into eight major categories, namely, accuracy, fluency, terminology, style, design, locale convention, verity and other (for any other issues). A fine-grained classification of translation errors as such enables the researcher to explore the error types generated by each MT model, the error types eliminated by NMT, and the new error types introduced by NMT. The paper focuses on the English-Arabic language pair because it is one of the least studied pairs in the comparative literature of SMT and NMT. The results show that NMT generates less grammatical errors and mistranslations than SMT. NMT output is more fluent and robust. However, SMT is more consistent with translating proper nouns and out-of-vocabulary words.
Keywords
DQF-MQM harmonized error typology; neural machine translation; statistical machine translation; translation quality assessment
Statistics
Article View: 649
PDF Download: 1,096
Home | Glossary | News | Aims and Scope | Sitemap
Top Top

Journal Management System. Designed by NotionWave.