Saudi Learner Translation Corpus: The design and compilation of an English-Arabic learner translation corpus.


Journal

PloS one
ISSN: 1932-6203
Titre abrégé: PLoS One
Pays: United States
ID NLM: 101285081

Informations de publication

Date de publication:
2024
Historique:
received: 30 04 2024
accepted: 08 09 2024
medline: 23 10 2024
pubmed: 23 10 2024
entrez: 23 10 2024
Statut: epublish

Résumé

This article introduces the Saudi Learner Translation Corpus (SauLTC), an innovative multi-version English-Arabic parallel corpus featuring part-of-speech tagging. We describe the corpus parameters and compilation process and explain how textual processing and sentence alignment are conducted. The participants include 366 student translators, 48 instructors, and 23 alignment verifiers. The corpus provides access to two target versions of every ST to allow the detection of the changes in the translation and revision processes from the initial to the final draft. The translations were collected over three years, yielding 5,160,386 tokens. The metadata of 23 sentence alignment verifiers were added to the analysis as a unique variable to investigate individual differences in the manual verification process. This unidirectional corpus can be used to identify student translators' strategies and errors in translation and analyze the efficacy of instructors' feedback. Furthermore, it is accessible via an application and a website. It provides translation teachers and researchers with a database that can help develop corpus-based and corpus-driven teaching materials.

Identifiants

pubmed: 39441857
doi: 10.1371/journal.pone.0303729
pii: PONE-D-24-05899
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

e0303729

Informations de copyright

Copyright: © 2024 Al-Harthi et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Déclaration de conflit d'intérêts

The authors have declared that no competing interests exist.

Auteurs

Maha Al-Harthi (M)

Department of Applied Linguistics, College of Languages, Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia.

Amal Alsaif (A)

College of Computer and Information Sciences, Al-Imam Mohammad Ibn Saud Islamic University, Riyadh, Saudi Arabia.

Eman Al-Nafjan (E)

Department of English, King Saud bin Abdulaziz University for Health Sciences, Riyadh, Saudi Arabia.

Fatma Alshihri (F)

Department of Translation, College of Languages, Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia.

Mahmoud Saleh (M)

Department of Translation, College of Languages and Translation, King Saud University, Riyadh, Saudi Arabia.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH