Machine Translation at South Tyrolean Institutions (pilot study)

  • Project duration: -
  • Project status: ongoing

In this project, we aim to conduct a preliminary study on machine translation (MT) at South Tyrolean institutions. Local institutions generally do not use customized MT software, yet they have a high demand for translation, texts that are to some extent repetitive and structured, and specific needs related to the local variety of German. Customized MT could therefore improve their translation process, especially since free online MT systems are already part of their toolbox.

Against this background, our first aim is to build aligned parallel corpora from local institutional documents, to be used as training data for an MT engine at a later stage. This requires a large number of parallel segments, ideally from several hundred thousand up to millions. To build the corpora, we either obtain already aligned texts (i.e. translation memories) from the institutions or collect the texts and align them ourselves. In both cases, we contact the institutions and seek their collaboration.
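As an illustration of how translation memories could be turned into parallel segments, the following minimal Python sketch reads a TMX file and extracts deduplicated segment pairs. It rests on several assumptions that the project description does not state: that the translation memories are delivered in TMX format, that the direction is Italian to German, and that languages can be matched on the first two letters of the language code. The function name `extract_pairs` and the sample segments are hypothetical.

```python
import xml.etree.ElementTree as ET

# xml:lang lives in the predefined XML namespace; some older TMX files
# use a plain "lang" attribute instead, so both are checked below.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"

# Invented sample data for illustration only (not taken from real decrees).
SAMPLE_TMX = """<tmx version="1.4">
  <header srclang="it" datatype="plaintext" segtype="sentence"
          adminlang="en" creationtool="example" creationtoolversion="1"
          o-tmf="none"/>
  <body>
    <tu>
      <tuv xml:lang="it"><seg>Il presente decreto entra in vigore oggi.</seg></tuv>
      <tuv xml:lang="de"><seg>Dieses Dekret tritt heute in Kraft.</seg></tuv>
    </tu>
    <tu>
      <tuv xml:lang="it"><seg>Articolo 1</seg></tuv>
      <tuv xml:lang="de"><seg>Artikel 1</seg></tuv>
    </tu>
    <tu>
      <tuv xml:lang="it"><seg>Articolo 1</seg></tuv>
      <tuv xml:lang="de"><seg>Artikel 1</seg></tuv>
    </tu>
    <tu>
      <tuv xml:lang="it"><seg>Segmento senza traduzione</seg></tuv>
    </tu>
  </body>
</tmx>"""


def extract_pairs(tmx_text, src="it", tgt="de"):
    """Return unique (source, target) segment pairs from a TMX string."""
    root = ET.fromstring(tmx_text)
    pairs, seen = [], set()
    for tu in root.iter("tu"):
        segs = {}
        for tuv in tu.findall("tuv"):
            lang = (tuv.get(XML_LANG) or tuv.get("lang") or "").lower()
            seg = tuv.find("seg")
            if seg is not None:
                # Join text across inline tags and normalize whitespace.
                text = " ".join("".join(seg.itertext()).split())
                # Simplification: "de-IT" and "de" are both treated as "de".
                segs[lang[:2]] = text
        pair = (segs.get(src, ""), segs.get(tgt, ""))
        # Skip units with a missing side and exact duplicate pairs.
        if all(pair) and pair not in seen:
            seen.add(pair)
            pairs.append(pair)
    return pairs


pairs = extract_pairs(SAMPLE_TMX)
for source, target in pairs:
    print(f"{source}\t{target}")
```

Texts that are not yet aligned would need a separate sentence-alignment step before they could be processed this way.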

Secondly, we aim to evaluate the performance of one customized MT system when translating local institutional texts. To do so, we will carry out both an automatic and a manual quality assessment. State-of-the-art automatic metrics, such as BLEU, chrF and TER, give a preliminary overview of the quality of the machine translation. However, a fine-grained manual analysis of the errors sheds much more light on the weaknesses of the selected system. For this reason, we plan to annotate a corpus of machine-translated decrees from the Autonomous Province of Bolzano for accuracy and fluency errors, e.g. mistranslations, omissions, punctuation and gender errors.
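To give an idea of what an automatic metric measures, the sketch below computes a simplified, sentence-level character n-gram F-score in the spirit of chrF. It is not the reference chrF implementation and not necessarily the tooling used in the project: it ignores whitespace, skips n-gram orders longer than either string, and uses a plain average over orders. The function names and the example sentences are hypothetical; an actual evaluation would more likely rely on an established metrics library.

```python
from collections import Counter


def char_ngrams(text, n):
    """Count character n-grams, ignoring whitespace (a simplification)."""
    text = text.replace(" ", "")
    return Counter(text[i:i + n] for i in range(len(text) - n + 1))


def simple_chrf(hypothesis, reference, max_n=6, beta=2.0):
    """Simplified chrF-style score in [0, 1]; beta > 1 favours recall."""
    precisions, recalls = [], []
    for n in range(1, max_n + 1):
        hyp = char_ngrams(hypothesis, n)
        ref = char_ngrams(reference, n)
        if not hyp or not ref:
            continue  # one string is shorter than n characters
        overlap = sum((hyp & ref).values())  # clipped n-gram matches
        precisions.append(overlap / sum(hyp.values()))
        recalls.append(overlap / sum(ref.values()))
    if not precisions:
        return 0.0
    p = sum(precisions) / len(precisions)
    r = sum(recalls) / len(recalls)
    if p + r == 0:
        return 0.0
    return (1 + beta ** 2) * p * r / (beta ** 2 * p + r)


# Invented example: the second hypothesis uses a different noun.
reference = "Dieses Dekret tritt heute in Kraft."
exact = simple_chrf("Dieses Dekret tritt heute in Kraft.", reference)
partial = simple_chrf("Dieser Beschluss tritt heute in Kraft.", reference)
print(exact, partial)
```

A score of this kind only reflects surface overlap with a reference translation, which is why it cannot replace the manual error annotation described above.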

At the end of this pilot study, we expect to have gained the preliminary knowledge needed to define a full research project on machine translation for South Tyrolean institutions, which we aim to submit to international funding programs (e.g. Interreg).

Diseño de un curso de formación para traductores no profesionales en un marco institucional: la administración de Tirol del Sur
De Camillis F (2023)

Conference: 8º Congreso Internacional sobre Traducción e Interpretación en los Servicios Públicos (TISP8/PSIT8) | Alcalá de Henares, Madrid | 22.3.2023 - 24.3.2023

MT for NPT: ethical dilemmas in an institutional setting
De Camillis F (2023)

Conference: 6th International Conference on Non-Professional Interpreting and Translation | Nicosia | 25.5.2023 - 27.5.2023

Entre ámbito y variedad: peculiaridades de un corpus de decretos traducidos automáticamente
De Camillis F, Chiocchetti E (2023)

Conference: 14th International Conference on Corpus Linguistics (CILC2023) | Oviedo | 10.5.2023 - 12.5.2023

Fondamenti di traduzione: trucchi e consigli pratici / Tipps & Tricks beim Übersetzen
Chiocchetti E, Ralli N (2023)

Conference: Fondamenti di traduzione: trucchi e consigli pratici / Tipps & Tricks beim Übersetzen | Bolzano | 2.2.2023 - 9.2.2023

How we address research questions and survive as researchers
Chiocchetti E, De Camillis F (2022)

Conference: Profession-Based Research course, University of Bologna/Forlì | Forlì | 11.4.2022 - 11.4.2022

Adapting machine translation for under-resourced languages: a first attempt for institutional German in South Tyrol
De Camillis F, Contarino AG (2021)

Conference: PaCor 2021 | Vitoria-Gasteiz | 23.6.2021 - 25.6.2021

Anpassungsbedarf aus der Sicht einer etwas besonderen Nutzergruppe
Chiocchetti E (2019)

Conference: Workshop NMT-Diagnose | Hildesheim | 12.12.2019 - 12.12.2019
