Korpus Südtirol (Corpus South Tyrol)
The Korpus Südtirol project aims to collect, archive, and process South Tyrolean texts with corpus-linguistic methods. It covers the German written in South Tyrol, e.g. in newspapers or in fiction; for the time being, it does not include transcriptions of local dialects, Italian or Ladin texts published in South Tyrol, or spoken language.
The project started in September 2005 in collaboration with the Faculty of Education of the Free University of Bozen-Bolzano (project leader: Prof. Dr. Johann Drumbl) and the Institut für Germanistik at the University of Innsbruck. It is funded by Interreg-III-A.
In the current stage of the project, we are preparing a feasibility study on the development of a comprehensive text corpus of South Tyrolean German. As a sample, texts of diverse origin and in various formats are converted into a standardised XML format. Data that do not yet exist electronically are digitised. We enrich the texts with relevant metadata and annotate them structurally and, at the word level, linguistically. A user-friendly internet application allows users to query and investigate the collected text data.
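The conversion step described above (plain text in, XML with metadata and word-level markup out) can be illustrated with a minimal sketch. It is not the project's actual pipeline or schema: the element names (`text`, `header`, `body`, `s`, `w`), the function `text_to_xml`, the naive regular-expression tokenisation, and the sample metadata are all assumptions chosen for illustration.

```python
import re
import xml.etree.ElementTree as ET


def text_to_xml(text, metadata):
    """Wrap plain text in a simple XML tree with metadata and word tokens.

    Hypothetical schema for illustration only; metadata keys must be
    valid XML element names.
    """
    doc = ET.Element("text")

    # Metadata block: one child element per metadata field.
    header = ET.SubElement(doc, "header")
    for key, value in metadata.items():
        ET.SubElement(header, key).text = value

    body = ET.SubElement(doc, "body")
    # Naive sentence split: break after ., ! or ? followed by whitespace.
    for sentence in re.split(r"(?<=[.!?])\s+", text.strip()):
        if not sentence:
            continue
        s = ET.SubElement(body, "s")
        # Tokens are runs of word characters or single punctuation marks.
        tokens = re.findall(r"\w+|[^\w\s]", sentence)
        for position, token in enumerate(tokens, start=1):
            w = ET.SubElement(s, "w", n=str(position))
            w.text = token
    return doc


# Sample input; the metadata values are invented for this example.
doc = text_to_xml(
    "Das ist ein Beispiel. Es hat zwei Sätze.",
    {"title": "Beispieltext", "source": "hypothetical newspaper", "year": "2008"},
)
xml_string = ET.tostring(doc, encoding="unicode")
print(xml_string)
```

In a real corpus, each `w` element would typically carry further linguistic annotation such as a lemma or part-of-speech tag; these are left out here because they require an external tagger.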
At http://www.korpus-suedtirol.it you can find more detailed information on the project and access to the query interface.
Last update: 15.9.2008