Internship in the area of Computational Linguistics/Computer Science ("extracting neologisms from text corpora")
The Institute for Specialised Communication and Multilingualism offers an internship in the areas of computational linguistics and computer science.
The internship involves extracting neologisms from text corpora. For this task, tools for the (semi-)automatic extraction of 'new words' (neologism candidates) from continuously updated web corpora and other resources will have to be designed and/or adapted. The contexts in which the candidates appear should be recorded, together with metadata such as the source and the date of extraction. The candidates will also have to be filtered against existing lexicon entries, other neologism lists, existing lists of Austrian, Swiss, and South Tyrolean variants, proper name lists, lists of spelling errors and typos, etc., to produce a reduced list of more probable candidates.
For evaluating the candidates, a graphical user interface will have to be created, dynamically updating to reflect new and revised candidates.
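To give applicants a rough idea of the extraction and filtering steps described above, here is a minimal sketch in Python. It is only an illustration under simplifying assumptions, not the institute's actual tooling: the names `Candidate` and `extract_candidates` are hypothetical, tokenisation and sentence splitting are deliberately naive, and all exclusion lists (lexicon, variant lists, proper names, known typos) are assumed to have been merged into a single set of known words.

```python
import re
from dataclasses import dataclass
from datetime import date


@dataclass
class Candidate:
    """One occurrence of a neologism candidate (hypothetical record layout)."""
    word: str
    context: str      # sentence in which the candidate was found
    source: str       # e.g. a URL or corpus name
    extracted: date   # date of extraction


def extract_candidates(text, source, known_words, extracted=None):
    """Return every token occurrence that is not covered by the known-word set."""
    extracted = extracted or date.today()
    known = {w.lower() for w in known_words}
    candidates = []
    # Naive sentence split: break after ., ! or ? followed by whitespace.
    for sentence in re.split(r"(?<=[.!?])\s+", text.strip()):
        # Tokens are runs of letters, optionally joined by hyphens.
        for token in re.findall(r"[^\W\d_]+(?:-[^\W\d_]+)*", sentence):
            if token.lower() not in known:
                candidates.append(Candidate(token, sentence, source, extracted))
    return candidates


if __name__ == "__main__":
    known = {"der", "ist", "neu", "aus", "Bolzano"}
    sample = "Der Podcast ist neu. Der Podcast aus Bolzano ist neu."
    for c in extract_candidates(sample, "example-corpus", known):
        print(c.word, "|", c.context, "|", c.source, "|", c.extracted)
```

Each occurrence is kept as a separate record so that every context is preserved; grouping by word and ranking by frequency would be a natural next step before manual evaluation.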
Starting date: October/November 2008
Duration: 3-6 months; full or part time
This position can count as an obligatory internship, as required by certain universities. ERASMUS funding may be available for this internship with the support of the intern's university; Italian citizens enrolled at a foreign university may also be able to make use of it.
Please send your application (including your CV and a cover letter outlining your interest in the internship) to the following address:
European Academy Bolzano
Institute for Specialised Communication and Multilingualism
Viale Druso 1 - 39100 Bolzano
communication.multilingualism@eurac.edu
Please add the following phrase to your CV; otherwise your application cannot be considered: 'I consent to the use of my personal data in accordance with the provisions of decree 196/2003.'