Description
While the frontiers in natural language processing (NLP) research are rapidly expanding, it is still a great challenge to develop customized NLP solutions in the real world. Properly annotated data is scarce, making it impossible to train or even finetune data-hungry algorithms. In particular for named entity recognition (NER), accurate annotations are crucial but very expensive to gather. Even more so in the 3 Swiss national languages.
Active learning methods can mitigate the problem by facilitating the annotation task and reducing the work effort required by domain experts.
Within the scope of this internship, the student will leverage an active learning approach to develop an efficient annotation tool and integrate it into one of our existing NLP products. In addition, the application of NER to data anonymization will be studied and developed. This application is becoming rapidly pervasive with all data analytics services that require a certain level of privacy and security. Existing libraries have very good performance when it comes to NER for English documents, however for other languages, the performance drops, often drastically. Data anonymization systems need to rely on highly performant extraction models, with minimal leakage and having such a system
Objectives
The goal of the internship is to:
INTERNSHIPin Lausanne. Join our team as intern and you will find a young, dynamic and culturally diverse working environment.
About ELCA
With 50+ year of history and over 1300 specialists, we offer a unique spectrum of experience, skills and technical innovations.
Read moreCorporateStarting your career with us!
You want to leverage your educational background, apply your infinite curiosity and your out-of-the-box thinking.
Read moreJob opportunityAll Job opportunities
Your initiative is a chance ! We're constantly looking for talented individuals. Check our latest job opportunities !
OK, accept all
These cookies provide us with insight into traffic sources and allow us to better understand our visitors anonymously.
(Google Analytics and CrazyEgg)
NewDisableAllowSocial media cookies allow content sharing on your preferred networks.
(ShareThis)
NewDisableAllowThese cookies are used to track visitors across websites.
The intention is to enable us to offer more relevant, targeted content to existing contacts (ClickDimensions) and display ads that are relevant and engaging for users (Facebook Pixels).
NewDisableAllowFor more information about these cookies and our cookie policy, click here