[PDF][PDF] Multilingual lexicon bootstrapping-improving a lexicon induction system using a parallel corpus

P Ziering, L van der Plas, H Schütze - Proceedings of the sixth …, 2013 - aclanthology.org
Proceedings of the sixth international joint conference on natural …, 2013aclanthology.org
We address the task of improving the quality of lexicon bootstrapping, ie, of expanding a
semantic lexicon on a given corpus. A main problem of iterative bootstrapping techniques is
the fact that lexicon quality degrades gradually as more and more false terms are added. We
propose to exploit linguistic variation between languages to reduce this problem of semantic
drift with a knowledge-lean and language-independent ensemble method. Our results on
English and German show that lexicon bootstrapping benefits significantly from the …
Abstract
We address the task of improving the quality of lexicon bootstrapping, ie, of expanding a semantic lexicon on a given corpus. A main problem of iterative bootstrapping techniques is the fact that lexicon quality degrades gradually as more and more false terms are added. We propose to exploit linguistic variation between languages to reduce this problem of semantic drift with a knowledge-lean and language-independent ensemble method. Our results on English and German show that lexicon bootstrapping benefits significantly from the multilingual symbiosis.
aclanthology.org
Showing the best result for this search. See all results