Multilingual Lexicons

Multilingual Central Repository (MRC)

Authors:
 TALP, and the other members of the MEANING project.
 
 
Description:
 The Multilingual Central Repository (MCR) follows the model proposed by the EuroWordNet project. EuroWordNet (Vossen, 1998) is a multilingual lexical database with wordnets for several European languages, which are structured as the Princeton WordNet. The Princeton WordNet contains information about nouns, verbs, adjectives and adverbs in English and is organized around the notion of a synset.

 

 The current version of the MCR (Atserias et al., 2004) is a result of the 5th Framework MEANING project. The MCR integrates into the same EuroWordNet framework wordnets from five different languages (together with four English WordNet versions). The MCR also integrates WordNet Domains (Magnini and Cavaglià, 2000) and new versions of the Base Concepts and Top Concept Ontology. The final version of the MCR contains 1,642,389 semantic relations between synsets, most of them acquired by automatic means. This is almost one order of magnitude larger than the Princeton WordNet (204,074 unique semantic relations in WordNet 2.0).



Functionality:
 To add Semantic Knowledge to any Resource.



Technology:
 Java, Php, Perl are used for the Demos.
 MySQL is used by the Database containing MCR.



Technical Requirements:
 Java, Perl, Apache and MySqL are required for the use of MCR.



Modules:



Innovation:
 Currently, MCR integrates into the EuroWordNet framework five local wordnets (including four versions of the English WordNet from Princeton), the EuroWordNet Top Concept ontology, MultiWordNet Domains, and hundreds of thousand of new semantic relations and properties automatically acquired from corpora. MCR constitutes the largest and richest multilingual resource for lexical knowledge ever build.



Development:
 MCR (Multilingual Central Repository) was the result of the 5th Framework European MEANING Project (2002-2005), and has been extendend thanks to domestic projects KNOW (2006-2009) and KNOW2 (2010-2012).



Publications:
 There are a lot of related publications, including Deliverables and Reports that can be found here: http://nlp.lsi.upc.edu/projectes/meaning/documentation/3rdYear/



Remarkable publications are:

 

  • Cuadros M. and Rigau G. KnowNet: Building a Large Net of Knowledge from the Web. 22nd International Conference on Computational Linguistics COLING'08. Manchester, UK. 2008.

 

  • Álvez J., Atserias J., Carrera J., Climent S., Laparra E., Oliver A. and Rigau G. Complete and Consistent Annotation of WordNet using the Top Concept Ontology. 6th international conference on Language Resources and Evaluation, LREC'08, Marrakesh, Morroco. 2008.

 

  • Cuadros M. and Rigau G. Quality Assessment of Large-Scale Knowledge Resources. Proceedings of Joint SIGDAT Conference on Empirical Methods in Natural Language Processing (EMNLP'06). Sydney, Australia. 2006.

 

  • Atserias J., Villarejo L., Rigau G., Agirre E., Carroll J., Magnini B., Vossen P. The MEANING Multilingual Central Repository. In Proceedings of the Second International Global WordNet Conference (GWC-2004). ISBN 80-210-3302-9. Brno, Czech Republic. Enero, 2004.


Contact: This email address is being protected from spambots. You need JavaScript enabled to view it.

Additional information