Musa Alkhalifa, Manuel Bertran, William J. Black, Sabri Elkateb, Javier Farreres, David Farwell, Christiane Fellbaum, James Kirk, Ma Antònia Martí, Adam Pease, Horacio Rodríguez, Piek Vossen
The Arabic WordNet (AWN) is a lexical database of the Arabic language following the development process of Princeton English WordNet and Euro WordNet. It utilizes the Suggested Upper Merged Ontology as an interlingua to link Arabic WordNet to previously developed wordnets. Christiane Fellbaum at Princeton was the project lead. The project was sponsored by DOI/REFLEX.
From http://www.globalwordnet.org/AWN/DataSpec.html you can get the XML data exchange specifications of the database.
AWN contains about 11,000 synsets (including 1,000 NE).
There are several different ways for accessing the database:
- The browser package (available at http://sourceforge.net/projects/awnbrowser/) includes the AWN data and Princeton WN2.0 mappings in a relational database. You can use the export facilities to export the data as XML or CSV to taylor them to your needs
- The database can also be downloaded in XML format (linked to Princeton WN 2.0) from http://nlp.lsi.upc.edu/awn/get_bd.php
- A set of basic python functions for accessing the database can be obtained from: http://nlp.lsi.upc.edu/awn/AWNDatabaseManagement.py.gz
- A Java API written by Abobakr Ahmed, available from sourceforge: http://sourceforge.net/projects/javasourcecodeapiarabicwordnet/
AWN Browser: Browsing the database
AWN can be downloaded in XML format and access its content be directly used at developers' will.
Java, Perl, MySQL
One of the most important lexical resources for Arabic language.
- Christiane Fellbaum, Musa Alkhalifa, William J. Black, Sabri Elkateb, Adam Pease, Horacio Rodríguez, Piek Vossen (2006). Introducing the Arabic WordNet project. Proceedings of the 3rd Global Wordnet Conference, Jeju Island, Korea, January, 2006. http://nlp.lsi.upc.edu/papers/fellbaum-alkhalifa-206.pdf
- Christiane Fellbaum, Musa Alkhalifa, William J. Black, Sabri Elkateb, Adam Pease, Horacio Rodríguez, Piek Vossen (2006). Building a WordNet for Arabic. Proceedings of the the 5th Conference on Language Resources and Evaluation LREC2006, May, 2006. http://nlp.lsi.upc.edu/papers/fellbaum06.pdf
- Horacio Rodríguez, David Farwell, Javi Farreres, Manuel Bertran, Musa Alkhalifa, M. Antonia Martí, William Black, Sabri Elkateb, James Kirk, Adam Pease, Piek Vossen, and Christiane Fellbaum. Arabic WordNet: Current State and Future Extensions in: Proceedings of the Fourth International GlobalWordNet Conference - GWC 2008, Szeged, Hungary, January 22-25, 2008. http://nlp.lsi.upc.edu/papers/rodriguez08.pdf
- Horacio Rodríguez, David Farwell, Javi Farreres, Manuel Bertran, Musa Alkhalifa, M. Antonia Martí (2008). Arabic WordNet: Semi-automatic Extensions using Bayesian Inference. Proceedings of the the 6th Conference on Language Resources and Evaluation LREC2008. Marrakech (Morocco), May 2008. http://nlp.lsi.upc.edu/papers/rodriguez08b.pdf