UWN is an automatically constructed multilingual lexical knowledge base based on WordNet.
The English language represents a constantly decreasing fraction of the Web. China and the EU each have greatly surpassed the U.S. in the number of Internet users, and other regions are expected to follow. Multilingual knowledge bases address this development by providing labels in multiple languages and making the semantic connections between words and names in different languages explicit.
For over 1,500,000 words in over 200 languages, UWN provides a corresponding list of meanings and shows how such meanings are semantically related. Additionally, the new MENTA extension adds a large-scale hierarchical taxonomy of named entities and their classes, drawing on over 200 different language editions of Wikipedia. This leads to a knowledge base with over 15 million words and names in different languages.
The meanings are embedded in a large taxonomy.
For instance, a word like "board" could refer to a wooden panel, to a committee, to a blackboard, as a verb to the process of getting on a vehicle (e.g. "to board a plane"), and so on. For each of these meanings one can obtain the corresponding words in different languages, e.g. the committee sense of "board" corresponds to "комитет" in Russian and "委員会" in Japanese. Additionally, meanings are connected to related meanings, e.g. the committee meaning is linked to its generalizations administrative unit, social group, etc., and for each of these meanings one can again obtain corresponding words in different languages.
We offer a User Interface that allows you to search and browse UWN.
Towards a Universal Wordnet by Learning from Combined Evidence
Gerard de Melo, Gerhard Weikum (2009)
In: Proc. 18th ACM Conference on Information and Knowledge Management (CIKM 2009), Hong Kong, China.