Lexica

From UNL Wiki
Revision as of 17:52, 17 September 2012 by Martins (Talk | contribs)
Jump to: navigation, search

The UNL framework contains three different types of lexical databases: UNL-only, NL-only and UNL-NL.

UNL

The UNL lexical databases are the following:

  • The UNL Dictionary, or simply UNLdic, is a flat list of UW's and their semantic features
  • The UNL Knowledge Base, or simply UNLkb, is a network with systematic relations between UW's
  • The UNL Example Base, or simply UNLeb, is a network with any relations between UW's

These three databases are nested. The UNLdic contains UW's and their basic semantic features (such as the information that the UW corresponding to "table" is a nominal concrete concept that belongs to the class of artifacts). The UNLkb contains the UNLdic and the set of relations that are necessary to define a UW (such as the information that "table" is a piece of furniture with vertical legs and a flat horizontal surface). The UNLeb contains the UNLkb and the set of relations that are often found between UW's (such as the information that tables are normally round or square, that are made of hard materials, etc.). In general, the difference between the UNLkb and the UNLeb is that the former is dictionary-based (i.e., it tries to represent the information that is normally ascribed in dictionaries) whereas the latter is corpus-based (i.e., it tries to represent the concept as it appears in the corpus).

NL

  • The NL Dictionary, or simply NLdic, is a list of natural language entries with the corresponding morphological and syntactic features.

UNL-NL

  • The UNL-NL Dictionary, or simply UNL-NLdic, is list of systematic lexical mappings between UW's and natural language entries
  • The UNLization Memory, or UNL Memory Base, or simply UNLMB, is a list of mappings between UNL and a given natural language
Software