UNL System

From UNL Wiki
(Difference between revisions)
Jump to: navigation, search
(UNLization)
(UNLization)
Line 14: Line 14:
 
*The [[NL-UNL Transformation Grammar]]
 
*The [[NL-UNL Transformation Grammar]]
 
*The [[NL-UNL Disambiguation Grammar]]
 
*The [[NL-UNL Disambiguation Grammar]]
 +
*A [[NLizer]]
  
 
== NLization ==  
 
== NLization ==  

Revision as of 13:43, 24 September 2012

The UNL System is the set of basic modules of UNL and the rules to combine them.

Contents

UNLization

Main article: UNLization

UNLization.png

UNLization is the process of representing the content of a natural language structure into UNL. It may involve the following modules:

NLization

Main article: NLization

NLization.png

Modules

  • Lexical Databases
    • The UNL-NL Dictionary and the NL-UNL Dictionary contains mappings between UWs and natural language entries. It is a bilingual lexicon where UW's are translated into "lexical realisation units" of a given natural language, along with the minimal set of features necessary to differentiate between homonyms. As all open-class natural language entries are associated to UWs, the UNL-NL Dictionary also works as the pivot table that allows the alignment of all NL dictionaries.
    • The UNL Ontology, the UNL Knowledge Base and the UNL Memory and the NL-UNL Dictionary contains semantic relations between UW's along with a degree of necessity, i.e., the possibility of occurrence. These repositories include ontological relations ("a kind of" and "an instance of"), in case of the UNL Ontology; necessary ontological and thematic relations (such as "is the agent of", "is the place where", "is the moment when"), in case of the UNL Knowledge Base; and typical ontological and thematic relations, in case of the UNL Memory.
  • Grammars
    • The UNL-NL D-Grammar and the UNL-NL T-Grammar are the set of rules used for natural language generation (out of UNL). The NL-UNL D-Grammar and the NL-UNL T-Grammar are the set of rules used for natural language analysis (to UNL). The T-Grammar is the repository of the transformation rules, which are used to manipulate data structures; the D-Grammar is the repository of disambiguation rules, which are used to control the use of transformation rules.

Specs

Tagset
The Tagset is a harmonized system for representing linguistic constants.
Dictionary Specs
The Dictionary Specs state the syntax of dictionaries.
Grammar Specs
The Grammar Specs state the syntax of grammar rules.
KB Specs
The UNL Knowledge Base Specs state the syntax of knowledge bases.
Memory Specs
The Memory Specs state the syntax of example bases.
Software