Lingware

From UNL Wiki
(Difference between revisions)
Jump to: navigation, search
 
(28 intermediate revisions by 2 users not shown)
Line 1: Line 1:
The [[UNL System]] comprises three different types of language resources: '''lexical databases''', '''grammars''' and '''corpora'''.
+
The [[UNL System]] comprises three different types of language resources: '''lexica''', '''grammars''' and '''corpora'''. These resources have been provided mainly through the [[UNLarium]].
  
; Lexical Databases
+
== [[Lexica]] ==
: In the UNL framework, there are two major types of lexical databases: monolingual (in UNL only) and multilingual (in UNL and other natural languages).
+
''Main article: [[Lexica]]''
::*Monolingual resources:
+
*[[UNL Dictionary]], or UNL<sup>dic</sup>: a flat list of UW's and their corresponding semantic features
::**The [[UNL Dictionary]], which is a flat list of UWs
+
*[[UNL Ontology]]: a hierarchy of UW's
::**The [[UNL Ontology]], which is a hierarchy of UWs
+
*[[UNL Knowledge Base]], or UNL<sup>KB</sup>: a network of necessary interactions between UW's
::**The [[UNL Knowledge Base]], which is a network of UWs
+
*[[UNL Memory]]: a network of necessary and typical interactions between UW's
::*Multilingual resources:
+
*[[NL Dictionary]], or NL<sup>dic</sup>: a list of natural language entries and their corresponding features
::**[[UNL-NL Dictionaries]]
+
*[[NL Memory]]: a list of typical interactions between natural language entries
 +
*[[UNL-NL Dictionary]], or UNL-NL<sup>dic</sup>: a list of lexical mappings between UNL and a given natural language
 +
*[[UNL-NL Memory]]: a list of mappings between UNL and a given natural language
  
; [[Grammars]]
+
== [[Grammar]]s ==
: Enconversion and deconversion grammars for UNL
+
''Main article: [[Grammar]]''
  
; [[Corpora]]
+
*UNL-NL T-Grammar: used for natural language generation
: Documents written in UNL
+
*UNL-NL D-Grammar: used for improving the results of the UNL-NL T-Grammar
 +
*NL-UNL T-Grammar: used for natural language analysis
 +
*NL-UNL D-Grammar: used for tokenization and for improving the results of the NL-UNL T-Grammar
 +
 
 +
== [[Corpus|Corpora]] ==
 +
''Main article: [[Corpus]]''
 +
 
 +
*UNL Corpus: documents written in UNL, to be provided according to the [[UNL document|UNL document structure]]
 +
*NL Corpus: documents written in natural language

Latest revision as of 19:43, 21 September 2012

The UNL System comprises three different types of language resources: lexica, grammars and corpora. These resources have been provided mainly through the UNLarium.

Lexica

Main article: Lexica

  • UNL Dictionary, or UNLdic: a flat list of UW's and their corresponding semantic features
  • UNL Ontology: a hierarchy of UW's
  • UNL Knowledge Base, or UNLKB: a network of necessary interactions between UW's
  • UNL Memory: a network of necessary and typical interactions between UW's
  • NL Dictionary, or NLdic: a list of natural language entries and their corresponding features
  • NL Memory: a list of typical interactions between natural language entries
  • UNL-NL Dictionary, or UNL-NLdic: a list of lexical mappings between UNL and a given natural language
  • UNL-NL Memory: a list of mappings between UNL and a given natural language

Grammars

Main article: Grammar

  • UNL-NL T-Grammar: used for natural language generation
  • UNL-NL D-Grammar: used for improving the results of the UNL-NL T-Grammar
  • NL-UNL T-Grammar: used for natural language analysis
  • NL-UNL D-Grammar: used for tokenization and for improving the results of the NL-UNL T-Grammar

Corpora

Main article: Corpus

  • UNL Corpus: documents written in UNL, to be provided according to the UNL document structure
  • NL Corpus: documents written in natural language
Software