Tagset

From UNL Wiki
(Difference between revisions)
Jump to: navigation, search
(General Guidelines)
(List of attributes and values)
Line 14: Line 14:
  
 
== List of attributes and values ==
 
== List of attributes and values ==
 +
 +
The hierarchy of tags is depicted in the tree below. The topmost levels represent the attributes of which the tags are a value. Examples and definitions for each tag are presented below.
  
 
{{#tree:id=tagset|openlevels=0|root=Attributes|
 
{{#tree:id=tagset|openlevels=0|root=Attributes|

Revision as of 15:26, 17 November 2009

The set of features in a UNL-driven dictionary depends on the structure of the natural language and may vary a lot. However, in order to better standardize lexical resources inside the UNL framework, the UNDL Foundation recommends the adoption of the following tags for some specific and pervasive grammatical phenomena. Several of those linguistic constants have been already proposed to the Data Category Registry (ISO 12620), and represent widely accepted linguistic concepts. Our main intention here is just to provide a harmonized system to be shared by the UNL community so as to make dictionaries as easily understandable and exchangeable as possible.

General Guidelines

In order to define the tags to be used in the UNL Tagset, the following premises were adopted:

  • Tags should be as language-independent as possible (i.e., they should avoid language biases)
  • Tags should be as comprehensive as possible (i.e., they should cover all widely accepted linguistic concepts)
  • Tags should be as few as possible (i.e., they should avoid redundancy)
  • Tags should be as short as possible (i.e., they should fit in a three-character string)
  • Tags should be as mnemonic as possible (i.e., they should be readable)
  • Tags should be provided in an subsumptive containment hierarchy (a taxonomic hierarchy) so that upper level values could be inferred from the lower ones.

The resulting set of tags, which is still subject to additions and revisions, is presented below. For the time being, the definitions and examples have been extracted out of the Glossary of Linguistic Terms (Loos et alii), available at SIL International. The tags are expected to migrate to an on-line environment, still under construction, where accredited linguists will have the opportunity to enhance and to improve this repertoire.

List of attributes and values

The hierarchy of tags is depicted in the tree below. The topmost levels represent the attributes of which the tags are a value. Examples and definitions for each tag are presented below.

Software