Tagset

From UNL Wiki
(Difference between revisions)
Jump to: navigation, search
(List of attributes)
(List of attributes and values)
Line 184: Line 184:
 
**contraction (CTT)
 
**contraction (CTT)
 
**determiner (DET)
 
**determiner (DET)
**dummy word (DUM)
 
 
***article (ART)
 
***article (ART)
 
***demonstrative (DEM)
 
***demonstrative (DEM)
 
***quantifier (QUA)
 
***quantifier (QUA)
 +
**dummy word (DUM)
 
**interjection (ITJ)
 
**interjection (ITJ)
 
**noun (NOU)
 
**noun (NOU)
Line 217: Line 217:
 
***reflexive verb (RXV)
 
***reflexive verb (RXV)
 
*relative tense (RTE)
 
*relative tense (RTE)
**past (RPT)
+
**relative past (RPT)
 
**relative nonpast (NRPT)
 
**relative nonpast (NRPT)
**present (RPS)
+
**relative present (RPS)
**future (RFT)
+
**relative future (RFT)
 
**relative nonfuture (NRFT)
 
**relative nonfuture (NRFT)
 
*semantic features (SEM)
 
*semantic features (SEM)
**acts or actions (ACT)
+
**act or action (ACT)
 
**animal (ANL)
 
**animal (ANL)
 
**artifact (ARF)
 
**artifact (ARF)
 
**attribute (ATT)
 
**attribute (ATT)
**body parts (BON)
+
**body part (BON)
**body actions (BOV)
+
**body action (BOV)
**cognition nouns (CGN)
+
**cognitive noun (CGN)
**cognition verbs (CGV)
+
**cognitive verb (CGV)
 
**change (CHA)
 
**change (CHA)
**communication nouns (CMN)
+
**communication noun (CMN)
**communication verbs (CMV)
+
**communication verb (CMV)
 
**competition (CPT)
 
**competition (CPT)
 
**creation (CRE)
 
**creation (CRE)
Line 245: Line 245:
 
**motion (MOT)
 
**motion (MOT)
 
**motive (MTV)
 
**motive (MTV)
**natural events (NEV)
+
**natural event (NEV)
**natural objects (NOB)
+
**natural object (NOB)
 
**perception (PCP)
 
**perception (PCP)
 
**natural phenomena (PHE )
 
**natural phenomena (PHE )
Line 274: Line 274:
 
*syntactic roles (SYN)
 
*syntactic roles (SYN)
 
**adverbial phrase (AP)
 
**adverbial phrase (AP)
**adjunct to a conjunction (CA)
 
 
***adjunct to an adverb (AA)
 
***adjunct to an adverb (AA)
 
***adverbial phrase (AB)
 
***adverbial phrase (AB)
Line 280: Line 279:
 
***specifier of an adverb (AS)
 
***specifier of an adverb (AS)
 
**conjunctional phrase (CP)
 
**conjunctional phrase (CP)
 +
**adjunct to a conjunction (CA)
 
***conjunctional phrase (CB)
 
***conjunctional phrase (CB)
 
***complement of a conjunction (CC)
 
***complement of a conjunction (CC)

Revision as of 14:58, 17 November 2009

The set of features in a UNL-driven dictionary depends on the structure of the natural language and may vary a lot. However, in order to better standardize lexical resources inside the UNL framework, the UNDL Foundation recommends the adoption of the following tags for some specific and pervasive grammatical phenomena. Several of those linguistic constants have been already proposed to the Data Category Registry (ISO 12620), and represent widely accepted linguistic concepts. Our main intention here is just to provide a harmonized system to be shared by the UNL community so as to make dictionaries as easily understandable and exchangeable as possible.

General Guidelines

In order to define the tags to be used in the UNL Tagset, the following premises were adopted:

  • Tags should be as few as possible
  • Tags should be as short as possible
  • Tags should be as mnemonic as possible

These assumptions led us to the following general guidelines:

  • Tags should be made of a three-character upper-case string
  • Tags should be labelled out of English words
  • Tags should be provided in a attribute-value structure, along with definitions and examples.

The resulting set of tags, which is still subject to additions and revisions, is presented below. For the time being, the definitions and examples have been extracted out of the Glossary of Linguistic Terms (Loos et alii), available at SIL International. The tags are expected to migrate to an on-line environment, still under construction, where accredited linguists will have the opportunity to improve this repertoire.

List of attributes and values

Software