Tagset

From UNL Wiki
(Difference between revisions)
Jump to: navigation, search
Line 1: Line 1:
The set of features in a UNL-NL dictionary depends on the structure of the natural language and may vary a lot. However, in order to better standardize lexical resources inside the UNL framework, the UNDL Foundation recommends the adoption of the following tags for some specific and pervasive grammatical phenomena. According to the [[UNL-NL Dictionary Specs]], the features may be represented either as a simple list or as attribute-value pairs.
+
The set of features in a UNL-driven dictionary depends on the structure of the natural language and may vary a lot. However, in order to better standardize lexical resources inside the UNL framework, the UNDL Foundation recommends the adoption of the following tags for some specific and pervasive grammatical phenomena. Several of those linguistic constants have been already proposed to the '''Data Category Registry''' (ISO 12620), and represent widely accepted linguistic concepts. Our main intention here is just to provide a harmonized system to be shared by the UNL community so as to make dictionaries as easily understandable as possible.
== ATTRIBUTE-VALUE LIST ==
+
===  ANI (ANIMACY)  ===
+
Animacy is a grammatical and/or semantic category of nouns based on how sentient or alive the referent of the noun is. Animacy can have various effects on the grammar of a language, such as word order, case endings, or the form a verb takes when it is associated with that noun.
+
  
ANIMATE (ANM) - Indicates an animate reference - Example: he, she
+
== NAMING STRATEGIES ==
  
INANIMATE (NANM) - Indicates an inanimate reference - Example: it
+
In order to define the tags to be used in the UNL Tagset, the following premises were adopted:
 +
* Tags should be as few as possible
 +
* Tags should be as short as possible
 +
* Tags should be as mnemonic as possible
  
===  ASP (ASPECT)  ===
+
These assumptions led us to the following general guidelines:
The grammatical aspect (sometimes called viewpoint aspect) of a verb defines the temporal flow (or lack thereof) in the described event or state. In English, for example, the past-tense sentences "I swam" and "I was swimming" differ in aspect (the first sentence is in what is called the perfective or completive aspect, and the second in what is called the imperfective or durative aspect).
+
* Tags should be made of a three-character upper-case string (except for negative values, which should be preceded by "N", such as NPFC = non-perfect);
 +
* Tags should be labelled out of English words;
 +
* Tags should be provided in a attribute-value structure, along with definitions and examples.
  
IMPERFECTIVE (NPFC) - An event in the process of unfolding (often a repeated or habitual event) - Example: I was swimming.
+
The resulting set of tags, which is still subject to additions and revisions, is presented below. For the time being, the definitions and examples have been extracted out of the ''Glossary of Linguistic Terms'' (Loos et alii), available at [http://www.sil.org/linguistics/GlossaryOfLinguisticTerms/ SIL International], and are expected to migrate to an online environment, still under construction, where accredited linguists will have the opportunity of improving this repertoire.
 
+
PERFECTIVE (PFC) - A single event conceived as a unit - Example: I swam.
+
 
+
===  CAS (CASE)  ===
+
The case of a noun or pronoun indicates its grammatical function in a greater phrase or clause such as the role of subject or of direct object.
+
 
+
ABLATIVE (ABL) - Indicates movement from something, and/or cause
+
 
+
ACCUSATIVE (ACC) - Indicates the direct object of a verb - Example: him (in I saw him)
+
 
+
DATIVE (DAT) - Indicates the indirect object of a verb - Example: us (in He gave us the book)
+
 
+
INSTRUMENTAL (INS) - Indicates an object used in performing an action
+
 
+
LOCATIVE (LOC) - Indicates a location
+
 
+
NOMINATIVE (NOM) - Indicates the subject of a finite verb. - Example: I (in I saw him)
+
 
+
POSSESSIVE (POS) - Indicates the possessor of another noun - Example: my
+
 
+
===  CLU (CLUSIVITY)  ===
+
A distinction between inclusive and exclusive
+
 
+
EXCLUSIVE (EXC) - Does not include the addressee - Example: let us
+
 
+
INCLUSIVE (INC) - Includes the addressee - Example: let's
+
 
+
===  DEF (DEFINITENESS)  ===
+
Distinguishes between entities which are specific and identifiable in a given context (definite noun phrases) and entities which are not (indefinite noun phrases)  ===
+
DEFINITE (DEF) - Specific and identifiable in a given context - Example: the
+
 
+
INDEFINITE (NDEF) - Non-identifiable in a given context - Example: a
+
 
+
===  DEG (DEGREE OF COMPARISON)  ===
+
Describes the relational value of one thing with something in another clause of a sentence
+
 
+
COMPARATIVE (COM) - An adjective that compares the quality with that of another of its kind - Example: better
+
 
+
SUPERLATIVE (SUP) - An adjective that compares the quality with many or all others of its kind - Example: best
+
 
+
===  EVI (EVIDENTIALITY)  ===
+
The indication of the nature of evidence for a given statement, that is, whether evidence exists for the statement and/or what kind of evidence exists.
+
 
+
EVIDENT (EVI) - Example: to be (in Bob is hungry)
+
 
+
NON-EVIDENT (NEVI - Example: to look (in Bob looks hungry)
+
 
+
===  EVT (EVENT TENSE)  ===
+
A temporal linguistic quality expressing the time at, during, or over which a state or action denoted by a verb occurs with reference to the speaker.
+
 
+
FUTURE EVENT TENSE (ET2) - Example: will be (I will be here)
+
 
+
PAST EVENT TENSE (ET0) - Example: was (I was here)
+
 
+
PRESENT EVENT TENSE (ET1) - Example: am (I am here)
+
 
+
===  GEN (GENDER)  ===
+
FEMININE (FEM) - Includes most words that refer to females. - Example: she
+
 
+
MASCULINE (MCL) - Includes most words that refer to males. - Example: he
+
 
+
MASCULINE AND FEMININE (MCLAFEM) - Words that can be used to refer both to males and females. - Example: they (he and she)
+
 
+
MASCULINE OR FEMININE (MCLOFEM) - Words that can be used to refer either to males or females. - Example: you (singular)
+
 
+
NEUTER (NEU) - Includes mostly words that do not refer to males or females. - Example: it
+
 
+
===  LEX (LEXICAL STATUS)  ===
+
COMPOUND (CPW) - Any combination of words linked by "-" - Example: baby-talk
+
 
+
MULTIWORD (MTW) - Any string comprising more than a word - Example: United States of America
+
 
+
SUBWORD (SBW) - Any string smaller than a word (a root, a stem, etc) - Example: bab (baby)
+
 
+
===  MOO (MOOD)  ===
+
CONDITIONAL (CON) -  The form of the verb used in conditional sentences to refer to a hypothetical state of affairs, or an uncertain event that is contingent on another set of circumstances.
+
 
+
IMPERATIVE (IMP) - A grammatical mood that expresses direct commands or requests. It is also used to signal a prohibition, permission or any other kind of exhortation.
+
 
+
INDICATIVE (IND)
+
 
+
INJUNCTIVE (INJ)
+
 
+
OPTATIVE (OPT) - A grammatical mood that indicates a wish or hope.
+
 
+
SUBJUNCTIVE (SUB)
+
 
+
A verb mood typically used in dependent clauses to express wishes, commands, emotion, possibility, judgment, opinion, necessity, or statements that are contrary to fact at present
+
 
+
===  NUM (NUMBER)  ===
+
A grammatical category of nouns, pronouns, and adjective and verb agreement that expresses count distinctions.
+
 
+
DUAL (DUA)
+
 
+
PAUCAL (PAU)
+
 
+
PLURAL (PLR) - Example: they
+
 
+
QUADRUAL (QDR)
+
 
+
SINGULAR (SNG) - Example: he
+
 
+
TRIAL (TRI)
+
 
+
===  PER (PERSON)  ===
+
A deictic reference to a participant in an event, such as the speaker, the addressee, or others.
+
 
+
FIRST PERSON PLURAL (1PP) - Example: we
+
 
+
FIRST PERSON SINGULAR (1PS) - Example: I
+
 
+
SECOND PERSON PLURAL (2PP) - Example: you
+
 
+
SECOND PERSON SINGULAR (2PS) - Example: you
+
 
+
THIRD PERSON PLURAL (3PP) - Example: they
+
 
+
THIRD PERSON SINGULAR (3PS) - Example: he
+
 
+
===  POL (POLARITY)  ===
+
The distinction of affirmative and negative, which indicates the truth or falsehood of a statement respectively.
+
 
+
AFFIRMATIVE (AFF)
+
 
+
NEGATIVE (NEG)
+
 
+
===  POS (PART OF SPEECH)  ===
+
ABBREVIATION (ABB) - Example: Dr.
+
 
+
ACRONYM (ACR) - Example: UNL
+
 
+
ADJECTIVE (ADJ) - Example: beautiful
+
 
+
ADVERB (ADV) - Example: beautifully
+
 
+
ARTICLE (ART) - Example: the
+
 
+
AUXILIARY VERB (AUX) - Example: will
+
 
+
CARDINAL NUMBER (CNB) - Example: two
+
 
+
CIRCUMPOSITION (CIR)
+
 
+
CONTRACTION (CTC) - Example: don't
+
 
+
COORDINATING CONJUNCTION (CCJ) - Example: and
+
 
+
COPULA (COP) - Example: be (to be)
+
 
+
DEMONSTRATIVE (DEM) - Example: this
+
 
+
FRACTION NUMBER (FNB) - Example: two thirds
+
 
+
GERUND (GER) - Example: sleeping
+
 
+
INFINITIVE (INF) - Example: be (to be)
+
 
+
INFIX (IFX)
+
 
+
INTERJECTION (ITJ) - Example: hello
+
 
+
INTERROGATIVE PRONOUN (IPR) - Example: who
+
 
+
MODAL VERB (MOV) - Example: can
+
 
+
NOUN (NOU) - Example: beauty
+
 
+
ORDINAL NUMBER (ONB) - Example: second
+
 
+
PARTICIPLE (PTP) - Example: done
+
 
+
PARTICLE (PTC) - Example: to
+
 
+
PERSONAL PRONOUN (PPR) - Example: I
+
 
+
POSTPOSITION (PPS)
+
 
+
PREFIX (PFX) - Example: un
+
 
+
PREPOSITION (PRE) - Example: against
+
 
+
PROPER NOUN (PPN) - Example: Geneva
+
 
+
QUANTIFIER (QUA) - Example: every
+
 
+
RELATIVE PRONOUN (RPR) - Example: who
+
 
+
SUBORDINANTING CONJUNCTION (SCJ) - Example: if
+
 
+
SUFFIX (SFX) - Example: s
+
 
+
VERB (VER) - Example: buy
+
 
+
===  RTE (REFERENCE TENSE)  ===
+
A temporal linguistic quality expressing the time at, during, or over which a state or action denoted by a verb occurs with reference to another state or action.
+
 
+
FUTURE REFERENCE TENSE (RT2)
+
 
+
PAST REFERENCE TENSE (RT0) - Example: had been (I had been here)
+
 
+
PRESENT REFERENCE TENSE (RT1)
+
 
+
 
+
 
+
===  TRA (TRANSITIVITY)  ===
+
A property of verbs that relates to whether a verb can take direct objects
+
 
+
AMBITRANSITIVE (ATST) - A verb that can be used both as intransitive or as transitive without requiring a morphological change - Example: read
+
 
+
DIRECT TRANSITIVE (TST) - A verb which takes a subject and a single direct object - Example: kiss
+
 
+
DITRANSITIVE (DTST) - A verb which takes a subject and two objects. - Example: give
+
 
+
INDIRECT TRANSITIVE (ITST) - A verb which takes a subject and a single indirect object
+
 
+
INTRANSITIVE (NTST) - A verb that does not take an object - Example: fall
+
 
+
TRITRANSITIVE (TTST) - A verb which takes a subject and three objects. - Example: trade
+
 
+
===  VAL (VALENCY)  ===
+
Verb valency or valence refers to the number of arguments controlled by a verbal predicate.
+
 
+
AVALENT (VAL0) - An avalent verb takes no arguments - Example: rain
+
 
+
DIVALENT (VAL2) - A divalent verb takes two arguments - Example: eat
+
 
+
MONOVALENT (VAL1) - A monovalent verb takes one argument - Example: sleep
+
 
+
TETRAVALENT (VAL4) - A trivalent verb takes four arguments
+
 
+
TRIVALENT (VAL3) - A trivalent verb takes three arguments - Example: give
+
 
+
===  VOI (VOICE)  ===
+
The voice (also called diathesis) of a verb describes the relationship between the action (or state) that the verb expresses and the participants identified by its arguments (subject, object, etc.).
+
 
+
ACTIVE VOICE (ACT) - When the subject is the agent or actor of the verb.
+
 
+
MIDDLE VOICE (MID)
+
 
+
PASSIVE VOICE (PAS) - When the subject is the patient, target or undergoer of the action.
+
 
+
== LIST OF VALUES (in alphabetical order of tags) ==
+
 
+
{|border="1" cellpadding="2"
+
!Value Tag||Value Name||Value Definition||Value Example||Attribute Tag
+
|-
+
|1PP||FIRST PERSON PLURAL||||we||PER
+
|-
+
|1PS||FIRST PERSON SINGULAR||||I||PER
+
|-
+
|2PP||SECOND PERSON PLURAL||||you||PER
+
|-
+
|2PS||SECOND PERSON SINGULAR||||you||PER
+
|-
+
|3PP||THIRD PERSON PLURAL||||they||PER
+
|-
+
|3PS||THIRD PERSON SINGULAR||||he||PER
+
|-
+
|ABB||ABBREVIATION||||Dr.||POS
+
|-
+
|ABL||ABLATIVE||Indicates movement from something, and/or cause||||CAS
+
|-
+
|ACC||ACCUSATIVE||Indicates the direct object of a verb||him (in I saw him)||CAS
+
|-
+
|ACR||ACRONYM||||UNL||POS
+
|-
+
|ACT||ACTIVE VOICE||When the subject is the agent or actor of the verb.||||VOI
+
|-
+
|ADJ||ADJECTIVE||||beautiful||POS
+
|-
+
|ADV||ADVERB||||beautifully||POS
+
|-
+
|AFF||AFFIRMATIVE||||||POL
+
|-
+
|ANM||ANIMATE||Indicates an animate reference||he, she||ANI
+
|-
+
|ART||ARTICLE||||the||POS
+
|-
+
|ATST||AMBITRANSITIVE||A verb that can be used both as intransitive or as transitive without requiring a morphological change||read||TRA
+
|-
+
|AUX||AUXILIARY VERB||||will||POS
+
|-
+
|CCJ||COORDINATING CONJUNCTION||||and||POS
+
|-
+
|CIR||CIRCUMPOSITION||||||POS
+
|-
+
|CNB||CARDINAL NUMBER||||two||POS
+
|-
+
|COM||COMPARATIVE||An adjective that compares the quality with that of another of its kind||better||DEG
+
|-
+
|CON||CONDITIONAL||The form of the verb used in conditional sentences to refer to a hypothetical state of affairs, or an uncertain event that is contingent on another set of circumstances.||||MOO
+
|-
+
|COP||COPULA||||be (to be)||POS
+
|-
+
|CPW||COMPOUND||Any combination of words linked by "-"||baby-talk||LEX
+
|-
+
|CTC||CONTRACTION||||don't||POS
+
|-
+
|DAT||DATIVE||Indicates the indirect object of a verb||us (in He gave us the book)||CAS
+
|-
+
|DEF||DEFINITE||Specific and identifiable in a given context||the||DEF
+
|-
+
|DEM||DEMONSTRATIVE||||this||POS
+
|-
+
|DTST||DITRANSITIVE||A verb which takes a subject and two objects.||give||TRA
+
|-
+
|DUA||DUAL||||||NUM
+
|-
+
|ET0||PAST EVENT TENSE||||was (I was here)||EVT
+
|-
+
|ET1||PRESENT EVENT TENSE||||am (I am here)||EVT
+
|-
+
|ET2||FUTURE EVENT TENSE||||will be (I will be here)||EVT
+
|-
+
|EVI||EVIDENT||||to be (in Bob is hungry)||EVI
+
|-
+
|EXC||EXCLUSIVE||Does not include the addressee||let us||CLU
+
|-
+
|FEM||FEMININE||Includes most words that refer to females.||she||GEN
+
|-
+
|FNB||FRACTION NUMBER||||two thirds||POS
+
|-
+
|GER||GERUND||||sleeping||POS
+
|-
+
|IFX||INFIX||||||POS
+
|-
+
|IMP||IMPERATIVE||A grammatical mood that expresses direct commands or requests. It is also used to signal a prohibition, permission or any other kind of exhortation.||||MOO
+
|-
+
|INC||INCLUSIVE||Includes the addressee||let's||CLU
+
|-
+
|IND||INDICATIVE||||||MOO
+
|-
+
|INF||INFINITIVE||||be (to be)||POS
+
|-
+
|INJ||INJUNCTIVE||||||MOO
+
|-
+
|INS||INSTRUMENTAL||Indicates an object used in performing an action||||CAS
+
|-
+
|IPR||INTERROGATIVE PRONOUN||||who||POS
+
|-
+
|ITJ||INTERJECTION||||hello||POS
+
|-
+
|ITST||INDIRECT TRANSITIVE||A verb which takes a subject and a single indirect object||||TRA
+
|-
+
|LOC||LOCATIVE||Indicates a location||||CAS
+
|-
+
|MCL||MASCULINE||Includes most words that refer to males.||he||GEN
+
|-
+
|MCLAFEM||MASCULINE AND FEMININE||Words that can be used to refer both to males and females.||they (he and she)||GEN
+
|-
+
|MCLOFEM||MASCULINE OR FEMININE||Words that can be used to refer either to males or females.||you (singular)||GEN
+
|-
+
|MID||MIDDLE VOICE||||||VOI
+
|-
+
|MOV||MODAL VERB||||can||POS
+
|-
+
|MTW||MULTIWORD||Any string comprising more than a word||United States of America||LEX
+
|-
+
|NANM||INANIMATE||Indicates an inanimate reference||it||ANI
+
|-
+
|NDEF||INDEFINITE||Non-identifiable in a given context||a||DEF
+
|-
+
|NEG||NEGATIVE||||||POL
+
|-
+
|NEU||NEUTER||Includes mostly words that do not refer to males or females.||it||GEN
+
|-
+
|NEVI||NON-EVIDENT||||to look (in Bob looks hungry)||EVI
+
|-
+
|NOM||NOMINATIVE||Indicates the subject of a finite verb.||I (in I saw him)||CAS
+
|-
+
|NOU||NOUN||||beauty||POS
+
|-
+
|NPFC||IMPERFECTIVE||An event in the process of unfolding (often a repeated or habitual event)||I was swimming.||ASP
+
|-
+
|NTST||INTRANSITIVE||A verb that does not take an object||fall||TRA
+
|-
+
|ONB||ORDINAL NUMBER||||second||POS
+
|-
+
|OPT||OPTATIVE||A grammatical mood that indicates a wish or hope.||||MOO
+
|-
+
|PAS||PASSIVE VOICE||When the subject is the patient, target or undergoer of the action.||||VOI
+
|-
+
|PAU||PAUCAL||||||NUM
+
|-
+
|PFC||PERFECTIVE||A single event conceived as a unit||I swam.||ASP
+
|-
+
|PFX||PREFIX||||un||POS
+
|-
+
|PLR||PLURAL||||they||NUM
+
|-
+
|POS||POSSESSIVE||Indicates the possessor of another noun||my||CAS
+
|-
+
|PPN||PROPER NOUN||||Geneva||POS
+
|-
+
|PPR||PERSONAL PRONOUN||||I||POS
+
|-
+
|PPS||POSTPOSITION||||||POS
+
|-
+
|PRE||PREPOSITION||||against||POS
+
|-
+
|PTC||PARTICLE||||to||POS
+
|-
+
|PTP||PARTICIPLE||||done||POS
+
|-
+
|QDR||QUADRUAL||||||NUM
+
|-
+
|QUA||QUANTIFIER||||every||POS
+
|-
+
|RPR||RELATIVE PRONOUN||||who||POS
+
|-
+
|RT0||PAST REFERENCE TENSE||||had been (I had been here)||RTE
+
|-
+
|RT1||PRESENT REFERENCE TENSE||||||RTE
+
|-
+
|RT2||FUTURE REFERENCE TENSE||||||RTE
+
|-
+
|SBW||SUBWORD||Any string smaller than a word (a root, a stem, etc)||bab (baby)||LEX
+
|-
+
|SCJ||SUBORDINANTING CONJUNCTION||||if||POS
+
|-
+
|SFX||SUFFIX||||s||POS
+
|-
+
|SNG||SINGULAR||||he||NUM
+
|-
+
|SUB||SUBJUNCTIVE||A verb mood typically used in dependent clauses to express wishes, commands, emotion, possibility, judgment, opinion, necessity, or statements that are contrary to fact at present||||MOO
+
|-
+
|SUP||SUPERLATIVE||An adjective that compares the quality with many or all others of its kind||best||DEG
+
|-
+
|TRI||TRIAL||||||NUM
+
|-
+
|TST||DIRECT TRANSITIVE||A verb which takes a subject and a single direct object||kiss||TRA
+
|-
+
|TTST||TRITRANSITIVE||A verb which takes a subject and three objects.||trade||TRA
+
|-
+
|VAL0||AVALENT||An avalent verb takes no arguments||rain||VAL
+
|-
+
|VAL1||MONOVALENT||A monovalent verb takes one argument||sleep||VAL
+
|-
+
|VAL2||DIVALENT||A divalent verb takes two arguments||eat||VAL
+
|-
+
|VAL3||TRIVALENT||A trivalent verb takes three arguments||give||VAL
+
|-
+
|VAL4||TETRAVALENT||A trivalent verb takes four arguments||||VAL
+
|-
+
|VER||VERB||||buy||POS
+
|}
+
 
+
== LIST OF VALUES (in alphabetical order of names) ==
+
 
+
{|border="1" cellpadding="1"
+
!Value Name||Value Tag||Value Definition||Value Example||Attribute Tag
+
|-
+
|ABBREVIATION||ABB||||Dr.||POS
+
|-
+
|ABLATIVE||ABL||Indicates movement from something, and/or cause||||CAS
+
|-
+
|ACCUSATIVE||ACC||Indicates the direct object of a verb||him (in I saw him)||CAS
+
|-
+
|ACRONYM||ACR||||UNL||POS
+
|-
+
|ACTIVE VOICE||ACT||When the subject is the agent or actor of the verb.||||VOI
+
|-
+
|ADJECTIVE||ADJ||||beautiful||POS
+
|-
+
|ADVERB||ADV||||beautifully||POS
+
|-
+
|AFFIRMATIVE||AFF||||||POL
+
|-
+
|AMBITRANSITIVE||ATST||A verb that can be used both as intransitive or as transitive without requiring a morphological change||read||TRA
+
|-
+
|ANIMATE||ANM||Indicates an animate reference||he, she||ANI
+
|-
+
|ARTICLE||ART||||the||POS
+
|-
+
|AUXILIARY VERB||AUX||||will||POS
+
|-
+
|AVALENT||VAL0||An avalent verb takes no arguments||rain||VAL
+
|-
+
|CARDINAL NUMBER||CNB||||two||POS
+
|-
+
|CIRCUMPOSITION||CIR||||||POS
+
|-
+
|COMPARATIVE||COM||An adjective that compares the quality with that of another of its kind||better||DEG
+
|-
+
|COMPOUND||CPW||Any combination of words linked by "-"||baby-talk||LEX
+
|-
+
|CONDITIONAL||CON||The form of the verb used in conditional sentences to refer to a hypothetical state of affairs, or an uncertain event that is contingent on another set of circumstances.||||MOO
+
|-
+
|CONTRACTION||CTC||||don't||POS
+
|-
+
|COORDINATING CONJUNCTION||CCJ||||and||POS
+
|-
+
|COPULA||COP||||be (to be)||POS
+
|-
+
|DATIVE||DAT||Indicates the indirect object of a verb||us (in He gave us the book)||CAS
+
|-
+
|DEFINITE||DEF||Specific and identifiable in a given context||the||DEF
+
|-
+
|DEMONSTRATIVE||DEM||||this||POS
+
|-
+
|DIRECT TRANSITIVE||TST||A verb which takes a subject and a single direct object||kiss||TRA
+
|-
+
|DITRANSITIVE||DTST||A verb which takes a subject and two objects.||give||TRA
+
|-
+
|DIVALENT||VAL2||A divalent verb takes two arguments||eat||VAL
+
|-
+
|DUAL||DUA||||||NUM
+
|-
+
|EVIDENT||EVI||||to be (in Bob is hungry)||EVI
+
|-
+
|EXCLUSIVE||EXC||Does not include the addressee||let us||CLU
+
|-
+
|FEMININE||FEM||Includes most words that refer to females.||she||GEN
+
|-
+
|FIRST PERSON PLURAL||1PP||||we||PER
+
|-
+
|FIRST PERSON SINGULAR||1PS||||I||PER
+
|-
+
|FRACTION NUMBER||FNB||||two thirds||POS
+
|-
+
|FUTURE EVENT TENSE||ET2||||will be (I will be here)||EVT
+
|-
+
|FUTURE REFERENCE TENSE||RT2||||||RTE
+
|-
+
|GERUND||GER||||sleeping||POS
+
|-
+
|IMPERATIVE||IMP||A grammatical mood that expresses direct commands or requests. It is also used to signal a prohibition, permission or any other kind of exhortation.||||MOO
+
|-
+
|IMPERFECTIVE||NPFC||An event in the process of unfolding (often a repeated or habitual event)||I was swimming.||ASP
+
|-
+
|INANIMATE||NANM||Indicates an inanimate reference||it||ANI
+
|-
+
|INCLUSIVE||INC||Includes the addressee||let's||CLU
+
|-
+
|INDEFINITE||NDEF||Non-identifiable in a given context||a||DEF
+
|-
+
|INDICATIVE||IND||||||MOO
+
|-
+
|INDIRECT TRANSITIVE||ITST||A verb which takes a subject and a single indirect object||||TRA
+
|-
+
|INFINITIVE||INF||||be (to be)||POS
+
|-
+
|INFIX||IFX||||||POS
+
|-
+
|INJUNCTIVE||INJ||||||MOO
+
|-
+
|INSTRUMENTAL||INS||Indicates an object used in performing an action||||CAS
+
|-
+
|INTERJECTION||ITJ||||hello||POS
+
|-
+
|INTERROGATIVE PRONOUN||IPR||||who||POS
+
|-
+
|INTRANSITIVE||NTST||A verb that does not take an object||fall||TRA
+
|-
+
|LOCATIVE||LOC||Indicates a location||||CAS
+
|-
+
|MASCULINE||MCL||Includes most words that refer to males.||he||GEN
+
|-
+
|MASCULINE AND FEMININE||MCLAFEM||Words that can be used to refer both to males and females.||they (he and she)||GEN
+
|-
+
|MASCULINE OR FEMININE||MCLOFEM||Words that can be used to refer either to males or females.||you (singular)||GEN
+
|-
+
|MIDDLE VOICE||MID||||||VOI
+
|-
+
|MODAL VERB||MOV||||can||POS
+
|-
+
|MONOVALENT||VAL1||A monovalent verb takes one argument||sleep||VAL
+
|-
+
|MULTIWORD||MTW||Any string comprising more than a word||United States of America||LEX
+
|-
+
|NEGATIVE||NEG||||||POL
+
|-
+
|NEUTER||NEU||Includes mostly words that do not refer to males or females.||it||GEN
+
|-
+
|NOMINATIVE||NOM||Indicates the subject of a finite verb.||I (in I saw him)||CAS
+
|-
+
|NON-EVIDENT||NEVI||||to look (in Bob looks hungry)||EVI
+
|-
+
|NOUN||NOU||||beauty||POS
+
|-
+
|OPTATIVE||OPT||A grammatical mood that indicates a wish or hope.||||MOO
+
|-
+
|ORDINAL NUMBER||ONB||||second||POS
+
|-
+
|PARTICIPLE||PTP||||done||POS
+
|-
+
|PARTICLE||PTC||||to||POS
+
|-
+
|PASSIVE VOICE||PAS||When the subject is the patient, target or undergoer of the action.||||VOI
+
|-
+
|PAST EVENT TENSE||ET0||||was (I was here)||EVT
+
|-
+
|PAST REFERENCE TENSE||RT0||||had been (I had been here)||RTE
+
|-
+
|PAUCAL||PAU||||||NUM
+
|-
+
|PERFECTIVE||PFC||A single event conceived as a unit||I swam.||ASP
+
|-
+
|PERSONAL PRONOUN||PPR||||I||POS
+
|-
+
|PLURAL||PLR||||they||NUM
+
|-
+
|POSSESSIVE||POS||Indicates the possessor of another noun||my||CAS
+
|-
+
|POSTPOSITION||PPS||||||POS
+
|-
+
|PREFIX||PFX||||un||POS
+
|-
+
|PREPOSITION||PRE||||against||POS
+
|-
+
|PRESENT EVENT TENSE||ET1||||am (I am here)||EVT
+
|-
+
|PRESENT REFERENCE TENSE||RT1||||||RTE
+
|-
+
|PROPER NOUN||PPN||||Geneva||POS
+
|-
+
|QUADRUAL||QDR||||||NUM
+
|-
+
|QUANTIFIER||QUA||||every||POS
+
|-
+
|RELATIVE PRONOUN||RPR||||who||POS
+
|-
+
|SECOND PERSON PLURAL||2PP||||you||PER
+
|-
+
|SECOND PERSON SINGULAR||2PS||||you||PER
+
|-
+
|SINGULAR||SNG||||he||NUM
+
|-
+
|SUBJUNCTIVE||SUB||A verb mood typically used in dependent clauses to express wishes, commands, emotion, possibility, judgment, opinion, necessity, or statements that are contrary to fact at present||||MOO
+
|-
+
|SUBORDINANTING CONJUNCTION||SCJ||||if||POS
+
|-
+
|SUBWORD||SBW||Any string smaller than a word (a root, a stem, etc)||bab (baby)||LEX
+
|-
+
|SUFFIX||SFX||||s||POS
+
|-
+
|SUPERLATIVE||SUP||An adjective that compares the quality with many or all others of its kind||best||DEG
+
|-
+
|TETRAVALENT||VAL4||A trivalent verb takes four arguments||||VAL
+
|-
+
|THIRD PERSON PLURAL||3PP||||they||PER
+
|-
+
|THIRD PERSON SINGULAR||3PS||||he||PER
+
|-
+
|TRIAL||TRI||||||NUM
+
|-
+
|TRITRANSITIVE||TTST||A verb which takes a subject and three objects.||trade||TRA
+
|-
+
|TRIVALENT||VAL3||A trivalent verb takes three arguments||give||VAL
+
|-
+
|VERB||VER||||buy||POS
+
|}
+

Revision as of 17:06, 16 November 2009

The set of features in a UNL-driven dictionary depends on the structure of the natural language and may vary a lot. However, in order to better standardize lexical resources inside the UNL framework, the UNDL Foundation recommends the adoption of the following tags for some specific and pervasive grammatical phenomena. Several of those linguistic constants have been already proposed to the Data Category Registry (ISO 12620), and represent widely accepted linguistic concepts. Our main intention here is just to provide a harmonized system to be shared by the UNL community so as to make dictionaries as easily understandable as possible.

NAMING STRATEGIES

In order to define the tags to be used in the UNL Tagset, the following premises were adopted:

  • Tags should be as few as possible
  • Tags should be as short as possible
  • Tags should be as mnemonic as possible

These assumptions led us to the following general guidelines:

  • Tags should be made of a three-character upper-case string (except for negative values, which should be preceded by "N", such as NPFC = non-perfect);
  • Tags should be labelled out of English words;
  • Tags should be provided in a attribute-value structure, along with definitions and examples.

The resulting set of tags, which is still subject to additions and revisions, is presented below. For the time being, the definitions and examples have been extracted out of the Glossary of Linguistic Terms (Loos et alii), available at SIL International, and are expected to migrate to an online environment, still under construction, where accredited linguists will have the opportunity of improving this repertoire.

Software