Language settings

From UNL Wiki
(Difference between revisions)
Jump to: navigation, search
(New page: '''Language settings''' are used to define the behaviour of a given language in case of absence of grammatical categories. They are necessary to solve underspecification problems that norm...)
 
Line 1: Line 1:
'''Language settings''' are used to define the behaviour of a given language in case of absence of grammatical categories. They are necessary to solve underspecification problems that normally appear when translating between languages that have very different grammatical structures.
+
'''Language settings''' are used to define the general parameters of a given language, such as word order, sentence structure and other overall patterns. They can also be used to describe grammatical redundancy (and therefore to avoid proliferating rules) or to indicate how an absent (i.e., a non-grammaticalized) category should be translated.
  
 
== When to use language settings ==
 
== When to use language settings ==
  
Language settings should be defined in two cases:
+
Language settings should be defined in three cases:
* to state grammatical redundancy (i.e., when the same grammatical structure represents several different linguistic phenomena); and
+
* to set phonotactic, morphotactic and syntactic parameters of the language;
* to indicate how an absent grammatical category should be translated.
+
* to state grammatical redundancy and to avoid proliferating rules; and
 +
* to ensure cross-linguistic mapping.  
 +
 
 +
=== General parameters ===
 +
 
 +
In English, the grammatical category of [[person]] is represented, in almost all cases, by the same morpheme (zero). Instead of repeating this information inside every verb paradigm, we can represent it as general language setting such as:
 +
 
 +
1PS:=0>""; (= if FIRST PERSON OF SINGULAR, then ADD NOTHING)
 +
2PS:=0>""; (= if SECOND PERSON OF SINGULAR, then ADD NOTHING)
 +
1PP:=0>""; (= if FIRST PERSON OF PLURAL, then ADD NOTHING)
 +
2PP:=0>""; (= if SECOND PERSON OF PLURAL, then ADD NOTHING)
 +
3PP:=0>""; (= if THIRD PERSON OF PLURAL, then ADD NOTHING)
 +
 
 +
In this case, only the exceptions (such as the 3PS and the verb "to be") would be treated inside the verb paradigm.
 +
 
 +
The same can be stated for the present progressive [[tense]], which is always formed by the periphrasis TO BE + GERUND. Instead of indicating this possibility inside the verb paradigms, we can simply create a general rule that would be applied in all cases.
 +
 
 +
ET1&PGS&1PS:="am"<<(+GER);
 +
ET1&PGS&2PS:="are"<<(+GER);
 +
ET1&PGS&3PS:="is"<<(+GER);
 +
ET1&PGS&1PP:="are"<<(+GER);
 +
ET1&PGS&2PP:="are"<<(+GER);
 +
ET1&PGS&3PP:="are"<<(+GER);
  
 
=== Grammatical redundancy ===
 
=== Grammatical redundancy ===
 +
 +
In English, the grammatical category of [[mood]] is conflated, and there is no clear morphological distinction between the indicative, the subjunctive, the conditional and other possible values of the attribute. This information can be represented by a general rule:
 +
 +
SUB=IND; The subjunctive is equal to the indicative
 +
CON=IND; The conditional is equal to the conditional
 +
IMP=IND; The imperative is equal to the indicative
 +
 +
In this case, the indicative (IND) is the only form that needs to be defined in the verb paradigm.
  
 
=== Translation ===  
 
=== Translation ===  
 +
 +
In English, the grammatical category of [[number]] may have only two different values: singular or plural. In several other languages, however, the number may assume other values, such as dual, trial and quadrual. The English grammar must inform what to do in those cases in order to ensure full intertranslatability.
 +
 +
The language settings may indicate that:
 +
 +
DUA:="a couple of"<<(+PLR); (if DUAL, the determiner "a couple of" should be generated at the left of the word, which would assume the value of PLURAL)
 +
TRI=PLR; (if TRIAL, the word will assume the value of PLURAL)
 +
QDR=PLR; (if QUADRUAL, the word will assume the value of PLURAL
  
 
== Syntax ==  
 
== Syntax ==  
  
== Examples ==
+
The language settings follow the general formalism that is defined in the [[UNL Grammar Specs]], with a single difference.

Revision as of 14:33, 3 September 2009

Language settings are used to define the general parameters of a given language, such as word order, sentence structure and other overall patterns. They can also be used to describe grammatical redundancy (and therefore to avoid proliferating rules) or to indicate how an absent (i.e., a non-grammaticalized) category should be translated.

Contents

When to use language settings

Language settings should be defined in three cases:

  • to set phonotactic, morphotactic and syntactic parameters of the language;
  • to state grammatical redundancy and to avoid proliferating rules; and
  • to ensure cross-linguistic mapping.

General parameters

In English, the grammatical category of person is represented, in almost all cases, by the same morpheme (zero). Instead of repeating this information inside every verb paradigm, we can represent it as general language setting such as:

1PS:=0>""; (= if FIRST PERSON OF SINGULAR, then ADD NOTHING) 2PS:=0>""; (= if SECOND PERSON OF SINGULAR, then ADD NOTHING) 1PP:=0>""; (= if FIRST PERSON OF PLURAL, then ADD NOTHING) 2PP:=0>""; (= if SECOND PERSON OF PLURAL, then ADD NOTHING) 3PP:=0>""; (= if THIRD PERSON OF PLURAL, then ADD NOTHING)

In this case, only the exceptions (such as the 3PS and the verb "to be") would be treated inside the verb paradigm.

The same can be stated for the present progressive tense, which is always formed by the periphrasis TO BE + GERUND. Instead of indicating this possibility inside the verb paradigms, we can simply create a general rule that would be applied in all cases.

ET1&PGS&1PS:="am"<<(+GER); ET1&PGS&2PS:="are"<<(+GER); ET1&PGS&3PS:="is"<<(+GER); ET1&PGS&1PP:="are"<<(+GER); ET1&PGS&2PP:="are"<<(+GER); ET1&PGS&3PP:="are"<<(+GER);

Grammatical redundancy

In English, the grammatical category of mood is conflated, and there is no clear morphological distinction between the indicative, the subjunctive, the conditional and other possible values of the attribute. This information can be represented by a general rule:

SUB=IND; The subjunctive is equal to the indicative CON=IND; The conditional is equal to the conditional IMP=IND; The imperative is equal to the indicative

In this case, the indicative (IND) is the only form that needs to be defined in the verb paradigm.

Translation

In English, the grammatical category of number may have only two different values: singular or plural. In several other languages, however, the number may assume other values, such as dual, trial and quadrual. The English grammar must inform what to do in those cases in order to ensure full intertranslatability.

The language settings may indicate that:

DUA:="a couple of"<<(+PLR); (if DUAL, the determiner "a couple of" should be generated at the left of the word, which would assume the value of PLURAL) TRI=PLR; (if TRIAL, the word will assume the value of PLURAL) QDR=PLR; (if QUADRUAL, the word will assume the value of PLURAL

Syntax

The language settings follow the general formalism that is defined in the UNL Grammar Specs, with a single difference.

Software