S-rule

From UNL Wiki
(Difference between revisions)
Jump to: navigation, search
(When to use S-rules)
Line 6: Line 6:
 
*[[subcategorization]], i.e., defining the number and the type of arguments of a given [[base form]];
 
*[[subcategorization]], i.e., defining the number and the type of arguments of a given [[base form]];
 
*[[case marking]], i.e., defining the grammatical cases of the arguments of a given [[base form]];
 
*[[case marking]], i.e., defining the grammatical cases of the arguments of a given [[base form]];
*[[agreement]], i.e., concord between different parts of phrase;
+
*[[agreement]], i.e., concord between different parts of a phrase;
 
*[[distribution]], i.e., defining the precedence of word forms;  
 
*[[distribution]], i.e., defining the precedence of word forms;  
*[[adjacency]], i.e., defining the distance of syntactic branches; and
+
*[[adjacency]], i.e., defining the distance between syntactic branches; and
 
*[[projection]], i.e., projecting syntactic structures out of the constituents.
 
*[[projection]], i.e., projecting syntactic structures out of the constituents.
  

Revision as of 08:50, 26 March 2010

S-rule (syntactic rule) is the formalism used for describing syntactic structures and syntactic operations in the UNLarium framework.

Contents

When to use S-rules

S-rules are used for:

  • composition, i.e., creating compounds out of the base forms (such as "take">"take into account");
  • periphrasis, i.e., generating analytic grammatical structures, such as in ("love">"will love")
  • subcategorization, i.e., defining the number and the type of arguments of a given base form;
  • case marking, i.e., defining the grammatical cases of the arguments of a given base form;
  • agreement, i.e., concord between different parts of a phrase;
  • distribution, i.e., defining the precedence of word forms;
  • adjacency, i.e., defining the distance between syntactic branches; and
  • projection, i.e., projecting syntactic structures out of the constituents.

When not to use S-rules

S-rules are not used for for affixation (prefixation, infixation, suffixation) or spelling changes, which must be addressed by A-rules and Ph-rules, respectively.

Types of S-rules

There are four types of S-rules:

  • Head extension extends a given head.
  • Specification creates a specifier (determiner) to the head;
  • Complementation creates a complement (object) to the head; and
  • Adjunction creates an adjunct (modifier) to the head.

For further information on the constituents "head", "specifier", "complement" and "adjunct", see Syntax.

Syntax

S-rules comply with the following syntax:

CONDITION := RELATION(HEAD; ARGUMENT);

Where

  • CONDITION (optional) is a tag or list of tags, extracted from the UNDLF Tagset, that indicates when the rule should be applied. It may also be a relation or a list of relations in case of projection rules. The condition must be omitted in case of general rules (i.e., when the rule is always applied).
  • RELATION is the syntactic relation, extracted from the syntactic roles, between the head and its argument. An S-rule may comprise several different relations.
  • HEAD (optional) is the head of the syntactic structure, which is to be omitted when does not undergo any change;
  • ARGUMENT (optional in case of head-only relations) is the argument (the specifier, the complement or the adjunct) of the head.

The HEAD and the ARGUMENT may be expressed as:

  • a "string" (strings must come between parentheses);
  • a [lemma] (lemmas must come between square brackets);
  • a feature or a set of features, separated by comma, and extracted from the the UNDLF Tagset;
  • a direction (">",">>","<","<<");
  • an index between the left and the right side of the rule (to be specified by the syntax %name);
  • an action, to be performed through an A-rule; and
  • a syntactic relation itself.

Examples

Examples of S-rules:

  • composition
    • VA("into account"); (add the string "into account" as the adjunct of the verb)
  • subcategorization
    • VC(PH("in")); (the complement of the verb is a prepositional phrase headed by the preposition "in")
  • agreement
    • VS(ANUM,APER); (the specifier of the verb assigns number (ANUM) and person (APER) to its head
  • case marking
    • VS(NOM); (the specifier of the verb receives the case nominative (NOM)
  • distribution
    • VA(>>); (the adjunct of the verb comes at the right side of the verb after a blank space)
  • adjacency
    • VA(AJ2); (the adjunct of the verb integrates the second projection of the head)
  • projection
    • VS(%head;%spec)VB(%head;%comp):=VP(VB(%head;%comp);%spec); (integrate the two relations on the left side into a single relation)

Formal Syntax

S-rules comply with the following formal syntax:

<S-RULE>                ::= <CONDITION> ":=" (<SYNTACTIC RELATION>)+";"
<CONDITION>             ::= <TAG>(","<TAG>)* | (<SYNTACTIC RELATION>)*
<SYNTACTIC RELATION>    ::= <HEAD-DRIVEN RELATION> "(" (<NODE>";")? <NODE> ")"
<HEAD-DRIVEN RELATION>  ::= {one of the head-driven syntactic relations defined in the UNDLF Tagset} 
<NODE>                  ::= <FEATURE>(","<FEATURE>)* 
<FEATURE>               ::= <ID>|<TAG>|"""<STRING>"""|"["<STRING>"]"|<DIRECTION>|<SYNTACTIC RELATION>|<ACTION>
<ID>                    ::= "%"[a-zA-Z_0-9]+
<TAG>                   ::= {one of the tags defined in the UNDLF Tagset}
<STRING>                ::= [a..Z]+
<DIRECTION>             ::= ">"|">>"|"<"|"<<"
<ACTION>                ::= <PREFIXATION> | <SUFFIXATION> | <INFIXATION> | <REPLACEMENT> (cf. A-rule)

where
<a> = a is a non-terminal symbol
"a" = a is a constant
a | b = a or b
(a)? = a can be repeated 0 or one time
(a)* = a can be repeated 0 or more times
(a)+ = a can be repeated 1 or more times

Software