Subcategorization

From UNL Wiki
Revision as of 15:16, 14 August 2013 by Martins (Talk | contribs)
Jump to: navigation, search

Subcategorization is the definition of the number and types of the syntactic arguments that co-occurs with the base form in order to form a phrase.

What is subcategorization

The idea of a subcategorization is related to the concepts of valency and transitivity. Subcategorization rules are schemes that define the number and the type of specifiers, complements and adjuncts that a base form needs to constitute its corresponding maximal projection.

For instance, the noun "apple" does not require any adjunct, specifier or complement to form a noun phrase (as in "I love apples"). The fact that it is often combined with other forms to form more complex noun phrases (as in "the apple", "delicious apple", "apple from Argentina", etc) is rather accidental, and does not affect the fact the word does not need them to constitute the simplest maximal projection. The same happens to the forms "beautiful" and "now", which may project, alone, an adjective phrase and an adverbial phrase, respectively.

However, there are forms such as "give", "of", "and", "Netherlands" and "interested" that cannot project phrases without the help of other constituents. They require specifiers, complements or adjuncts to form a "minimal maximal projection". The verb "give", for instance, requires at least one specifier (the subject) and two objects (a direct and an indirect), even if, in several contexts, they are not explicit[1]. The same happens to "of", "and" and "interested", which always requires a complement to form a prepositional phrase, a complementizer phrase and an adjective phrase, respectively. The form "Netherlands", on the other hand, requires a specifier ("the") to project a noun phrase (I go to Netherlands).

A subcategorization rule is the syntactic device that describes these conditions, i.e., what is really necessary (the obligatory constituents) for a form to project its corresponding maximal projection.

Subcategorization rules and subcategorization frames

In the UNLarium framework, subcategorization is indicated by a set of transformations carried over the base form. This set of transformations can be represented by:

  • subcategorization frames, in case of regular behaviour (i.e., a set of transformations that is followed by several different words)
  • subcategorization rules, i.e., in case of irregular behaviour (i.e., a set of transformation that is followed by very few words); or
  • subcategorization frames and subcategorization rules, in case of quasi-regular behaviour (i.e., when the word is mainly regular but has some subcategorization particularities).

For instance, the rule "VS(NP)VC(NP);" (= the verb takes a noun phrase as the specifier and a noun phrase as a complement) is associated to all direct transitive verbs of English (to buy, to make, to do, etc) and should be defined, therefore, as a subcategorization frame. The same happens to the rule "VS(NP)VC(PP([on]));" (= the verb takes a noun phrase as the subject and a prepositional phrase headed by "on" as a complement), which is less general, but still quite comprehensive, and would be applicable to all indirect transitive verbs that select the preposition on (such as to depend, to insist, to operate, etc).

Examples of subcategorization frames
Intransitive verbs: VS(NP);
Direct transitive verbs: VS(NP)VC(NP);
Indirect transitive verbs selecting prepositional phrases headed by "on": VS(NP)VC(PH([on]));
Indirect transitive verbs selecting prepositional phrases headed by "in": VS(NP)VC(PH([in]));
Ditransitive verbs: VS(NP)VC(NP)VC(PH([to]));
Nouns selecting prepositional phrases headed by "of": NC(PH([of]));
Adjectives selecting prepositional phrases headed by "in": JC(PH([in]));
Adjectives selecting prepositional phrases headed by "of": JC(PH([of]));
Adverbs selecting prepositional phrases headed by "to": AC(PH([to]));
etc.


Syntax

Subcategorization frames and subcategorization rules are expressed by S-rules, a special formalism for representing the syntactic structure of the phrase.

Software