C-rule
(→Expressing compounds in the UNLarium) |
|||
Line 4: | Line 4: | ||
In the UNL<sup>arium</sup> framework, compounds are treated as ordinary simple words except in case of discontinuous multi-word expressions or with infixation (such as "give in" or "take into account"). In these cases, the [[lemma]] is different from the [[base form]], and the compound-formation process is expected to be defined through [[S-rule]]s such as the following: | In the UNL<sup>arium</sup> framework, compounds are treated as ordinary simple words except in case of discontinuous multi-word expressions or with infixation (such as "give in" or "take into account"). In these cases, the [[lemma]] is different from the [[base form]], and the compound-formation process is expected to be defined through [[S-rule]]s such as the following: | ||
− | < | + | <SYNTACTIC ROLE>(<ADDED>); |
Where:<br/> | Where:<br/> | ||
− | < | + | <SYNTACTIC ROLE> is the [[Syntax#Syntactic_Role|syntactic role]] (VA, VC, VS, VH, etc) of the term to be added to the base for)<br /> |
− | + | ||
<ADDED> is the term to be added to the base form to form the compound. It can be a string between "quotes" or a lemma between [brackets]<br /> | <ADDED> is the term to be added to the base form to form the compound. It can be a string between "quotes" or a lemma between [brackets]<br /> | ||
− | == Examples == | + | === Examples === |
{|border=1 cellpadding=2 | {|border=1 cellpadding=2 | ||
!Lemma | !Lemma | ||
Line 30: | Line 29: | ||
|throw <person> to the lions | |throw <person> to the lions | ||
|throw | |throw | ||
− | |VA("to the lions") | + | |VA("to the lions")VC(NP) |
|the string "to the lions" is to be added to the base form as an adjunct to the verb (VA) and a noun phrase (NP) is to be added as a complement to the verb (VC) | |the string "to the lions" is to be added to the base form as an adjunct to the verb (VA) and a noun phrase (NP) is to be added as a complement to the verb (VC) | ||
|} | |} |
Revision as of 14:42, 23 March 2010
Compounding or composition is the word-formation process of creating compounds by combining or putting together lexemes.
Contents |
Expressing compounds in the UNLarium
In the UNLarium framework, compounds are treated as ordinary simple words except in case of discontinuous multi-word expressions or with infixation (such as "give in" or "take into account"). In these cases, the lemma is different from the base form, and the compound-formation process is expected to be defined through S-rules such as the following:
<SYNTACTIC ROLE>(<ADDED>);
Where:
<SYNTACTIC ROLE> is the syntactic role (VA, VC, VS, VH, etc) of the term to be added to the base for)
<ADDED> is the term to be added to the base form to form the compound. It can be a string between "quotes" or a lemma between [brackets]
Examples
Lemma | BF | Compound | Description |
---|---|---|---|
give in | give | VH([in]) | the lemma "in" is to be added to the base form as part of the head of the verb (VH) |
take into account | take | VA("into account") | the string "into account" is to be added to the base form as an adjunct to the verb (VA) |
throw <person> to the lions | throw | VA("to the lions")VC(NP) | the string "to the lions" is to be added to the base form as an adjunct to the verb (VA) and a noun phrase (NP) is to be added as a complement to the verb (VC) |
Observation
- Phrasal verbs
- Particles of phrasal verbs must be represented as part of the head, if non separable, or as adjuncts, otherwise:
- give in = VH([in]); ("give in something" but
"give something in") - give back = VA([back]); ("give back something" or "give something back")
- Strings and lemmas
- In the compound-formation process, the UNLarium distinguishes between strings (to be represented between "") and lemmas (to be represented between [ ]). The difference between strings and lemmas has to do with the dictionary status. Lemmas, but not strings, are expected to be defined as dictionary entries:
- VA("into account"); (add the string "into account" as a verbal adjunct, take > take into account)
- VC([love]); (add the lemma "love" as a verbal complement, such as in make > make love)
In the above, it's unlikely to have "into account" as a single entry, whereas "love" is probably already there.
Syntax
Compounds may be explicitly expressed by S-rules, a formalism for describing the syntactic structure of phrases.