Morphology is the branch of linguistics that studies patterns of word formation within and across languages, and attempts to formulate rules that model the knowledge of the speakers of those languages.
Words, word forms and lexemes
There are several difficulties in arriving at a consistent use of the term "word" in relation to other categories of linguistic description, and several criteria (prosodical, morphological, syntactical) have been suggested for the identification of words in a language. One of the main difficulties concerns the use of the term "word" both as a class and as any of its elements. The forms "love", "loves", "loving" and "loved", for instance, may be considered to be different "words" of English or different forms (variants) of the same "word", depending on the case.
In order to avoid ambiguities, linguists differentiate between two senses of "word". The first sense, the one in which "love", "loves", "loving" and "loved" are different "words", is usually called a word form. Word forms are therefore "the physically definable units which one encounters in a stretch of writing (bounded by spaces) or speech (where identification is more difficult, but where there may be phonological clues to identify boundaries, such as a pause, or juncture features)" (Crystal, 2008, p. 522).
The second sense, the one in which "love", "loves", "loving" and "loved" are "the same word", is normally called a lexeme. The lexeme is an abstract underlying unit that corresponds to a set of different word forms reputed to be part of the same word class.
Different word forms are said to be part of the same lexeme if they share the same fundamental morphological identity. This means that word forms are analysed into smaller units, called morphemes, which are the smallest linguistic units that have semantic meaning.
Morphemes can be classified according to several different criteria. The most frequent ones are syntactic and semantic. From the syntactic perspective, morphemes can be:
- free morpheme, if they can stand alone (such as "table", "happy"); or
- bound morpheme, if they cannot stand alone (such as "un-", "-ism" and "-rupt-").
From the semantic point of view, there are again two main different types of morphemes:
- root - the primary unit of a word unit, which carries the most significant aspects of semantic content; and
- affix - a morpheme attached to the root to modify its meaning (such as "-s" in "tables", or "un-" in "undo").
Word forms may have one (“fire”, “man”, “dish”, “washer”) or several roots (“fireman”, "dishwasher"), and zero ("happy") or more ("unhappy", "unhappiness") affixes.
Affixes are divided into several categories, depending on their position and their role with reference to the root. The most important positional categories are:
- prefix (PFX) - Appears at the front of the root (such as "un-" in "undo", or "re-" in "rewrite")
- suffix (SFX) - Appears at the back of the root (such "-s" in "tables", or "-er" in "writer")
- infix (IFX) - Appears within the root (very rare in English, such as "-ma-" in "sophistimacated")
- circumfix (CCX) - Appears at the front and at the back of the root (very rare in English, such as "a-" + "-ed" in "ascattered")
As for their roles, there are two main different types of affixes:
- inflectional affix - assign grammatical properties (such as number, gender, tense, person) to the root in order to form the different word forms of the same lexeme ("-s" in "tables", "-ed" in "loved")
- derivational affix - form a new lexeme by modifying the meaning (and sometimes the category) of the root ("un-" in "unhappy", "-ness" in "happiness").
The combination of roots and derivational affixes is usually called stem (or inflectional root). The stem is therefore the longest common denominator among all word forms belonging to the same lexeme. It defines the basic structure over which inflections apply. For instance:
Morphological categories often coincide, but they correspond to different levels of morphological analysis. In non-inflectional (invariant) lexemes (such as English adjectives and adverbs), for instance, the stem is equal to the word form ("happily" = word form = stem). In non-derivational (primitive) lexemes, the stem is equal to the root ("here" = stem = root). In any case, especially in inflectional and derivational lexemes, these categories are clearly differentiated. The Spanish lexeme corresponding to the forms of the adjective "desanimado" (= discouraged), for instance, has the following morphological items:
- word forms = desanimado, desanimada, desanimados, desanimadas
- stem = desanimad-
- inflectional affixes = -o, -a, -os, -as
- derivational affixes = des-, -ad-
- root = anim-
In case of overlapping, these categories are used from the least comprehensive ("root") to the most comprehensive ("word form"). Thus;
- "friend" (word form = stem = root) is classified as root;
- "unfriendly" (word form = stem) is classified as stem; and
- "clothes" (word form > stem) is classified as word form.
In some languages, a given inflection may assume different forms. The feature ALT must be used for alternative forms.
In English, for instance, the word 'volcano' may have two different plural forms:
In case of more than one possible alternative form, the features ALT1, ALT2 and ALT3 must be used instead of ALT.
For instance, in Arabic the word 'elephant' has three plural forms, as indicated below:
In the UNLarium, we recognize six main morphological categories:
|lexeme||word forms||root||derivational affixes||inflectional affixes||stem|
|6||love, loves, loving, loved||lov-||-e,-s, -ing, -ed||lov-|
|7||desanimado, desanimada, desanimados, desanimadas||anim-||des-, -ad-||-o, -a, -s||desanimad-|
|8||unbreakableness||break||un-, -able, -ness||unbreakableness|
|9||fireman, firemen||fire, man||fireman|
|10||part of speech, parts of speech||part, of, speech||-s||part of speech|