Linguistic typology (or language typology) is a field of linguistics that studies and classifies languages according to their structural features. Its aim is to describe and explain the common properties and the structural diversity of the world's languages. Its subdisciplines include, but are not limited to: qualitative typology, which deals with the issue of comparing languages and within-language variance; quantitative typology, which deals with the distribution of structural patterns in the world's languages; theoretical typology, which explains these distributions; syntactic typology, which deals with word order, word form, word grammar and word choice; and lexical typology, which deals with language vocabulary.
Qualitative typology develops cross-linguistically viable notions or types that provide a framework for the description and comparison of individual languages. A few examples appear below.
These labels usually appear abbreviated as "SVO" and so forth, and may be called "typologies" of the languages to which they apply. The most commonly attested word orders are SOV and SVO while the least common orders are those that are object initial with OVS being the least common with only four attested instances.
In the 1980s, linguists began to question the relevance of geographical distribution of different values for various features of linguistic structure. They may have wanted to discover whether a particular grammatical structure found in one language is likewise found in another language in the same geographic location. Some languages split verbs into an auxiliary and an infinitive or participle and put the subject and/or object between them. For instance, German (Ich habe einen Fuchs im Wald gesehen - *"I have a fox in-the woods seen"), Dutch (Hans vermoedde dat Jan Marie zag leren zwemmen - *"Hans suspected that Jan Marie saw to learn to swim") and Welsh (Mae'r gwirio sillafu wedi'i gwblhau - *"Is the checking spelling after its to complete"). In this case, linguists base the typology on the non-analytic tenses (i.e. those sentences in which the verb is not split) or on the position of the auxiliary. German is thus SVO in main clauses and Welsh is VSO (and preposition phrases would go after the infinitive).
Some languages allow varying degrees of freedom in their constituent order, posing a problem for their classification within the subject-verb-object schema. Languages with bound case markings for nouns, for example, tend to have more flexible word orders than languages where case is defined by position within a sentence or presence of a preposition.[example needed] To define a basic constituent order type in this case, one generally looks at frequency of different types in declarative affirmative main clauses in pragmatically neutral contexts, preferably with only old referents. Thus, for instance, Russian is widely considered an SVO language, as this is the most frequent constituent order under such conditions--all sorts of variations are possible, though, and occur in texts. In many inflected languages, such as Russian, Latin, and Greek, departures from the default word-orders are permissible but usually imply a shift in focus, an emphasis on the final element, or some special context. In the poetry of these languages, the word order may also shift freely to meet metrical demands. Additionally, freedom of word order may vary within the same language--for example, formal, literary, or archaizing varieties may have different, stricter, or more lenient constituent-order structures than an informal spoken variety of the same language.
On the other hand, when there is no clear preference under the described conditions, the language is considered to have "flexible constituent order" (a type unto itself).
An additional problem is that in languages without living speech communities, such as Latin, Ancient Greek, and Old Church Slavonic, linguists have only written evidence, perhaps written in a poetic, formalizing, or archaic style that mischaracterizes the actual daily use of the language. The daily spoken language of Sophocles or Cicero might have exhibited a different or much more regular syntax than their written legacy indicates.
A second major way of syntactic categorization is by excluding the subject from consideration. It is a well-documented typological feature that languages with a dominant OV order (object before verb), Japanese for example, tend to have postpositions. In contrast, VO languages (verb before object) like English tend to have prepositions as their main adpositional type. Several OV/VO correlations have been uncovered.
|Correlation||OV languages||VO languages|
|Order of noun and relative clause||either||noun before relative clause|
|Order of noun and genitive||genitive before noun||noun before genitive|
|Order of adjective and standard of comparison||standard before adjective||adjective before standard|
|Order of verb and adpositional phrase||adpositional phrase before verb||verb before adpositional phrase|
|Order of verb and manner adverb||manner adverb before verb||verb before manner adverb|
|Order of copula and predicative||predicate before copula||copula before predicate|
|Order of content verb and auxiliary verb||content verb before auxiliary||auxiliary before content verb|
|Order of adverbial subordinator and clause||clause-final subordinators||clause-initial subordinators|
Several processing explanations were proposed in the 1980s and 1990s for the above correlations. They suggest that the brain finds it easier to parse syntactic patterns which are either right or left branching, but not mixed. The most widely held such explanation is John A. Hawkins' Grammar-Performance Correspondence Hypothesis which argues that language is a non-innate adaptation to innate cognitive mechanisms. Typological tendencies are considered as being based on language users' preference for grammars that are organized efficiently, and on their avoidance of word orderings which cause processing difficulty. Some languages however exhibit regular inefficient patterning. These include the VO languages Chinese, with the adpositional phrase before the verb, and Finnish which has postpositions; but there are few other profoundly exceptional languages.
Another common classification distinguishes nominative-accusative alignment patterns and ergative-absolutive ones. In a language with cases, the classification depends on whether the subject (S) of an intransitive verb has the same case as the agent (A) or the patient (P) of a transitive verb. If a language has no cases, but the word order is AVP or PVA, then a classification may reflect whether the subject of an intransitive verb appears on the same side as the agent or the patient of the transitive verb. Bickel (2011) has argued that alignment should be seen as a construction-specific property rather than a language-specific property.
Many languages show mixed accusative and ergative behaviour (for example: ergative morphology marking the verb arguments, on top of an accusative syntax). Other languages (called "active languages") have two types of intransitive verbs—some of them ("active verbs") join the subject in the same case as the agent of a transitive verb, and the rest ("stative verbs") join the subject in the same case as the patient[example needed]. Yet other languages behave ergatively only in some contexts (this "split ergativity" is often based on the grammatical person of the arguments or on the tense/aspect of the verb). For example, only some verbs in Georgian behave this way, and, as a rule, only while using the perfective (aorist).
Linguistic typology also seeks to identify patterns in the structure and distribution of sound systems among the world's languages. This is accomplished by surveying and analyzing the relative frequencies of different phonological properties. These relative frequencies might, for example, be used to determine why contrastive voicing commonly occurs with plosives, as in English neat and need, but occurs much more rarely among fricatives, such as the English niece and knees. According to a worldwide sample of 637 languages, 62% have the voicing contrast in stops but only 35% have this in fricatives. In the vast majority of those cases, the absence of voicing contrast occurs because there is a lack of voiced fricatives and because all languages have some form of plosive, but there are languages with no fricatives. Below is a chart showing the breakdown of voicing properties among languages in the aforementioned sample.
|Plosive Voicing||Fricative Voicing|
|Total||221 (35%)||416 (65%)||637|
Languages worldwide also vary in the number of sounds they use. These languages can go from very small phonemic inventories (Rotokas with six consonants and five vowels) to very large inventories (!Xóõ with 128 consonants and 28 vowels). An interesting phonological observation found with this data is that the larger a consonant inventory a language has, the more likely it is to contain a sound from a defined set of complex consonants (clicks, glottalized consonants, doubly articulated labial-velar stops, lateral fricatives and affricates, uvular and pharyngeal consonants, and dental or alveolar non-sibilant fricatives). Of this list, only about 26% of languages in a survey of over 600 with small inventories (less than 19 consonants) contain a member of this set, while 51% of average languages (19-25) contain at least one member and 69% of large consonant inventories (greater than 25 consonants) contain a member of this set. It is then seen that complex consonants are in proportion to the size of the inventory.
Vowels contain a more modest number of phonemes, with the average being 5-6, which 51% of the languages in the survey have. About a third of the languages have larger than average vowel inventories. Most interesting though is the lack of relationship between consonant inventory size and vowel inventory size. Below is a chart showing this lack of predictability between consonant and vowel inventory sizes in relation to each other.
|Consonant Inventory||Vowel Quality Inventory|
|Total||115 (17%)||345 (51%)||220 (32%)||680|
Quantitative typology deals with the distribution and co-occurrence of structural patterns in the languages of the world. Major types of non-chance distribution include:
Linguistic universals are patterns that can be seen cross linguistically. Universals can either be absolute, meaning that every documented language exhibits this characteristic, or statistical, meaning that this characteristic is seen in most languages or is probable in most languages. Universals, both absolute and statistical can be unrestricted, meaning that they apply to most or all languages without any additional conditions. Conversely, both absolute and statistical universals can be restricted or implicational, meaning that a characteristic will be true on the condition of something else (if Y characteristic is true, then X characteristic is true).