Hindustani is the lingua franca of northern India and Pakistan, and through its two standardized registers, Hindi and Urdu, a co-official language of India and co-official and national language of Pakistan respectively. Phonological differences between the two standards are minimal.
Hindustani natively possesses a symmetrical ten-vowel system. The vowels [?], [?], [?] are always short in length, while the vowels [a:], [i:], [u:], [e:], [o:], [?:], [?:] are usually considered long, in addition to an eleventh vowel /æ:/ which is found in English loanwords. The distinction between short and long vowels is often described as tenseness, with short vowels being lax, and long vowels being tense.
Schwa is a short vowel which vanishes to nothing at unstressed position. /?/ is often realized more open than mid , i.e. as near-open .
The open central vowel is transcribed in IPA by either [a:] or [?:].
In Urdu, there is further short [a] (spelled ?, as in ? kamra [k?mra]) in word-final position, which contrasts with [a:] (spelled ?, as in ? la?k? [lka:]). This contrast is often not realized by Urdu speakers, and always neutralized in Hindi (where both sounds uniformly correspond to [a:]).
Among the close vowels, what in Sanskrit are thought to have been primarily distinctions of vowel length (that is /i, i:/ and /u, u:/), have become in Hindustani distinctions of quality, or length accompanied by quality (that is, /?, i:/ and /?, u:/). The opposition of length in the close vowels has been neutralized in word-final position, only allowing long close vowels in final position. As a result, Sanskrit loans which originally have a short close vowel are realized with a long close vowel, e.g. ?akti ( - 'energy') and vastu ( - 'item') are [kti:] and [stu:], not *[kt?] and *[st?].
The vowel represented graphically as ? - (romanized as ai) has been variously transcribed as [?:] or [æ:]. Among sources for this article, Ohala (1999), pictured to the right, uses [?:], while Shapiro (2003:258) and Masica (1991:110) use [æ:]. Furthermore, an eleventh vowel /æ:/ is found in English loanwords, such as /bæ:?/ ('bat'). Hereafter, ? - (romanized as ai) will be represented as [?:] to distinguish it from /æ:/, the latter. Despite this, the Hindustani vowel system is quite similar to that of English, in contrast to the consonants.
|IPA||Hindi||ISO 15919||Urdu||Approxi. English
|?:||?||?||au||?||lot (Received Pronunciation)|
|?||h||?||(Aspirated sounds) cake|
|||?||m?||?||?||nasal vowel faun|
([ã:, õ:], etc.)
In addition, [?] occurs as a conditioned allophone of /?/ (schwa) in proximity to /?/, if and only if the /?/ is surrounded on both sides by two underlying, orthographic schwas. This change is part of the prestige dialect of Delhi, but may not occur for every speaker. Here are some examples of this process:
|? / ? "to say"||kahn?||/k.n?:/||[k.n?:]|
|/ "to wait"||?haharn?||/..n?:/||[..n?:]|
However, the fronting of schwa does not occur in words with a schwa only on one side of the /?/ such as kah?n? /ka:ni:/ ( - 'a story') or b?har /ba:r/ (? - 'outside').
The vowel [?] occurs in proximity to /?/ if the /?/ is surrounded by one of the sides by a schwa and on other side by a round vowel. It differs from the vowel [?:] in that it is a short vowel. For example, in bahut /bt/ the /?/ is surrounded on one side by a schwa and a round vowel on the other side. One or both of the schwas will become [?] giving the pronunciation [bt].
As in French and Portuguese, there are nasalized vowels in Hindustani. There is disagreement over the issue of the nature of nasalization (barring English-loaned /æ/ which is never nasalized). Masica (1991:117) presents four differing viewpoints:
Masica supports this last view.
Hindustani has a core set of 28 consonants inherited from earlier Indo-Aryan. Supplementing these are two consonants that are internal developments in specific word-medial contexts, and seven consonants originally found in loan words, whose expression is dependent on factors such as status (class, education, etc.) and cultural register (Modern Standard Hindi vs Urdu).
Most native consonants may occur geminate (doubled in length; exceptions are /b?, ?, , ?/). Geminate consonants are always medial and preceded by one of the interior vowels (that is, /?/, /?/, or /?/). They all occur monomorphemically except [?:], which occurs only in a few Sanskrit loans where a morpheme boundary could be posited in between, e.g. /n + ?i:l/ for nil [n?'?:i:l] ('without shame').
For the English speaker, a notable feature of the Hindustani consonants is that there is a four-way distinction of phonation among plosives, rather than the two-way distinction found in English. The phonations are:
The last is commonly called "voiced aspirate", though Shapiro (2003:260) notes that,
"Evidence from experimental phonetics, however, has demonstrated that the two types of sounds involve two distinct types of voicing and release mechanisms. The series of so-called voice aspirates should now properly be considered to involve the voicing mechanism of murmur, in which the air flow passes through an aperture between the arytenoid cartilages, as opposed to passing between the ligamental vocal bands."
The murmured consonants are believed to be a reflex of murmured consonants in Proto-Indo-European, a phonation that is absent in all branches of the Indo-European family except Indo-Aryan and Armenian.
Stops in final position are not released.[Do unreleased stops maintain aspiration contrast?] /?/ varies freely with [v], and can also be pronounced [w]. /r/ is essentially a trill. In intervocalic position, it may have a single contact and be described as a flap [?], but it may also be a clear trill, especially in word-initial and syllable-final positions, and geminate /r:/ is always a trill in Arabic and Persian loanwords, e.g. zar? [za:] (? - 'little') versus well-trilled zarr? [z?ra:] ( - ? 'particle'). The palatal and velar nasals [?, ?] occur only in consonant clusters, where each nasal is followed by a homorganic stop, as an allophone of a nasal vowel followed by a stop, and in Sanskrit loanwords. There are murmured sonorants, [l?, r?, m?, n?], but these are considered to be consonant clusters with /?/ in the analysis adopted by Ohala (1999).
The fricative /?/ in Hindustani is typically voiced (as [?]), especially when surrounded by vowels, but there is no phonemic difference between this voiced fricative and its voiceless counterpart [h] (Hindustani's ancestor Sanskrit has such a phonemic distinction).
Hindustani also has a phonemic difference between the dental plosives and the so-called retroflex plosives. The dental plosives in Hindustani are laminal-denti alveolar as in Spanish, and the tongue-tip must be well in contact with the back of the upper front teeth. The retroflex series is not purely retroflex; it actually has an apico-postalveolar (also described as apico-pre-palatal) articulation, and sometimes in words such as ? /?u:?a:/ (? - 'broken') it even becomes alveolar.
In some Indo-Aryan languages, the plosives [?, ] and the flaps [?, ] are allophones in complementary distribution, with the former occurring in initial, geminate and postnasal positions and the latter occurring in intervocalic and final positions. However, in Standard Hindi they contrast in similar positions, as in naj ( - ? 'bird') vs ni?ar (? - 'fearless').
Hindustani does not distinguish between [v] and [w]. These are distinct phonemes in English, but conditional allophones of the phoneme /?/ in Hindustani (written ⟨?⟩ in Hindi or ⟨?⟩ in Urdu), meaning that contextual rules determine when it is pronounced as [v] and when it is pronounced as [w]. /?/ is pronounced [w] in onglide position, i.e. between an onset consonant and a following vowel, as in pakw?n ( , 'food dish'), and [v] elsewhere, as in vrat (? , 'vow'). Native Hindi speakers are unaware of the allophonic distinctions, though these are apparent to native English speakers.
In most situations, the allophony is non-conditional, i.e. the speaker can choose [v], [w], or an intermediate sound based on personal habit and preference, and still be perfectly intelligible, as long as the meaning is constant. This includes words such as advait ( ) which can be pronounced equally correctly as [?d?'w?:t?] or [?d?'v?:t?].
Loanwords from Sanskrit reintroduced /?/ into formal Modern Standard Hindi. In casual speech it is usually replaced by /n/. It does not occur initially and has a nasalized flap  as a common allophone.
Loanwords from Persian (including some words which Persian itself borrowed from Arabic or Turkish) introduced six consonants, /f, z, ?, q, x, ?/. Being Persian in origin, these are seen as a defining feature of Urdu, although these sounds officially exist in Hindi and modified Devanagari characters are available to represent them. Among these, /f, z/, also found in English and Portuguese loanwords, are now considered well-established in Hindi; indeed, /f/ appears to be encroaching upon and replacing /p?/ even in native (non-Persian, non-English, non-Portuguese) Hindi words as well as many other Indian languages such as Bengali, Gujarati and Marathi, as happened in Greek with phi. This /p?/ to /f/ shift also occasionally occurs in Urdu. While [z] is a foreign sound, it is also natively found as an allophone of /s/ beside voiced consonants.
The other three Persian loans, /q, x, ?/, are still considered to fall under the domain of Urdu, and are also used by many Hindi speakers; however, some Hindi speakers assimilate these sounds to /k, k?, ?/ respectively. The sibilant /?/ is found in loanwords from all sources (Arabic, English, Portuguese, Persian, Sanskrit) and is well-established. The failure to maintain /f, z, ?/ by some Hindi speakers (often non-urban speakers who confuse them with /p?, d?, s/) is considered nonstandard. Yet these same speakers, having a Sanskritic education, may hyperformally uphold /?/ and [?]. In contrast, for native speakers of Urdu, the maintenance of /f, z, ?/ is not commensurate with education and sophistication, but is characteristic of all social levels. The sibliant /?/ is very rare and is found in loanwords from Persian, Portuguese, and English and is considered to fall under the domain of Urdu and although it is officially present in Hindi, many speakers of Hindi assimilate it to /z/ or /d?/.
Being the main sources from which Hindustani draws its higher, learned terms- English, Sanskrit, Arabic, and to a lesser extent Persian provide loanwords with a rich array of consonant clusters. The introduction of these clusters into the language contravenes a historical tendency within its native core vocabulary to eliminate clusters through processes such as cluster reduction and epenthesis. Schmidt (2003:293) lists distinctively Sanskrit/Hindi biconsonantal clusters of initial /kr, k?, st, s?, ?r, sn, nj/ and final /t?, , nj, lj, r?, d?j, rj/, and distinctively Perso-Arabic/Urdu biconsonantal clusters of final /ft, rf, mt, mr, ms, kl, tl, bl, sl, tm, lm, ?m, ?r/.
Stress is on the heaviest syllable of the word, and in the event of a tie, on the last such syllable. If all syllables are light, the penultimate is stressed. However, the final mora of the word is ignored when making this assignment (Hussein 1997) [or, equivalently, the final syllable is stressed either if it is extra-heavy, and there is no other extra-heavy syllable in the word or if it is heavy, and there is no other heavy or extra-heavy syllable in the word]. For example, with the ignored mora in parentheses:[clarification needed]
Content words in Hindustani normally begin on a low pitch, followed by a rise in pitch. Strictly speaking, Hindustani, like most other Indian languages, is rather a syllable-timed language. The schwa /?/ has a strong tendency to vanish into nothing (syncopated) if its syllable is unaccented.
... showed extremely regular patterns. As is not uncommon in a study of subphonemic detail, the objective data patterned much more cleanly than intuitive judgments ... [w] occurs when /? ?/ is in onglide position ... [v] occurs otherwise ...