|South, Central, Western Asia, South East Europe and the Caucasus / Total speakers = approximately 1.5 billion in 15 countries|
The approximate present-day distribution of the Indo-European branches of Eurasia:
The Indo-Iranian languages (Indo-Iranic languages), or Aryan languages constitute the largest and southeasternmost extant branch of the Indo-European language family. It has more than 1.5 billion speakers, stretching from Europe (Romani), Turkey (Kurdish and Zaza-Gorani) and the Caucasus (Ossetian) eastward to Xinjiang (Sarikoli) and Assam (Assamese), and south to Sri Lanka (Sinhala) and the Maldives (Maldivian). Furthermore, there are large communities of Indo-Iranian speakers in northwestern Europe (the United Kingdom), North America (United States and Canada), and Australia.
The common ancestor of all of the languages in this family is called Proto-Indo-Iranian--also known as Common Aryan--which was spoken in approximately the late 3rd millennium BC. The three branches of the modern Indo-Iranian languages are Indo-Aryan, Iranian, and Nuristani. A fourth independent branch, Dardic, was previously posited, but recent scholarship in general places Dardic languages as archaic members of the Indo-Aryan branch.
The Indo-Iranian languages consist of three groups:
Indo-Iranian languages are spoken by more than 1.5 billion people. The languages with the most speakers are a part of the Indo-Aryan group: Hindustani (Hindi/Urdu), (~590 million), Bengali (205 million), Punjabi (100 million), Marathi (75 million), Gujarati (50 million), Bhojpuri (40 million), Awadhi (40 million), Maithili (35 million), Odia (35 million), Marwari (30 million), Sindhi (25 million), Assamese (24 million), Rajasthani (20 million), Chhattisgarhi (18 million), Sinhala (19 million), Nepali (17 million), Bishnupuriya (12 million) and Rangpuri (15 million). Among the Iranian branch, major languages are Persian (60 million), Pashto (ca. 50 million), Kurdish (35 million), and Balochi (8 million). There are also many smaller languages.
This section needs expansion. You can help by adding to it. (December 2018)
The oldest attested Indo-Iranian languages are Vedic Sanskrit, Older and Younger Avestan and Old Persian (ancient Iranian languages). A few words from another Indo-Aryan language (see Indo-Aryan superstrate in Mitanni) are attested in documents from the ancient Mitanni and Hittite kingdoms in the Near East.
Within the Indo-European family, Indo-Iranian belongs to the Satem group. Various proposals have been made that link the Indo-Iranian languages with other subgroups of Indo-European (like Graeco-Aryan, which posits a close relationship with Greek and Armenian), but these remain without wider acceptance.
Innovations shared with other languages affected by the satem sound changes include:
'Dardic' is a geographic cover term for those Northwest Indo-Aryan languages which [..] developed new characteristics different from the IA languages of the Indo-Gangetic plain. Although the Dardic and Nuristani (previously 'Kafiri') languages were formerly grouped together, Morgenstierne (1965) has established that the Dardic languages are Indo-Aryan, and that the Nuristani languages constitute a separate subgroup of Indo-Iranian.