A Rundown of Indian Languages

A lot of people are becoming more aware that India has more than a single language that is spoken across the country. Even though Hindi is the official lingua franca, there are twenty-two official languages of India, which come from four different language families: Indo-Aryan, Dravidian, Tibeto-Burman, and Munda, excluding English. 


However, only six classical languages recognized by the government as such, which include Tamil, Sanskrit, Kannada, Telugu, Malayalam, and Oriya. In 2006, Minister of Tourism & Culture Ambika Soni defined a classical language as:

“High antiquity of its early texts/recorded history over a period of 1500–2000 years; a body of ancient literature/texts, which is considered a valuable heritage by generations of speakers; the literary tradition be original and not borrowed from another speech community; the classical language and literature being distinct from modern, there may also be a discontinuity between the classical language and its later forms or its offshoots.”

This is not to say that the other languages are rich in literature of their own. In fact, many modern works of literature from India were written in non-classical languages, as defined by Soni, including, but not limited to, Bengali and Marathi.

Some history is required to understand why there are so many different, non-mutually-intelligible languages in India. There are at least two major progenitor languages that are seen as major presences in India: Sanskrit and the Proto-Dravidian. It is hypothesized by scholars that migrants from what is modern day Turkey and Iran came to India from the northwest, through Pakistan, settling throughout the north, in around the 2nd millennium BCE. Proto-Dravidian, on the other hand, is native to the subcontinent, existing for much longer in India than Sanskrit. Proto-Dravidian was spoken primarily in the South. The origins of the Munda family are unknown, though it has been shown that they are distantly related to Khmer and Vietnamese, as well as  other minority languages through Southeast Asia. And it is important to remember that another large influence on Indian languages are the Farsi and Arabic languages, which came only much later to the subcontinent, through the Mughal empire. It is for that reason that the Arabic and Farsi heavy form of Hindi, known as Urdu, exists today.

Sanskrit is the liturgical language of Hinduism, and is used almost exclusively as such today, though it is an official language of Uttarakhand, and there are efforts to revive its usage. It is the language of the Bhagavad Gita and the Vedas, the core texts of the Hindu religion, though all of them have been translated into the other languages. It is also studied as a classical language in schools, in much same way Latin used to be a required subject in Western schools. Many languages, primarily in North India, borrow much of their vocabulary from Sanskrit, so it is very helpful to know. Classical Sanskrit’s formal grammar was standardized by Pāṇini, a Sanskrit grammarian, in his major work, Aṣṭādhyāyī (“Eight-Chapter Grammar”), written in 500 BCE, and is still used as the authority on the Sanskrit language today. Some of the South Indian languages, which are primarily Dravidian in origin, also borrow, to a lesser extent, from Sanskrit. Words from Sanskrit in Dravidian languages are often easily noticed by features such as the presence of aspirated consonants, and the consonant clusters dr as opposed to ḍr, and tr as opposed to ṭr.

Now, Dravidian languages are from a completely different family from the languages of the North, and share no similarities with them. Sanskrit penetrated South India as the language of the Maurya Empire, which included all of North India as well as much of South India, save for the tip of the peninsula, which largely encompasses the Tamil-speaking state of Tamil Nadu today. Tamil is not at all intelligible with any North Indian language, and influenced many Southern languages as well. The Dravidian ancestor language developed solely on the Indian subcontinent, eventually dividing into the Southern languages, such as Tamil, Kannada, Telugu, Konkani, and Malayalam. Tamil remains a sort of oddity among the Indian languages, as it is a liturgical language, of the Ayyavizhi tradition, and also exhibits unique traits as a language, because it distinguishes three different forms: a classical form based on the ancient form of the language, a modern literary form, and a modern colloquial spoken form. Tamil is also spoken in other countries as an official language, including Singapore and Sri Lanka, making it more relevant than just within India.


Kuzoian, Alex. “This Animated Map Shows How European Languages Evolved.” Business Insider. Business Insider, Inc, 10 Dec. 2014. Web. 26 June 2015.

“Geography and India’s Language Debate.” Z Geography. 14 Mar. 2013. Web. 26 June 2015.

“South Asian Language Families.” 27 Oct. 2007. Web. 26 June 2015.

“South Asian Language Families” links to: https://upload.wikimedia.org/wikipedia/commons/1/12/South_Asian_Language_Families.jpg

CC BY-SA 3.0 links to: http://creativecommons.org/licenses/by-sa/3.0/

The Process of “Conlanging” – Avreça, My Conlang

Since I joined a group called “Constructed languages” on Facebook, a conlanging side project of mine, called Avreça, has seen significant progress. From a document of a measly six pages, it has become an expansive grammar and vocabulary list of forty-two pages. I must confess that not long ago, I was highly opposed to the entire principle of a conlang. You can see a post from that time right here. I used to think that it was fundamentally pointless to create a language, especially out of existing ones, for use by the international community. You run into a number of important issues, such as: is it equitable to only include some languages and not others? is the range of expression of the constructed language vastly more limited than its constituent sources? is this practical to learn?

To address the first question, there is almost no solution. Actively including only a set category of languages, such as Slavic or Romance, is inherently exclusive. Whether it is difficult to learn is not the question here. This does indeed create the impression of superiority of that language family over others.

Second question: It’s almost a given that the expression in the constructed language won’t be as well developed. To arbitrarily give new meanings and connotations to words that didn’t have any previously is simply not appropriate, because those meanings develop with time and history. There has to be a context for it. However, the reason that Tolkien’s conlangs can do this is because his worldbuilding gives the context for the expression in those languages. If the con-universe gives a history to the language, then the vocabulary ought to reflect that history.

Third issue: The practicality of a language is entirely dependent on how the person chooses to construct the language. However, including certain features of the constituent languages is also a question of expression and equitability, the grammars of each languages have different nuances.

Granted, these things largely apply to conlangs that draw on existing languages for their lexicons and grammars. In the case of Tolkien, he built from the ground up. However, my conlang is the former, because I’m not experienced enough to build the morphology and roots from scratch.

I never intended my conlang to be used by a large community, but they certainly could if they so wished. As such, the issues mentioned above don’t necessarily apply, but I still stress practicality. It is a bit Romance-centric in terms of its general vocabulary and grammar, but its poetic and literary features draw from Dravidian languages, Sanskrit, and Hindi-Urdu. I started my conlang mostly for recreational purposes, but I had also started writing a story into which I wanted to incorporate my conlang. And thus, Avreça was born. You can download a document detailing the conlang here: http://www.mediafire.com/view/92ijw6jx0ii957a/Avreça.pdf.

Avreça’s grammar, as previously mentioned, is primarily Romance-centric. There exists an indicative and subjunctive version of each verb conjugation. Currently, all verbs are regular, but if it were to be used more often by actual people, I’m certain that many verbs would be become irregular. As per the tradition, so to speak, of Romance languages, there are three types of verbs, -ar, -er, and -ir.

A rather curious feature of Avreça is that it actively distinguishes between poetic/literary language and common language, which are nearly always separate. The actual poetic/literary lexicon, as I said, is largely Dravidian, Sanskrit, and Hindi-Urdu in origin. Only two words come from Japanese, al ossache and al hyachia, which are derived from sake and hyakki yagyō. The concept of ossache is similar to the Japanese tea ceremony, whereas hyachia describes any shady or suspicious part of society, usually in an abstract way.

I hope you enjoyed reading this post, and be sure to share this on Facebook, Google Plus, and Tumblr!