Language isolate

From Wikipedia, the free encyclopedia
Jump to: navigation, search

A language isolate, in the absolute sense, is a natural language with no demonstrable genealogical (or "genetic") relationship with other languages, one that has not been demonstrated to descend from an ancestor common with any other language. Language isolates are in effect language families consisting of a single language. Commonly cited examples include Ainu, Basque, Korean, Sumerian, and Elamite, though in each case a minority of linguists claim to have demonstrated a relationship with other languages.[1]

Some sources use the term "language isolate" to indicate a branch of a larger family with only one surviving daughter. For instance, Albanian, Armenian and Greek are commonly called Indo-European isolates. While part of the Indo-European family, they do not belong to any established branch (such as the Romance, Celtic or Germanic branches), but instead form independent branches. Similarly, within the Romance languages, Sardinian is a relative isolate. However, without a qualifier, isolate is understood to be in the absolute sense of having no demonstrable genetic relationship to any other known language.

Some languages once seen as isolates may be reclassified as small families. This happened with Japanese (now included in the Japonic family along with Ryukyuan languages such as Okinawan) and Georgian (now the most dominant or standard of the Kartvelian languages of the Caucasus). The Etruscan language of Italy has long been considered an isolate, but some have proposed that it is related to the so-called Tyrsenian languages, an extinct family of closely related ancient languages proposed by Helmut Rix (1998), which includes the Raetic language of the Alps and the Lemnian language of the Aegean Sea. The Japonic and Kartvelian families are widely accepted by linguists, but since the ancient family that includes Etruscan has not received a similar level of acceptance,[citation needed] Etruscan is still included in the list of language isolates.

Language isolates may be seen as a special case of unclassified languages that remain unclassified even after extensive efforts. If such efforts eventually do prove fruitful, a language previously considered an isolate may no longer be considered one, as happened with the Yanyuwa language of northern Australia, which has been placed in the Pama–Nyungan family. Since linguists do not always agree on whether a genetic relationship has been demonstrated, it is often disputed whether a language is an isolate or not.

"Genetic" or "genealogical" relationships[edit]

The term "genetic relationship" is meant in the genealogical sense of historical linguistics, which groups most languages spoken in the world today into a relatively small number of families, according to reconstructed descent from common ancestral languages. For example, English is related to other Indo-European languages and Mandarin is related to other Sino-Tibetan languages. By this criterion, each language isolate constitutes a family of its own, which explains the exceptional interest that these languages have received from linguists.[2]

Looking for relationships[edit]

It is possible that all natural languages spoken in the world today are related by direct or indirect descent from a single ancestral tongue. The established language families would then be only the upper branches of the genealogical tree of all (or most) languages, or, equally, lower progeny of a parent tongue. For this reason, language isolates have been the object of numerous studies seeking to uncover their genealogy. For instance, Basque has been compared with every living and extinct Eurasian language family known, from Sumerian to South Caucasian, without conclusive results.

There are some situations in which a language with no ancestor might arise. This frequently happens with sign languages, most famously in the case of Nicaraguan Sign Language, where deaf children with no language were placed together and developed a new language. Similarly, if deaf parents were to raise a group of hearing children who have no contact with others until adulthood, they might develop an oral language among themselves and keep using it later, teaching it to their children, and so on. Eventually, it could develop into the full-fledged language of a population. With unsigned languages, this is not very likely to occur at any one time but, over the tens of thousands of years of human prehistory, the likelihood of this occurring at least a few times increases. There are also creole languages and constructed languages such as Esperanto, which do not descend directly from a single ancestor but have become the language of a population; however, they do take elements from existing languages.

Extinct isolates[edit]

Caution is required when speaking of extinct languages as isolates. Despite their great age, Sumerian and Elamite can be safely classified as isolates, as the languages are well enough known that, if modern relatives existed, they would be recognizably related.

However, many extinct languages are very poorly attested, and the fact that they cannot be linked to other languages may be a reflection of our poor knowledge of them. Etruscan, for example, is sometimes claimed to be Indo-European. Although most historical linguists believe this is unlikely, it is not yet possible to resolve the issue. Hattian, Gutian,[3] Hurrian, Mannean and Kassite are also believed to be isolates by mainstream majority, but their status is disputed by a minority of linguists. Similar situations pertain to many extinct isolates of the Americas such as Beothuk and Cayuse. A language thought to be an isolate may turn out to be relatable to other languages once enough material is recovered, but material is unlikely to be recovered if a language was not written.

Sign language isolates[edit]

A number of sign languages have arisen independently, without any ancestral language, and thus are true language isolates. The most famous of these is the Nicaraguan Sign Language, a well documented case of what has happened in schools for the deaf in many countries. In Tanzania, for example, there are seven schools for the deaf, each with its own sign language with no known connection to any other language.[4] Sign languages have also developed outside schools, in communities with high incidences of deafness, such as Kata Kolok in Bali, the Adamorobe Sign Language in Ghana, the Urubú Sign Language in Brazil, several Mayan sign languages, and half a dozen sign languages of the hill tribes in Thailand including the Ban Khor Sign Language.

Studies are also being conducted on Al-Sayyid Bedouin Sign Language (ABSL) in an isolated village in Israel. The language was developed in isolation for over 75 years by both deaf and hearing people within the village.[5]

These and more are all presumed isolates or small local families, because many deaf communities are made up of people whose hearing parents do not use sign language, and have manifestly, as shown by the language itself, not borrowed their sign language from other deaf communities during the recorded history of these languages.[citation needed]

List of language isolates by continent[edit]

Below is a list of known language isolates, arranged by continent, along with notes on possible relations to other languages or language families.

In the Status column, "vibrant" means that a language is in full use by the community and being acquired as a first language by children. "Moribund" means that a language is still spoken, but only by older people; it is not being acquired by children, and without efforts to revive it will become extinct when current speakers die. "Extinct" means a language is no longer spoken. The terms "living" and "extinct" are defined by the classification of "Language Types" in ISO 639-3; "vibrant" is equivalent to "living" or sometimes "endangered", depending on efforts to preserve the language, and "moribund" is "endangered".

[Where do these definitions come from?]


Data for several African languages are not sufficient for classification. In addition, Jalaa, Shabo, Laal, Kujarge, and a few other languages within Nilo-Saharan and Afroasiatic-speaking areas may turn out to be isolates upon further investigation. Defaka and Ega are highly divergent languages located within Niger-Congo-speaking areas, and may also possibly be language isolates.[6]

Language Status Countries Comments
Bangime Vibrant  Mali Spoken in the Dogon Cliffs. Used as an anti-language.
Hadza Vibrant  Tanzania Once listed as an outlier among the Khoisan languages. Language use is vigorous, though there are fewer than 1,000 speakers
Sandawe Vibrant  Tanzania Tentatively linked to the Khoe languages of southern Africa.


Language Status Countries Comments
Ainu Moribund  Japan,  Russia Formerly spoken throughout Sakhalin, the Kuril Islands and Hokkaido, now reduced to a handful of speakers in Hokkaido. May actually constitute a small language family, if the extinct varieties are classed as languages rather than dialects. Possibly related to the unattested language of the Emishi.
Burushaski Vibrant  Pakistan Spoken in the Hunza Valley of northern Pakistan.
Elamite Extinct  Iran Spoken in the Elamite Empire. Some propose a relationship to the Dravidian languages (see Elamo-Dravidian), but this is not well-supported.
Enggano Vibrant  Indonesia Spoken on Enggano Island, west of the southern tip of Sumatra. Classified by some as a language isolate, and by others as Austronesian. However, general consensus holds that it has both Austronesian and non-Austronesian origins.
Hattic Extinct  Turkey Spoken in Asia Minor before the 2nd millennium BCE. Connections to all three major indigenous language families of Caucasus have been proposed.
Korean Vibrant  North Korea,  South Korea With over 78 million speakers, Korean has more speakers than all other language isolates combined. Connections to the Altaic languages had been proposed, but have been generally discredited by most linguists.[7] It has also been proposed that Korean may be related to Japanese in the Japanese-Korean classification hypothesis, both with and without a common Altaic ancestor. Others notice a connection between Korean and the Paleosiberian languages.[8] Sometimes classified as a language family, forming the Koreanic family with the Jeju language.
Kusunda Moribund    Nepal Spoken in the Gandaki Zone. The recent discovery of a few speakers shows that it is not demonstrably related to anything else.
Nihali Endangered  India Also known as Nahali. Spoken in northern Maharashtra and southwestern Madhya Pradesh. Strong lexical Munda influence. Used as anti-language by speakers
Nivkh Moribund  Russia Also known as Gilyak. Spoken in the lower Amur River basin and on the Sakhalin Islands. Dialects sometimes considered two languages. Has been linked to Chukotko-Kamchatkan languages.
Puroik Vibrant  India Also known as Sulung. Variously regarded as either a language isolate or as a Sino-Tibetan branch.
Sumerian Extinct  Iraq Long-extinct but well-attested language of ancient Sumer.


The languages of New Guinea are poorly studied, and candidates for isolate status are likely to change when more becomes known about them.

Language Status Countries Comments
Abinomn Vibrant  Indonesia Spoken in New Guinea. Also known as Baso, Foia. Language use is vigorous, despite low number of speakers.
Amberbaken Endangered  Indonesia Spoken on Bird's Head Peninsula. Tentatively linked to the West Papuan languages.
Anem Vibrant  Papua New Guinea Spoken on New Britain. Perhaps related to Yélî Dnye and Ata.
Ata Endangered  Papua New Guinea Spoken on New Britain. Also known as Wasi. Perhaps related to Yélî Dnye and Anem.
Busa Vibrant  Papua New Guinea Spoken in New Guinea, in three villages in the Upper Sepik River. Also known as Odiai.
Enindhilyagwa Vulnerable  Australia Spoken on Groote Eylandt in the Gulf of Carpentaria. Also known as Andilyaugwa. Classified as part of the Macro-Gunwinyguan languages.
Giimbiyu Extinct  Australia Part of a proposal for an Arnhem Land language family.
Isirawa Vulnerable  Indonesia Spoken in New Guinea. Formerly classified as Trans–New Guinea. Part of a proposal for a North Papuan family.
Kakadju Extinct  Australia Also known as Gaagudu. Part of a proposal for an Arnhem Land language family.
Kol Vibrant  Papua New Guinea Spoken on New Britain.
Kuot Vulnerable  Papua New Guinea Spoken on New Ireland. Also known as Panaras.
Laragiya Moribund  Australia Spoken in the Darwin area. Part of a proposal for a Darwin language family.
Massep Moribund  Indonesia Spoken in New Guinea. A link to the Trans–New Guinea languages is being explored.
Ngurmbur Extinct  Australia Extinct since ca. 1990. Spoken in northern Australia. Perhaps related to the Pama–Nyungan languages.
Pyu Endangered  Papua New Guinea Spoken in New Guinea. Formerly classified as Kwomtari–Baibai.
Sulka Vibrant  Papua New Guinea Spoken on New Britain. Primary schools teach the language
Taiap Endangered  Papua New Guinea Spoken by around a hundred people in East Sepik Province. Also known as Gapun, formerly classified as Sepik-Ramu.
Tiwi Vulnerable  Australia Spoken in the Tiwi Islands in the Timor Sea.
Umbugarla Extinct  Australia Part of a proposal for an Darwin language family.
Wagiman Moribund  Australia Spoken in the north-central area of the Northern Territory.
Wardaman Moribund  Australia Spoken in the north-central area of the Northern Territory. Sometimes classified as two languages in a Yagmanic family.
Yalë Vibrant  Papua New Guinea Spoken in New Guinea. Also known as Nagatman.
Yele Vibrant  Papua New Guinea Spoken on Rossel Island. Perhaps related to Anem and Ata.


Language Status Countries Comments
Basque Vulnerable  Spain,  France Natively known as Euskara, the Basque language, found in the historical region of the Basque Country between France and Spain, is the second most-widely spoken language isolate after Korean. It has no known living relatives, although Aquitanian is commonly regarded as related to or a direct ancestor of Basque. Some linguists have claimed similarities with various languages of the Caucasus that are indicative of a relationship, while others have proposed a relation to Iberian and to the hypothetical Dené–Caucasian languages.
Etruscan Extinct  Italy Language of the ancient Etruscans in northwestern Italy; not well attested. Some have suggested a Tyrrhenian family consisting of Etruscan, Lemnian, and possibly Raetic and Camunic.

North America[edit]

Language Status Countries Comments
Atakapa Extinct  United States Was spoken in Texas and Louisiana. Often linked to Muskogean in a Gulf hypothesis.
Chimariko Extinct  United States Was spoken in California. Part of the Hokan hypothesis.
Chitimacha Extinct  United States Was spoken in Louisiana. Often linked to Muskogean in a Gulf hypothesis.
Coahuilteco Extinct  United States,  Mexico Was spoken in Texas and northeastern Mexico. Part of the Hokan hypothesis.
Cuitlatec Extinct  Mexico Was spoken in Guerrero.
Esselen Extinct  United States Poorly known. Was spoken in California. Part of the Hokan hypothesis.
Haida Moribund  Canada,  United States Spoken in Alaska and British Columbia. Some proposals connect it to the Na-Dené languages, but these have fallen into disfavor.
Huave Endangered  Mexico Spoken in Oaxaca, Mexico. Part of the Penutian hypothesis when extended to Mexico, but this idea has generally been abandoned.
Karuk Moribund  United States Spoken in California. Part of the Hokan hypothesis.
Kutenai Moribund  Canada,  United States Spoken in Idaho, Montana and British Columbia.
Natchez Extinct  United States Was spoken in Mississippi and Louisiana. Often linked to Muskogean in a Gulf hypothesis.
Purépecha Endangered  Mexico Spoken by the Purépecha people in the state of Michoacán. Language of the ancient Tarascan kingdom. Sometimes regarded as two languages.
Salinan Extinct  United States Was spoken in California. Part of the Hokan hypothesis.
Seri Vulnerable  Mexico Spoken in Sonora. Part of the Hokan hypothesis.
Siuslaw Extinct  United States Was spoken in Oregon. Likely related to Coos, Alsea, possibly the Wintuan languages. Part of the Penutian hypothesis.
Takelma Extinct  United States Spoken in Oregon. Part of the Penutian hypothesis. A specific relationship with Kalapuyan is now rejected.
Timucua Extinct  United States Well attested. Was spoken in Florida and Georgia. A connection with the poorly known Tawasa language has been suggested, but this may be a dialect.
Tonkawa Extinct  United States Was spoken in Texas.
Tunica Extinct  United States Was spoken in Mississippi, Louisiana, and Arkansas.
Washo Moribund  United States Spoken in California and Nevada. Part of the Hokan hypothesis.
Yana Extinct  United States Was spoken in California. Part of the Hokan hypothesis.
Yuchi Moribund  United States Spoken in Georgia and Oklahoma. Connections to Siouan languages have been proposed.
Zuni Vulnerable  United States Spoken in Zuni Pueblo, New Mexico. Connections to Penutian languages have been proposed, but is generally considered unlikely.

South America[edit]

Language Status Countries Comments
Aikaná Endangered  Brazil Spoken in Rondônia. Arawakan has been suggested.
Andoque Endangered  Colombia,  Peru May be extinct now. Possibly Witotoan.
Betoi Extinct  Colombia Paezan has been suggested.
Camsá Endangered  Colombia Also known as Kamsa, Coche, Sibundoy, Kamentxa, Kamse, or Camëntsëá.
Candoshi Endangered  Peru Spoken along the Chapuli, Huitoyacu, Pastaza, and Morona river valleys.
Canichana Extinct  Bolivia A connection with the extinct Tequiraca (Auishiri) has been proposed.
Cayuvava Moribund  Bolivia Spoken in Bolivia. Speakers live west of Mamore River, north of Santa Ana del Yacuma in the Beni Department.
Chimané Vulnerable  Bolivia Also spelled Tsimané. Sometimes split into multiple languages in a Moséten family. Linked to the Chonan languages in a Moseten-Chonan hypothesis.
Cofán Endangered  Colombia,  Ecuador Also called A'ingae. Sometimes classified as Chibchan, but the similarities appear to be due to borrowings. Seriously endagered in Colombia.
Waorani Vulnerable  Ecuador,  Peru Also known as Sabela. Spoken between the Napo and Curaray rivers. Could be spoken by several uncontacted groups.
Irantxe? Moribund  Brazil Also known as Iranche or Münkü. Spoken in Mato Grosso.
Itonama Moribund  Bolivia Paezan has been suggested. 5 speakers remaining.
Kunza Extinct  Chile Was spoken in areas near Salar de Atacama. Also known as Atacameño.
Kanoê Moribund  Brazil Spoken in Rondônia. Also known as Kapishana.
Leco Endangered  Bolivia Thought to be extinct, recently rediscovered in areas east of Lake Titicaca.
Mapudungun Endangered  Chile,  Argentina Also known as Araucano or Araucanian. Considered a family of 2 languages by Ethnologue. Variously part of Andean, macro-Panoan, or macro-Waikuruan proposals. Sometimes Huilliche is treated as a separate language, reclassifying Mapudungun into an Araucanian family.
Movima Endangered  Bolivia Spoken in the Llanos de Moxos region.
Otí Extinct  Brazil Was spoken in São Paulo. Macro-Gêan has been suggested.
Páez Endangered  Colombia Several proposed relationships in the Paezan hypothesis but nothing conclusive.
Tequiraca Extinct  Peru Also known as Auishiri. A connection with Canichana has been proposed.
Trumai Endangered  Brazil Settled on the upper Xingu River. Currently reside in the Xingu National Park in the state of Mato Grosso.
Urarina Vulnerable  Peru Spoken in Loreto Region. Part of the Macro-Jibaro proposal.
Warao Endangered  Guyana,  Suriname,  Venezuela Sometimes linked to Paezan.
Yámana Moribund  Chile Spoken in southern Tierra del Fuego. Also called Yaghan. Last speaker is Cristina Calderón, who is 89 years old.
Yuracaré Endangered  Bolivia Connections to Mosetenan, Pano–Tacanan, Arawakan, and Chon have been suggested.

See also[edit]


  1. ^ Campbell, Lyle (2010-08-24). "Language Isolates and Their History, or, What's Weird, Anyway?". Annual Meeting of the Berkeley Linguistics Society. 36 (1): 16–31. doi:10.3765/bls.v36i1.3900. ISSN 2377-1666. 
  2. ^ Grey., Thomason, Sarah. Language contact, creolization, and genetic linguistics. Kaufman, Terrence, 1937-. Berkeley. ISBN 0520078934. OCLC 16525266. 
  3. ^ Jump up ^ Mallory, J.P.; Mair, Victor H. (2000). The Tarim Mummies. London: Thames & Hudson. pp. 281–282. ISBN 978-0-500-05101-6.
  4. ^ Tanzanian Sign Language (TSL) Dictionary. H.R.T. Muzale, University of Dar es Salaam, 2003
  5. ^ "American Sign Language". NIDCD. 2015-08-18. Retrieved 2017-01-25. 
  6. ^ Roger Blench, Niger-Congo: an alternative view
  7. ^ Sanchez-Mazas; Blench; Ross; Lin; Pejros, eds. (2008), "Stratification in the peopling of China: how far does the linguistic evidence match genetics and archaeology?", Human migrations in continental East Asia and Taiwan: genetic, linguistic and archaeological evidence, Taylor & Francis 
  8. ^ Vovin, Alexander (2015). "Korean as a Paleosiberian Language". 알타이할시리즈 2. ISBN 978-8-955-56053-4. Retrieved 2016-11-06. 


External links[edit]