Document Type

Journal Article

Date of this Version


Publication Source

Molecular Biology and Evolution





Start Page


Last Page





We analyzed 40 single nucleotide polymorphism and 19 short tandem repeat Y-chromosomal markers in a large sample of 1,525 indigenous individuals from 14 populations in the Caucasus and 254 additional individuals representing potential source populations. We also employed a lexicostatistical approach to reconstruct the history of the languages of the North Caucasian family spoken by the Caucasus populations. We found a different major haplogroup to be prevalent in each of four sets of populations that occupy distinct geographic regions and belong to different linguistic branches. The haplogroup frequencies correlated with geography and, even more strongly, with language. Within haplogroups, a number of haplotype clusters were shown to be specific to individual populations and languages. The data suggested a direct origin of Caucasus male lineages from the Near East, followed by high levels of isolation, differentiation, and genetic drift in situ. Comparison of genetic and linguistic reconstructions covering the last few millennia showed striking correspondences between the topology and dates of the respective gene and language trees and with documented historical events. Overall, in the Caucasus region, unmatched levels of gene–language coevolution occurred within geographically isolated populations, probably due to its mountainous terrain.

Copyright/Permission Statement

This is a pre-copy-editing, author-produced PDF of an article accepted for publication in Molecular Biology and Evolution following peer review. The definitive publisher-authenticated version Balanovsky O, Dibirova K, Dybo A, Mudrak O, Frolova S, Pocheshkhova E, Haber M, Platt D, Schurr TG, Haak W, Kuznetsova M, Radzhabov M, Balaganskaya O, Druzhinina E, Zakharova T, Koshel S, Renfrew C, Wells RS, Tyler-Smith C, Balanovska E, The Genographic Consortium. Parallel evolution of genes and languages in the Caucasus. Mol Biol Evol 28(10): 2905-2920] is available online at:


The full list of Genographic Consortium members are listed in the Appendix.


caucasian, Y chromosomes, haplogroup, population, langauge, population diversity



Date Posted: 18 December 2014

This document has been peer reviewed.