| Literature DB >> 28714977 |
Nathan Nakatsuka1,2, Priya Moorjani3,4, Niraj Rai5, Biswanath Sarkar6, Arti Tandon1,4, Nick Patterson4, Gandham SriLakshmi Bhavani7, Katta Mohan Girisha7, Mohammed S Mustak8, Sudha Srinivasan9, Amit Kaushik10, Saadi Abdul Vahab11, Sujatha M Jagadeesh12, Kapaettu Satyamoorthy11, Lalji Singh13, David Reich1,4,14, Kumarasamy Thangaraj5.
Abstract
The more than 1.5 billion people who live in South Asia are correctly viewed not as a single large population but as many small endogamous groups. We assembled genome-wide data from over 2,800 individuals from over 260 distinct South Asian groups. We identified 81 unique groups, 14 of which had estimated census sizes of more than 1 million, that descend from founder events more extreme than those in Ashkenazi Jews and Finns, both of which have high rates of recessive disease due to founder events. We identified multiple examples of recessive diseases in South Asia that are the result of such founder events. This study highlights an underappreciated opportunity for decreasing disease burden among South Asians through discovery of and testing for recessive disease-associated genes.Entities:
Mesh:
Year: 2017 PMID: 28714977 PMCID: PMC5675555 DOI: 10.1038/ng.3917
Source DB: PubMed Journal: Nat Genet ISSN: 1061-4036 Impact factor: 38.330
Figure 1Dataset overview
(a) Sampling locations for all analyzed groups. Each point indicates a distinct group (random jitter was added to help in visualization at locations where there are many groups). (b) PCA of Human Origins dataset along with European Americans (CEU) and Han Chinese (CHB). There is a large cluster (blue) of IndoEuropean and Dravidian speaking groups that stretch out along a line in the plot and that are well-modeled as a mixture of two highly divergent ancestral populations (the “Indian Cline”). There is another larger cluster of Austroasiatic speakers (light red) and groups that cluster with them genetically (dark red). Finally, there are groups with genetic affinity to East Asians that include Tibeto-Burman speakers (orange) and those that speak other languages (yellow).
Figure 2Example histograms of IBD segments to illustrate the differences between groups with founder events of different magnitudes
These histograms provide visual illustrations of differences between groups with different IBD scores. As a ratio relative to Finns (FIN; black), these groups (red) have IBD scores of: (A) ~26 in Ulladan, (B) ~3 in Birhor, (C) ~0.9 in Ashkenazi Jews, and (D) ~0.1 in Mahadeo_Koli. In each plot, we also show European Americans (CEU) with a negligible founder event in blue. Quantification of these founder events is shown in Figure 3 and Supplementary Table 5. The IBD histograms were normalized for sample size by dividing their frequency by , where n is the number of individuals in the sample. All data for the figure are based on the Human Origins dataset.
Figure 3IBD scores relative to Finns (FIN)
Histogram ordered by IBD score, roughly proportional to the per-individual risk for recessive disease due to the founder event. (These results are also given quantitatively for each group in Online Table 1.) We restrict to groups with at least two samples, combining data from all four genotyping platforms onto one plot. Data from Ashkenazi Jews and Finns are highlighted in red, and from South Asian groups with significantly higher IBD scores than that of Finns and census sizes of more than a million in brown. Error bars for each IBD score are standard errors calculated by weighted block jackknife over each chromosome. YRI=Yoruba (West African); CEU=European American.
South Asian groups with estimated census sizes over 1 million and IBD scores significantly greater than those of Ashkenazi Jews and Finns
Fourteen South Asian groups with IBD scores significantly higher than that of Finns, census sizes over 1 million, and sample sizes of at least 3 that are of particularly high interest for founder event disease gene mapping studies. For reference, Finns and Ashkenazi Jews (on the Human Origins array) would have IBD scores of 1.0 and 0.9, IBD ranks of 121 and 135, and FST ranks of 109 and 129, respectively (the group-specific drift is difficult to compare for groups with significantly different histories, so they were not calculated for Finns or Ashkenazi Jews).
| Group | Sample Size | IBD Score | IBD Rank | FST Rank | Drift Rank | Census Size | Location |
|---|---|---|---|---|---|---|---|
| Gujjar | 5 | 11.6 | 19 | 33 | 46 | 10,78,719 | Jammu and Kashmir |
| Baniyas | 7 | 9.6 | 24 | 22 | 18 | 42,00,000 | Uttar Pradesh |
| Pattapu_Kapu | 4 | 9.5 | 25 | 24 | 21 | 1,36,97,000 | Andhra Pradesh |
| Vadde | 3 | 9.2 | 26 | 30 | 26 | 36,95,000 | Andhra Pradesh |
| Yadav | 12 | 4.4 | 48 | 87 | 67 | 11,24,864 | Puducherry |
| Kshatriya_Aqnikula | 4 | 2.4 | 75 | 109 | NA | 1,28,09,000 | Andhra Pradesh |
| Naga | 4 | 2.3 | 76 | NA | NA | 18,34,483 | Nagaland |
| Kumhar | 27 | 2.3 | 77 | 35 | 197 | 31,44,000 | Uttar Pradesh |
| Reddy | 7 | 2 | 84 | 129 | 106 | 2,25,00,000 | Telangana |
| Brahmin_Nepal | 4 | 1.9 | 86 | 63 | 141 | 42,06,235 | Nepal |
| Kallar | 27 | 1.7 | 94 | 87 | 73 | 24,26,929 | Tamil Nadu |
| Brahmin_Manipuri | 17 | 1.6 | 99 | NA | NA | 15,44,296 | Manipur |
| Arunthathiyar | 18 | 1.3 | 108 | 109 | 81 | 11,92,578 | Tamil Nadu |
| Vysya | 39 | 1.2 | 110 | 46 | 35 | 32,00,000 | Telangana |