| Literature DB >> 32929667 |
Li Qu1,2, Luotong Wang1,3, Feifei He1,3, Yilun Han1,3, Longshu Yang1,3, May D Wang2, Huaiqiu Zhu4,5,6.
Abstract
BACKGROUND: Variations in the human genome have been studied extensively. However, little is known about the role of micro-inversions (MIs), generally defined as small (< 100 bp) inversions, in human evolution, diversity, and health. Depicting the pattern of MIs among diverse populations is critical for interpreting human evolutionary history and obtaining insight into genetic diseases.Entities:
Keywords: Evolution; Genome; High-throughput sequencing; Micro-inversions; Structural variations
Year: 2020 PMID: 32929667 PMCID: PMC7658078 DOI: 10.1007/s12539-020-00392-6
Source DB: PubMed Journal: Interdiscip Sci ISSN: 1867-1462 Impact factor: 2.233
Fig. 1A flowchart outlining the procedures of MI calling, MI analysis, and key findings
Fig. 2MIs and MIRs in human chromosomes. a Schematic showing how an MIR is a region where MIs overlap (black bars, MIRs; blue bars, MIs in one individual). b Scatter plot of MIR count against chromosome length. c Length distribution of 6968 MIs (13–45 bp) and 2,140 MIRs (14–45 bp). d Distribution of locations of MIs on chromosomes
Overview of MIs detected in 1937 samples
| Pop | Population description | Sam-num | MIR-num | MI-num | Mul-sup | MI-num/Sam-num | Mul-sup/MIR-num |
|---|---|---|---|---|---|---|---|
| CHB | Han Chinese in Beijing, China | 86 | 119 | 202 | 30 | 2.35 | 25.21% |
| JPT | Japanese in Tokyo, Japan | 104 | 209 | 416 | 61 | 4.00 | 29.19% |
| CHS | Southern Han Chinese | 110 | 116 | 183 | 27 | 1.66 | 23.28% |
| CDX | Chinese Dai in Xishuangbanna, China | 99 | 59 | 99 | 16 | 1.00 | 27.12% |
| KHV | Kinh in Ho Chi Minh City, Vietnam | 101 | 176 | 249 | 32 | 2.47 | 18.18% |
| East Asia | 500 | 483 | 1149 | 110 | 2.30 | 16.2% | |
| CEU | Utah Residents (CEPH) with Northern and Western Ancestry | 100 | 300 | 387 | 37 | 3.87 | 12.33% |
| TSI | Toscani in Italia | 98 | 137 | 195 | 23 | 1.99 | 16.79% |
| FIN | Finnish in Finland | 99 | 237 | 283 | 18 | 2.86 | 7.59% |
| GBR | British in England and Scotland | 92 | 185 | 251 | 31 | 2.73 | 16.76% |
| IBS | Iberian Population in Spain | 108 | 199 | 271 | 31 | 2.51 | 15.58% |
| Europe | 497 | 877 | 1387 | 92 | 2.79 | 10.49% | |
| YRI | Yoruba in Ibadan, Nigeria | 68 | 156 | 247 | 33 | 3.63 | 21.15% |
| LWK | Luhya in Webuye, Kenya | 105 | 161 | 278 | 44 | 2.65 | 27.33% |
| GWD | Gambians in Western Divisions in the Gambia | 113 | 177 | 384 | 65 | 3.40 | 36.72% |
| MSL | Mende in Sierra Leone | 85 | 168 | 316 | 29 | 3.72 | 17.26% |
| ESN | Esan in Nigeria | 99 | 167 | 347 | 51 | 3.51 | 30.54% |
| ASW | Americans of African Ancestry in the SW USA | 66 | 149 | 293 | 50 | 4.44 | 33.56% |
| ACB | African Caribbeans in Barbados | 96 | 152 | 281 | 51 | 2.93 | 33.55% |
| Africa | 632 | 634 | 2146 | 192 | 3.40 | 30.28% | |
| MXL | Mexican Ancestry from Los Angeles, USA | 67 | 93 | 176 | 26 | 2.63 | 27.96% |
| PUR | Puerto Ricans from Puerto Rico | 57 | 107 | 220 | 46 | 3.86 | 42.99% |
| CLM | Colombians from Medellin, Colombia | 94 | 163 | 267 | 31 | 2.84 | 19.02% |
| PEL | Peruvians from Lima, Peru | 86 | 125 | 187 | 22 | 2.17 | 17.60% |
| America | 304 | 347 | 850 | 82 | 2.80 | 23.63% | |
| GIH | Gujarati Indians from Houston, Texas | 106 | 108 | 193 | 29 | 1.82 | 26.85% |
| PJL | Punjabi from Lahore, Pakistan | 96 | 141 | 351 | 54 | 3.66 | 38.30% |
| BEB | Bengali from Bangladesh | 86 | 131 | 270 | 36 | 3.14 | 27.48% |
| STU | Sri Lankan Tamil from the UK | 103 | 139 | 334 | 51 | 3.24 | 36.69% |
| ITU | Indian Telugu from the UK | 103 | 119 | 288 | 53 | 2.80 | 44.54% |
| South Asia | 494 | 377 | 1436 | 112 | 2.91 | 29.71% | |
| – | 2427 | 2140 | 6968 | 248 | 2.87 | 11.59% |
Pop population name; Sam-num the number of samples for the population; MIR-num the number of MIRs in each population; MI-num the number of inverted alleles in each population; Mul-sup the MIRs supported by at least two MIs; MI-num/Sam-num the ratio of the number of inverted alleles to the number of samples, which indicates the average inverted alleles per individual; Mul-sup/MIR-num the ratio of MIRs with multiple MIs to the total number of MIRs
Fig. 3Average counts of inverted alleles per individual. is the average count of inverted alleles per individual. Different colors represent the five super-populations: orange, African; red, American; blue, European; purple, South Asian; and green, East Asian. a Bar plot of the average count of inverted alleles per individual among the five super-populations. The data are presented as the mean ± standard error (SE). The average count ranged from 3.03 to 3.96. b Phylogenetic tree based on the average pairwise differences between super-populations according to the MIs in gene regions. c Geographic locations of the 26 populations
Fig. 4Phylogenetic trees of 26 populations constructed by neighbor joining. a Tree for the 26 populations based on average pairwise population distances calculated with all 6968 MIs. b Tree based on average pairwise population distances calculated with the 2135 MIs in gene regions
Fig. 5PCA of 26 populations. The first and second principal components are plotted. Different colors represent the five super-populations: orange, African; red, American; blue, European; purple, South Asian; and green, East Asian
Fig. 6Venn diagrams of MIR sharing among super-populations. a Diagram of all MIRs shared among the five super-populations. b Diagram of only MIRs in gene regions according to the GENCODE database shared among the five super-populations
Counts of common inverted alleles among 7 non-human primates and five human super-populations
| Name | All inverted allele count | East Asian | South Asian | European | African | American | 1KGP |
|---|---|---|---|---|---|---|---|
| Chimpanzee (panTro4) | 2284 | 12 | 11 | 7 | 14 | 11 | 20 |
| Gorilla (gorGor3) | 2010 | 10 | 11 | 9 | 14 | 10 | 18 |
| Orangutan (ponAbe2) | 3473 | 9 | 9 | 7 | 10 | 9 | 14 |
| Gibbon (nomLeu1) | 3713 | 6 | 7 | 5 | 7 | 3 | 13 |
| Baboon (PapHam1) | 4547 | 5 | 5 | 3 | 6 | 5 | 8 |
| Rhesus macaque (rheMac3) | 4271 | 5 | 4 | 2 | 5 | 4 | 7 |
| Squirrel monkey (SaiBol1) | 4214 | 2 | 2 | 2 | 2 | 2 | 2 |
| Total | 24,476 | 49 | 49 | 35 | 58 | 44 | 82 |