| Literature DB >> 29868288 |
Hongying Ma1,2,3, Yajiang Wu4, Hai Xiang5, Yunzhou Yang6, Min Wang1,2,3, Chunjiang Zhao1,2,3,7, Changxin Wu1,2,3.
Abstract
OBJECTIVES: There are large populations of indigenous horse (Equus caballus) in China and some other parts of East Asia. However, their matrilineal genetic diversity and origin remained poorly understood. Using a combination of mitochondrial DNA (mtDNA) and hypervariable region (HVR-1) sequences, we aim to investigate the origin of matrilineal inheritance in these domestic horses.Entities:
Keywords: Divergence time; East Asia; Genetic origins; HVR-1; Horse mitochondrial genome
Year: 2018 PMID: 29868288 PMCID: PMC5985762 DOI: 10.7717/peerj.4896
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Figure 1Haplogroup tree of mtDNA genome sequences from the studied horses.
Mutations were shown on the branches and were numbered according to a horse reference sequence (JN398377). Mutations toward a base identical-by-state to the JN398377 were indicated with the prefix. “W”, “S”, “M”, “K”, “R” and “Y” specifying the heteroplasmic status of A/T, C/G, A/C, G/T, A/G and C/T at a certain site, respectively. Deletions were indicated by a “d” after the deleted nucleotide position. Insertions are indicated by a “+” followed by the position number and type of inserted nucleotide(s). Repetitive align region np 16126-16352 are not considered. The information of horses in each clade was provided in Table S3 (The red Genebank numbers indicate 31 de novo mtDNA genome sequence); (B) is a continuation of (A) from the position of the asterisk.
Figure 2Median joining network constructed based on the 247- bp HVR-1 sequences.
Circles are proportional to the number of horses represented and a scale indicator (for node sizes) was provided. The length of lines represents the number of variants that separate nodes (some manual adjustment was made for visually good). In the circles, the colors of solid pie slices indicate studied horse populations: Orange, European horses; Blue, horses of West Asia; Light Green, horses from East Asia; Grey, ancient horses; Purper, Przewalskii horses.
Figure 3Haplogroup tree of HVR-1 sequences from the studied horses.
Mutations were shown on the branches and were numbered according to a horse reference sequence (JN398377). Repetitive align region np 16126–16352 are not considered. The information of horses in each clade was provided in Table S5. This haplogroup tree showed the corresponding relationship between the haplogroups of the horse mitogenomes and HVR-1 sequences from the studied horses; (B) is a continuation of (A) from the position of the asterisk.
Corresponding relationship between the haplogroups defined with the mitochondrial genome and the HVR-1 sequences.
A haplogroup of the genome corresponded to one or more of its HVR-1 sequence counterparts. The haplogroup A is the clade containing the largest number of haplotypes defined on the basis of HVR-1, but its sub-haplogroups are clustered into six haplogroups of mitochondrial genomes. Lw contained three sub-haplogroups of HVR-1 and is the largest haplogroup containing most haplotypes of the mitochondrial genome.
| HgW* | Aw | Bw | Cw | Dw | Ew | Fw | Gw | Hw |
| HgH* | A4,A5,A6(less) | A3 | H,A6(most) | E | A7 | A2 | A1 | D5,D6 |
| HgW* | Iw | Jw Kw | Lw | Mw | Nw | OwPwQw | Rw | |
| HgH* | B1,B2,B3 | I | D1,D2,D3 | C1 | C2 | F1-F3 | G1 | |
Notes.
haplogroup of whole mtDNA sequence
haplogroup of HVR-1 sequence
Geographic distribution of haplogroups of the mitochondrial genomes(%).
The two haplogroups Iw and Lw displayed frequency peaks in Europe and a decline to the east especially for Lw, which contained the largest number of European horses. However, an opposite distribution pattern was observed for haplogroups Aw, Hw, Jw, and Rw, which were harbored by more horses from East Asia than those from other regions.
| Clade of mtDNA genome | East Asia | West Asia | Europe |
|---|---|---|---|
| Aw | 9.68 | 9.33 | 6.34 |
| Bw | 4.03 | 9.33 | 5.63 |
| Cw | 6.45 | 6.67 | 2.11 |
| Dw | 7.26 | 1.33 | 2.82 |
| Ew | 2.42 | 1.33 | 0.70 |
| Gw | 12.90 | 12.00 | 9.86 |
| Hw | 5.64 | 0 | 0.70 |
| Iw | 6.45 | 9.33 | 14.08 |
| Jw | 5.64 | 2.67 | 0.70 |
| Kw | 0 | 0 | 0.70 |
| Lw | 12.90 | 8.00 | 37.32 |
| Mw | 5.64 | 4.00 | 8.45 |
| Nw | 2.42 | 1.33 | 5.63 |
| Ow | 2.42 | 6.67 | 0 |
| Pw | 4.03 | 2.67 | 1.41 |
| Qw | 9.68 | 10.67 | 2.82 |
| Rw | 2.42 | 1.33 | 0.70 |
Notes.
It indicates the areas of West Asia and its adjacent regions in the present study.
Geographic distribution of haplogroups of the HVR-1 sequences.
The haplogroups D1, D2, and D3, which displayed the highest frequencies in European horses, while I, G1 showed frequency peaks in East Asia and a decline from Asia to Europe.
| HVR-1 haplotype | East Asia | West Asia | Europe | Ancient |
|---|---|---|---|---|
| A1 | 7.62 | 9.44 | 8.62 | 5.04 |
| A2 | 0.93 | 1.55 | 0.03 | 0 |
| A3 | 3.81 | 8.73 | 8.52 | 3.36 |
| A4 | 1.10 | 0.56 | 2.33 | 0.42 |
| A5 | 7.28 | 5.21 | 2.16 | 2.94 |
| A6 | 7.96 | 11.97 | 2.26 | 15.55 |
| A7 | 2.79 | 2.96 | 1.38 | 2.52 |
| B1 | 0.85 | 2.11 | 1.05 | 0.42 |
| B2 | 3.73 | 6.62 | 3.77 | 2.10 |
| B3 | 1.02 | 0.85 | 1.54 | 0.42 |
| C1 | 4.91 | 5.49 | 9.11 | 18.91 |
| C2 | 2.88 | 2.68 | 9.44 | 4.62 |
| D1 | 3.64 | 6.20 | 11.41 | 5.46 |
| D2 | 3.90 | 6.34 | 10.82 | 3.36 |
| D3 | 5.84 | 3.80 | 7.44 | 0.42 |
| D5 | 1.35 | 0 | 0.07 | 1.68 |
| D6 | 1.10 | 1.13 | 1.21 | 1.68 |
| E | 4.49 | 1.41 | 6.06 | 3.36 |
| F1 | 6.86 | 10.56 | 2.43 | 3.78 |
| F2 | 6.69 | 5.63 | 6.78 | 2.94 |
| F3 | 7.37 | 4.08 | 1.41 | 8.82 |
| G | 1.10 | 0.70 | 0.29 | 0.42 |
| G1 | 1.95 | 0.56 | 0.85 | 1.68 |
| H | 4.49 | 0.28 | 0.03 | 0 |
| I | 6.35 | 1.13 | 0.98 | 10.08 |
Notes.
It indicates the areas of West Asia and its adjacent regions in the present study.
Figure 4Schematic phylogeny of mtDNAs genome from modern horses.
This tree includes 348 sequences and was rooted at a donkey (E. asinus) mitochondrial genome (not displayed). The topology was inferred by a beast approach, whereas a time divergence scale (based on rate substitutions) is shown on the bottom (age estimates were indicated with thousand years (KY)). The percentages on each branch represent Bayesian posterior credibility and the alphabets on the right represent the names of haplogroups. Additional details concerning ages were given in Tables S3 and S6.