| Literature DB >> 31723199 |
Krishnamoorthy Srikanth1, Nam-Young Kim2, WonCheoul Park1, Jae-Min Kim3, Kwon-Do Kim4, Kyung-Tai Lee5, Ju-Hwan Son1, Han-Ha Chai1, Jung-Woo Choi6, Gul-Won Jang1, Heebal Kim4, Youn-Chul Ryu7, Jin-Wu Nam8, Jong-Eun Park1, Jun-Mo Kim9, Dajeong Lim10.
Abstract
The Jeju horse, indigenous to the Jeju Island in Korea may have originated from Mongolian horses. Adaptations to the local harsh environment have conferred Jeju horse with unique traits such as small-sized body, stocky head, and shorter limbs. These characteristics have not been studied previously at the genomic level. Therefore, we sequenced and compared the genome of 41 horses belonging to 6 breeds. We identified numerous breed-specific non-synonymous SNPs and loss-of-function mutants. Demographic and admixture analyses showed that, though Jeju horse is genetically the closest to the Mongolian breeds, its genetic ancestry is independent of that of the Mongolian breeds. Genome wide selection signature analysis revealed that genes such as LCORL, MSTN, HMGA2, ZFAT, LASP1, PDK4, and ACTN2, were positively selected in the Jeju horse. RNAseq analysis showed that several of these genes were also differentially expressed in Jeju horse compared to Thoroughbred horse. Comparative muscle fiber analysis showed that, the type I muscle fibre content was substantially higher in Jeju horse compared to Thoroughbred horse. Our results provide insights about the selection of complex phenotypic traits in the small-sized Jeju horse and the novel SNPs identified will aid in designing high-density SNP chip for studying other native horse breeds.Entities:
Mesh:
Year: 2019 PMID: 31723199 PMCID: PMC6853925 DOI: 10.1038/s41598-019-53102-8
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Summary of sequencing, mapping, and coverage details.
| Breed of horse | No of Sample | Raw_reads | Mapped_reads | Paired_reads | Alignment Rate† | Avg_fold‡ |
|---|---|---|---|---|---|---|
| Mongolian Galshar | 6 | 2056771059 | 2002716326 | 1693557220 | 96.45% | 13.9 X |
| Mongolian Domestic Horse | 4 | 1492500192 | 1453274027 | 1220426606 | 96.35% | 15.16 X |
| Mongolian Jarglant | 5 | 1798034653 | 1754687974 | 1481970526 | 96.23% | 14.64 X |
| Jeju Island | 12 | 4716082872 | 4630806096 | 3708745506 | 97.97% | 16.1 X |
| Thoroughbred | 10 | 4326958817 | 4248380754 | 3562897400 | 98.72% | 17.7 X |
| Przewalski’s | 4 | 2128278554 | 2063427021 | 1897781050 | 97.18% | 20.72 X |
| Total | 41 | 16518626147 | 16153292198 | 13565378308 | 97.15% | 16.37 X |
†Alignment rate is the percentage of reads that were aligned with the reference genome by the bowtie aligner.
‡Avg_fold– Average fold is the average coverage depth across the genome.
Summary of all the SNPs identified in this study.
| Categories | Total | JH | TB | PZ | MD | MG | MJ |
|---|---|---|---|---|---|---|---|
| Samples | 41 | 12 | 10 | 4 | 4 | 6 | 5 |
| SNPs | 21651273 | 10993471 | 9410065 | 10482878 | 16513676 | 16521183 | 16516855 |
| Transition | 14371118 | 7280776 | 6211755 | 6961756 | 10998372 | 11001791 | 10999640 |
| Transversion | 7280155 | 3712695 | 3198310 | 3521122 | 5515304 | 5519392 | 5517215 |
| Ts/Tv | 1.97 | 1.95 | 1.91 | 1.95 | 1.99 | 1.99 | 1.99 |
| Synonymous coding | 85436 | 43495 | 37033 | 41332 | 78412 | 78428 | 78419 |
| Non-synonymous coding | 73582 | 36033 | 30823 | 34080 | 53362 | 53413 | 53358 |
| Start lost | 85 | 35 | 35 | 37 | 68 | 69 | 69 |
| Stop gained | 868 | 371 | 325 | 346 | 623 | 626 | 623 |
| Stop lost | 48 | 35 | 14 | 37 | 38 | 38 | 38 |
| Non-coding exon | 27928 | 14596 | 12294 | 11570 | 19196 | 19235 | 19239 |
| 5′ Untranslated Region | 6732 | 3685 | 3186 | 3551 | 6845 | 6853 | 6852 |
| 3′ Untranslated Region | 12868 | 6042 | 5110 | 5964 | 10953 | 10955 | 10954 |
| Splice site acceptor | 561 | 358 | 315 | 335 | 491 | 495 | 496 |
| Splice site donor | 792 | 476 | 445 | 467 | 713 | 714 | 714 |
| Intron | 5714650 | 2844468 | 2421133 | 2756982 | 5484660 | 5485747 | 5484270 |
| Intergenic | 15827007 | 8094428 | 6942116 | 7676009 | 12077339 | 12083932 | 12080078 |
| missense | 73562 | 36099 | 30872 | 34134 | 53362 | 53413 | 53358 |
| LOF | 1483 | 783 | 694 | 725 | 1150 | 1156 | 1154 |
†SNPs categorized based on their effects,. ‡Coding SNPs categorized based on type of mutation.
Figure 1Variant statistics, SNP annotation, and missense and loss of function (LOF) mutants detected in this study. (a) Top panel shows the number of homozygous and heterozygous SNPs (per million base-pairs) identified in this study; the number on top of the bar indicates the ratio of heterozygous to homozygous SNPs. Bottom panel shows the number of known and novel SNPs identified (per million base-pairs), with the percentage of novel SNPs provided at the top of the bars. (b) Nucleotide diversity ratios across the genome of the six breeds analyzed in this study. (c) Venn diagrams showing the total number of MS, fixed MS, LOF, and fixed LOF mutants in the JH and TB.
Gene Ontology analysis of missense and loss of function mutants fixed in JH and TB populations.
| Breed | Type | Category | Term | Count | Genes |
|---|---|---|---|---|---|
| BP | Peptide catabolic process | 7 | ACE TRHDE TPP1 ANPEP LTA4H ENPEP RNPEP | ||
| BP | Negative regulation of canonical Wnt signalling pathway | 16 | CTNND1 NOTUM WWTR1 PTPRO LATS2 ZNRF3 DDIT3 GLI1 GPC3 SFRP2 DACT3 SFRP4 LIMD1 TMEM88 APC AXIN1 | ||
| BP | Central nervous system development | 9 | NRCAM HAPLN2 B3GNT5 TPP1 ACAN CASZ1 BCAN ADGRA2 NCAN | ||
| MF | Metalloendopeptidase activity | 21 | ADAM28 ADAMTS17 ADAM11 MMEL1 THOP1 ADAMTS16 MMP16 PAPLN MMP15 ADAM33 ADAM17 ADAM18 ATP23 ADAM19 ADAMTS12 PMPCA ADAMDEC1 PMPCB OMA1 ADAM9 ADAMTS4 | ||
| MF | ATPase activity | 19 | KIF14 KIFC1 DNAH10 KIF24 MLH1 MYO9B DNAH7 DNAH8 RNF213 DNAH6 CARNS1 TOR3A MACF1 VWA8 ATP5I MDN1 KIF21B PMS1 ABCA12 | ||
| KEGG | ECM-receptor interaction | 15 | COL4A4 ITGB4 ITGA9 LAMB4 SDC1 LAMA4 COL6A6 LAMA5 LAMC3 COMP TNR COL6A3 TNN RELN ITGA2B | ||
| KEGG | Dilated cardiomyopathy | 13 | ITGA9 ADCY4 ADCY7 MYL2 TNNC1 ADCY5 ADCY6 ITGB4 RYR2 CACNA1F CACNA2D3 CACNA2D2 ITGA2B | ||
| BP | Apoptotic process | 7 | CLPTM1L PRUNE2 CYCS BIRC6 SIAH2 BRAT1 CIDEC | ||
| MF | Glucose transmembrane transporter activity | 2 | SLC2A9 SLC2A10 | ||
| CC | Voltage-gated calcium channel complex | 2 | CACNA2D2 CACNA2D4 | ||
| KEGG | Adrenergic signalling in cardiomyocytes | 3 | ADCY5 RAPGEF4 CACNA2D2 | ||
| KEGG | Retrograde endocannabinoid signalling | 8 | SLC32A1 PLCB3 GABRA3 GABRA6 ADCY5 GRM1 PLCB2 CACNA1B | ||
| BP | Intracellular signal transduction | 14 | PLCB3 PDZD8 MAST1 MAST2 NUAK2 SNRK RPS6KA2 SPSB3 ADCY5 DEF8 PLCD3 ASB16 PLCB2 DVL1 | ||
| BP | Positive regulation of dendritic spine development | 3 | CAMK1 SHANK3 NEURL1 | ||
| BP | Cerebellum development | 3 | PTF1A GBX2 RPGRIP1L | ||
| BP | Negative regulation of canonical Wnt signalling pathway | 6 | HDAC1 DACT3 SOX2 RGS19 SIAH2 SOX9 | ||
| MF | Protein kinase A binding | 3 | SPHKAP DACT2 DACT3 | ||
| BP | Regulation of establishment or maintenance of cell polarity | 2 | LLGL1 LLGL2 | ||
| BP | Intraciliary retrograde transport | 2 | TTC21B TTC21A | ||
| BP | Regulation of Notch signalling pathway | 2 | LLGL1 LLGL2 | ||
| KEGG | Calcium signalling pathway | 6 | ATP2B1 ADCY7 PHKB HRH2 CACNA1I PDGFRB | ||
| KEGG | MAPK signalling pathway | 6 | TAOK2 CACNA1I PDGFRB FAS DUSP8 FLNA |
BP – Biological Processes, MF – Molecular Function, CC – Cellular Components, KEGG – Kyoto Encyclopedia of genes and genomes.
List of missense and loss of function mutation in jeju and thoroughbred horse
| Type of Mutation | Unique to JH | Unique to TB | Shared |
|---|---|---|---|
| Fixed missense | 2105 | 2003 | 4565 |
| Total missense | 14438 | 9211 | 21661 |
| Fixed LOF | 43 | 34 | 107 |
| Total LOF | 287 | 198 | 496 |
LOF – loss of function.
Genetic variants associated with conformation and athletic performance traits in the Jeju horse.
| Gene | CHR | Position (bp) | Associated Phenotype | Associated Genotype | Jeju Horse Genotype | Genotype Frequency | SNP_ID | Selection Signature (significance) |
|---|---|---|---|---|---|---|---|---|
| SCL26A2[ | 14 | 27301621 | Autosomal recessively inherited chondrodysplasia | A > G | GG | 1 | rs393849014 | |
| LCORL/NCAPG[ | 3 | 107374136 | Body size | C > T | TT | 1 | rs68603064 | ZHp (−3.6) |
| LASP1[ | 11 | 23334511 | Body size | G > A | GG, GA, AA | 0.25/0.5/0.25 | rs68876319 | |
| HMGA2[ | 6 | 82644800 | Body size | C > T | TT | 1 | rs68671073 | XPEHH (0.0034) |
| PROP1[ | 14 | 3000031 | Dwarfism | G > C | GG/GC/CC | 0.17/0.42/0.42 | rs68963991 | ZHp (−3.9) |
| PROP1[ | 14 | 3000132 | Dwarfism | T > C | TT/TC/CC | 0.33/0.5/0.17 | rs68963993 | ZHp (−3.9) |
| ACTN3[ | 12 | 30237132 | Muscle fibre composition | T > G | TT | 1 | rs1146699694 | XPEHH (0.004) |
| ACTN3[ | 12 | 30226577 | Muscle fibre composition | G > A | GA, AA | 0.42/0.68 | rs1144978872 | XPEHH (0.004) |
| MSTN[ | 18 | 66608679 | Racing distance/ muscle fibre composition | C > T | TT, CT | 0.9/0.1 | rs397152648 | ZHp(−3.9) |
| ACTN2[ | 1 | 75822632 | Athletic performance | A > G | AA | 1 | rs68450030 | XPEHH(0.005) |
| COX4I1[ | 3 | 33460779 | Athletic performance | C > T | CC, CT | 0.5/0.5 | rs68518550 | XPEHH(0.005) |
| COX4I2[ | 22 | 23314524 | Athletic performance | C > T | CC | 1 | rs69276449 | XPEHH(0.002) |
| ACN9[ | 4 | 40343817 | Athletic performance | C > T | TT | 1 | rs69505998 | |
| CKM[ | 10 | 16079732 | Athletic performance | G > A | GG, GA | 0.8/0.2 | rs68819557 | XPEHH(0.002) |
| PDK4[ | 4 | 39020227 | Athletic performance | C > A | CC | 1 | rs69586787 | XPEHH(0.003) |
| PDK4[ | 4 | 39024151 | Athletic performance | G > A | GG | 1 | rs69586789 | XPEHH(0.003) |
| DMRT3[ | 23 | 22391254 | Pattern of locomotion | C > A | CC, CA, AA | 0.25/0.5/0.25 | rs1150690013 | XPEHH(0.004) |
| PON1[ | 4 | 38747736 | Athletic performance | C > T | CC, CT, TT | 0.67/0.25/0.08 | rs69585308 | |
| ZFAT[ | 9 | 76901355 | Wither height | C > T | TT | 1 | rs68748127 | XPEHH(0.005) |
| ZFAT[ | 9 | 77656570 | Wither height | C > T | CC | 1 | rs68750453 | XPEHH(0.005) |
| ZFAT[ | 9 | 76901431 | Wither height | C > A | AA | 1 | rs68748129 | XPEHH(0.005) |
| ZFAT[ | 9 | 76901578 | Wither height | G > A | AA | 1 | rs68748130 | XPEHH(0.005) |
| ZFAT[ | 9 | 76904485 | Wither height | G > A | AA, GA | 0.92/0.08 | rs68748134 | XPEHH(0.005) |
Figure 2Results of population genomic analysis. (a) PCA plot of populations of the six horse breeds; principal components 1 and 2 indicate 16.3 and 10.7% of the observed variance, respectively. (b) Population structure of the 41 horses in the six breeds; the length of each colored segment represents the proportion of the genome inferred from the ancestral population (K). (c) Neighbour joining tree inferred from identity by descent distance matrix. The clades are colored according to breeds.
Figure 3Historical demography and gene flow analysis. (a) Estimated effective population size of MG, TB, JH, and PZ populations. Inset shows the 95% confidence interval for the estimates. (b) Pattern of population splits and gene flow between the breeds. The drift parameter is proportional to effective population size (Ne). The migration edge is coloured according to the percentage of ancestry received from the donor. Scale bar shows the average standard error (10 times) of the entry in the sample co-variance matrix.
Figure 4Selection signature analysis. (a,b) Manhattan plot showing the distribution of P values inferred from ZHp (a) and XP-EHH (b) analyses. The XP-EHH analysis was performed between JH and TB horses, while ZHp was performed only with JH. The total number of windows found to be significant is provided at the top of the panel. c - d) Enriched gene ontology and KEGG pathway for genes in the candidate selection signature regions in ZHp (c) and XP-EHH (d). (e) Venn diagram showing the total number of genes found within the significant candidate selection signature regions.
Gene ontology analysis of genes that overlapped between the two selection signature analysis methods.
| GO_ID | Term | Count | Genes | |
|---|---|---|---|---|
| GO:0034349 | Glial cell apoptotic process | 2 | BID, CASP9 | 0.007127 |
| GO:0097345 | Mitochondrial outer membrane permeabilization | 2 | BID, BNIP3 | 0.011379 |
| GO:0042770 | Signal transduction in response to DNA damage | 2 | BID, CASP9 | 0.015615 |
| GO:0090200 | Positive regulation of release of cytochrome c from mitochondria | 2 | BID, BNIP3 | 0.039287 |
| GO:0006915 | Apoptotic process | 5 | TMEM173, CASP9, BNIP3, BCL2L13,BID | 0.045793 |
| GO:0007067 | Mitotic nuclear division | 3 | ANAPC1, NSMCE2, PPP2R2D | 0.048449 |
Figure 5Results of RNAseq analysis of JH and TB. (a,b) Summary of gene ontology (a) and KEGG pathways (b) enriched in the five tissues acquired from JH. (c) Muscle fibre composition analysis form rump and thigh muscle biopsy in JH and TB population.