| Literature DB >> 36009658 |
Silvia Bruno1, Vincenzo Landi2, Gabriele Senczuk3, Samantha Ann Brooks4, Faisal Almathen5,6, Bernard Faye7, Suheil Semir Bechir Gaouar8, Mohammed Piro9, Kwan Suk Kim10, Xavier David11, André Eggen11, Pamela Burger12, Elena Ciani1.
Abstract
Myostatin (MSTN) is a highly conserved negative regulator of skeletal muscle in mammals. Inactivating mutations results in a hyper-muscularity phenotype known as "double muscling" in several livestock and model species. In Camelus dromedarius, the gene structure organization and the sequence polymorphisms have been previously investigated, using Sanger and Next-Generation Sequencing technologies on a limited number of animals. Here, we carried out a follow-up study with the aim to further expand our knowledge about the sequence polymorphisms at the myostatin locus, through the whole-genome sequencing data of 183 samples representative of the geographical distribution range for this species. We focused our polymorphism analysis on the ±5 kb upstream and downstream region of the MSTN gene. A total of 99 variants (77 Single Nucleotide Polymorphisms and 22 indels) were observed. These were mainly located in intergenic and intronic regions, with only six synonymous Single Nucleotide Polymorphisms in exons. A sequence comparative analysis among the three species within the Camelus genus confirmed the expected higher genetic distance of C. dromedarius from the wild and domestic two-humped camels compared to the genetic distance between C. bactrianus and C. ferus. In silico functional prediction highlighted: (i) 213 differential putative transcription factor-binding sites, out of which 41 relative to transcription factors, with known literature evidence supporting their involvement in muscle metabolism and/or muscle development; and (ii) a number of variants potentially disrupting the canonical MSTN splicing elements, out of which two are discussed here for their potential ability to generate a prematurely truncated (inactive) form of the protein. The distribution of the considered variants in the studied cohort is discussed in light of the peculiar evolutionary history of this species and the hypothesis that extremely high muscularity, associated with a homozygous condition for mutated (inactivating) alleles at the myostatin locus, may represent, in arid desert conditions, a clear metabolic disadvantage, emphasizing the thermoregulatory and water availability challenges typical of these habitats.Entities:
Keywords: dromedary; indels; myostatin; single nucleotide polymorphisms
Year: 2022 PMID: 36009658 PMCID: PMC9404819 DOI: 10.3390/ani12162068
Source DB: PubMed Journal: Animals (Basel) ISSN: 2076-2615 Impact factor: 3.231
Analysis of potential amino acid substitutions caused by SNPs in the coding region.
| # | Variant | Project | Chromosome | Allele 1 | Allele 2 | Base Codon | Mutation | Allele |
|---|---|---|---|---|---|---|---|---|
| Exon | Name | ID * | Position ** | Position | Type | Frequency | ||
| 1 | 5var58465820 | PRJEB55295 | 58465820 | T | C | 3rd | Silent | 0.5 |
| 2 | 5var58463788 | PRJEB55295 | 58463788 | G | T | 3rd | Silent | 0.5 |
| 3 | 5var58461341 | PRJEB55295 | 58461341 | G | A | 3rd | Silent | 0.5 |
| 3 | 5var58461338 | PRJEB55295 | 58461338 | A | T | 3rd | Silent | 0.5 |
| 3 | 5var58461332 | PRJEB55295 | 58461332 | T | C | 3rd | Silent | 0.5 |
| 3 | 5var58461263 | PRJEB55295 | 58461263 | A | G | 3rd | Silent | 0.5 |
* Accession number for the project submitted to European Variant Archive (EVA, https://www.ebi.ac.uk/eva/ accessed on 7 August 2022). ** The position refers to the CamDro3 assembly (GCA_000803125.3).
Differential acceptor and donor splice sites identified by the HSF sequence analysis using the two sequences harboring the alternative alleles.
| ALLELE 1 | ALLELE 2 | |||||
|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
| 5var58464953 | - | - | - | Acceptor splice site | CCCGGTCTGCAGAT | 81.96 |
| 5var58464910 | Donor splice site | TATGTTATT | 72.09 | - | - | - |
| 5var58464775 | Acceptor splice site | GCACCTTAACAGAG | 77.89 | - | - | - |
| 5var58464463 | - | - | - | Donor splice site | GGTGTTAAT | 66.52 |
| 5var58464142 | Donor splice site | GAAGTAGGT | 81.61 | Acceptor splice site | GAAAGGAAGCAGGT | 68.36 |
|
|
|
|
|
|
|
|
| 5var58461332 | - | - | - | Acceptor splice site | ATTGCACCTAAGAG | 74.03 |
1 Type of splicing signals: acceptor splice sites or donor splice sites; 2 Sequence motif harboring the predicted splicing signals; 3 The signals values range from 65 (weak) to 100 (very strong).
Figure 1Expected amino acid sequences based on HSF results. (A) Simulation of retention of 793 bps in intron 1 due to the creation of an alternative donor splice site when the TAATAA allele belonging to 5var58464910 is present; (B) Simulation of retention of 871 bps in intron 1 due to the creation of an alternative acceptor splice site when the C allele belonging to 5var58464775 is present.