| Literature DB >> 24763315 |
Maria Grazia Doro1, Daniela Piras2, Giovanni Giuseppe Leoni3, Giuseppina Casu1, Simona Vaccargiu1, Debora Parracciani4, Salvatore Naitana3, Mario Pirastu5, Andrea Novelletto6.
Abstract
We sequenced to near completion the entire mtDNA of 28 Sardinian goats, selected to represent the widest possible diversity of the most widespread mitochondrial evolutionary lineage, haplogroup (Hg) A. These specimens were reporters of the diversity in the island but also elsewhere, as inferred from their affiliation to each of 11 clades defined by D-loop variation. Two reference sequences completed the dataset. Overall, 206 variations were found in the full set of 30 sequences, of which 23 were protein-coding non-synonymous single nucleotide substitutions. Many polymorphic sites within Hg A were informative for the reconstruction of its internal phylogeny. Bayesian and network clustering revealed a general similarity over the entire molecule of sequences previously assigned to the same D-loop clade, indicating evolutionarily meaningful lineages. Two major sister groupings emerged within Hg A, which parallel distinct geographical distributions of D-loop clades in extant stocks. The pattern of variation in protein-coding genes revealed an overwhelming role of purifying selection, with the quota of surviving variants approaching neutrality. However, a simple model of relaxation of selection for the bulk of variants here reported should be rejected. Non-synonymous diversity of Hg's A, B and C denoted that a proportion of variants not greater than that allowed in the wild was given the opportunity to spread into domesticated stocks. Our results also confirmed that a remarkable proportion of pre-existing Hg A diversity became incorporated into domestic stocks. Our results confirm clade A11 as a well differentiated and ancient lineage peculiar of Sardinia.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24763315 PMCID: PMC3999278 DOI: 10.1371/journal.pone.0095969
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 2Scatterplot of number of variants recorded as a function of length of each of the 13 protein-coding mtDNA genes.
Filled dots: variants scored among all 30 sequences considered. Empty dots: variants scored among the 28 Hg A sequences. Panel A: all positions. Panel B: only 3rd codon positions. Panel C: only non-synonymous variants (note that the three black dots at 0 variants overlap and obscure empty dots). coxIII is indicated by an arrow.
Figure 1Median joining network of 30 sequences showing the occurrence of 23 non-synonymous substitutions.
Each circle represents a sequence. Branch length is proportional to the number of mutations. Note that two A11 and two uncharacterized sequences are lumped into single (larger) nodes (see Materials and Methods). The haplogroup or D-loop clade (for Hg A) affiliation of clusters of sequences is shown in large empty characters. Aminoacid replacements are shown beside each branch in their polarized form. The Hg A portion of the network is magnified while the original network is in the inset. A large lozenge indicates the likely position of the root and was added in the drawing to partition three variants peculiar of Hg C. Their polarization was obtained by comparison with C. ibex.
Comparative analysis of the 23 non-synonymous substitutions in protein-coding genes inferred in the full set of 30 sequences.
| Position | Gene | Allelic variants | aa. replacement | Homologous aa. in rCRS | Variants in MITOMAP | Occurrences in GB | MutPred pathogenicity score in human | mtDNA selection score in human |
| 2767 |
| T/C | F9L | L |
| 23 | 0.625 | 0,768 |
| 4587 |
| T/C | T227I | T | T-A | 1 | 0.586 | 0.661 |
| 7053 |
| A/G | T13A | T |
| 122 | 0.378 | 0.297 |
| 7185 |
| A/G | D57N | D |
| 10, 1, 1, 3 | 0.499 | 0.473 |
| 8056 |
| A/G | V43I | I |
| 9, 5, 1127 | 0.325 | 0.242 |
| 8114 |
| A/G | N62S | N |
| 6 | 0.526 | 0.525 |
| 8134 |
| A/G | T69A | S |
| 7 | n.a. | n.a. |
| 8596 |
| T/C | H223Y | H | H-R | 3 | 0.656 | 0.865 |
| 8704 |
| T/C | I32T | A | A-V, A-A | 8, 26 | n.a. | n.a. |
| 8731 |
| T/C | A41V | T | T-A, T-M, T-T | 7, 1, 33 | n.a. | n.a. |
| 9015 |
| A/G | V136I | V |
| 8 | n.a. | n.a. |
| 9112 |
| T/C | M168T | L | L-L, L-S | 17, 1 | n.a. | n.a. |
| 9369 |
| C/G | V254L | V | V-I, V-V | 117, 9 | 0.471 | 0.425 |
| 9548 |
| T/C | V29A | G | G-S | 66 | n.a. | n.a. |
| 10433 |
| T/C | P89S | L | L-P, L-L | 64, 4 | n.a. | n.a. |
| 10493 |
| A/G | T109A | T | T-P, | n.a., 109, 6 | 0.482 | 0.443 |
| 11872 |
| A/G | C42Y | S |
| 3 | n.a. | n.a. |
| 13100 |
| A/T | I451M | L |
| 5 | n.a. | n.a. |
| 13390 |
| T/C | S548L | L |
| 62 | n.a. | n.a. |
| 13964 |
| A/G | V39A | V38 | V-L, V-I | 2, 6 | 0.544 | 0.562 |
| 14479 |
| A/T | F109L | Y | Y-N, Y-H, Y-C | n.a., 33, 1 | n.a. | n.a. |
| 14795 |
| A/G | T215A | S |
| 19 | n.a. | n.a. |
| 14865 |
| T/C | V238A | S | S-P, S-F, S-S | 28, 7, 8 | n.a | n.a. |
Non synonymous variants are bolded. For positions in which only synonymous variants are known italics is used;
from ref. [22];
same aminoacid encoded by a different codon in rCRS and goat;
n.a. = not applicable.
Comparative analysis of the substitutions in RNA-coding genes inferred in the full set of 30 sequences.
| Position | Gene | Allelic variants | Homologous pos. in rCRS | Variants in MITOMAP | Occurrences in GB |
| 165 | 12S-rRNA | T/C | 743C | not present | |
| 177 | 12S-rRNA | T/del | amb. align. | ||
| 178 | 12S-rRNA | T/del | amb. align. | ||
| 180 | 12S-rRNA | G/T | amb. align. | ||
| 181 | 12S-rRNA | A/T | amb. align. | ||
| 196 | 12S-rRNA | T/C | 773T | T-C | 1 |
| 205 | 12S-rRNA | T/C | 782A | not present | |
| 213 | 12S-rRNA | T/C | 790A | not present | |
| 605 | 12S-rRNA | T/C | 1187T | T-C | 18 |
| 944 | 12S-rRNA | T/C | 1520T | T-C | 13 |
| 1026 | 12S-rRNA | T/C | 1601C | C-T | 2 |
| 1114 | 16S-rRNA | T/C | amb. align. | ||
| 1111-20 | 16S-rRNA | C10/C11 | |||
| 1192 | 16S-rRNA | T/C | 1764C | C-T | 1 |
| 1238/39 | 16S-rRNA | A/del | amb. align. | ||
| 1369 | 16S-rRNA | T/C | amb. align. | ||
| 1671 | 16S-rRNA | A/G | 2241A | not present | |
| 1850 | 16S-rRNA | T/C | 2417C | C-G | 4 |
| 2120 | 16S-rRNA | T/C | 2686G | not present | |
| 2190 | 16S-rRNA | A/G | 2756C | C-A, C-T | 1, 5 |
| 2208 | 16S-rRNA | T/C | 2774C | not present | |
| 2619 | 16S-rRNA | T/C | 3184C | C-T | 1 |
| 3771 | tRNA-Gln | T/C | 4335C | C-T | 9 |
| 3892 | tRNA-Met | T/C | 4454T | T-A, T-C | 90, 80 |
| 6894 | tRNA-Ser (UCN) | T/C | 7468C | C-T | 5 |
| 7723 | tRNA-Lys | A/G | 8315A | A-G | 1 |
| 9827 | tRNA-Arg | T/C | 10421C | C-T | 1 |
| 9860 | tRNA-Arg | T/C | 10454T | T-C | 59 |
| 9863 | tRNA-Arg | A/G | amb. align. | ||
| 11560-7 | tRNA-His | A8/A9 | amb. align. | ||
| 11569 | tRNA-His | A/G | 12160A | not present | |
| 11624 | tRNA-Ser (AGY) | T/C | 12214C | not present | |
| 11691 | tRNA-Leu(CUN) | A/G | 12279A | A-T | 1 |
| 11702 | tRNA-Leu(CUN) | A/G | 12290A | A-G | 2 |
Ambiguous alignment.