| Literature DB >> 28848689 |
Fabrizio Ghiselli1, Liliana Milani1, Mariangela Iannello1, Emanuele Procopio1, Peter L Chang2, Sergey V Nuzhdin2, Marco Passamonti1.
Abstract
Despite the large number of animal complete mitochondrial genomes currently available in public databases, knowledge about mitochondrial genomics in invertebrates is uneven. This paper reports, for the first time, the complete mitochondrial genome of the grooved carpet shell, Ruditapes decussatus, also known as the European clam. Ruditapes decussatus is morphologically and ecologically similar to the Manila clam Ruditapes philippinarum, which has been recently introduced for aquaculture in the very same habitats of Ruditapes decussatus, and that is replacing the native species. Currently the production of the European clam is almost insignificant, nonetheless it is considered a high value product, and therefore it is an economically important species, especially in Portugal, Spain and Italy. In this work we: (i) assembled Ruditapes decussatus mitochondrial genome from RNA-Seq data, and validated it by Sanger sequencing; (ii) analyzed and characterized the Ruditapes decussatus mitochondrial genome, comparing its features with those of other venerid bivalves; (iii) assessed mitochondrial sequence polymorphism (SP) and copy number variation (CNV) of tandem repeats across 26 samples. Despite using high-throughput approaches we did not find evidence for the presence of two sex-linked mitochondrial genomes, typical of the doubly uniparental inheritance of mitochondria, a phenomenon known in ∼100 bivalve species. According to our analyses, Ruditapes decussatus is more genetically similar to species of the Genus Paphia than to the congeneric Ruditapes philippinarum, a finding that bolsters the already-proposed need of a taxonomic revision. We also found a quite low genetic variability across the examined samples, with few SPs and little variability of the sequences flanking the control region (Largest Unassigned Regions (LURs). Strikingly, although we found low nucleotide variability along the entire mitochondrial genome, we observed high levels of length polymorphism in the LUR due to CNV of tandem repeats, and even a LUR length heteroplasmy in two samples. It is not clear if the lack of genetic variability in the mitochondrial genome of Ruditapes decussatus is a cause or an effect of the ongoing replacement of Ruditapes decussatus with the invasive Ruditapes philippinarum, and more analyses, especially on nuclear sequences, are required to assess this point.Entities:
Keywords: Bivalve molluscs; Codon usage; Comparative mitochondrial genomics; Complete mitochondrial genome; Doubly uniparental inheritance; European clam; Mitochondrial length polymorphism; Mitochondrial repeats; RNA-Seq; mtDNA de novo assembly
Year: 2017 PMID: 28848689 PMCID: PMC5571815 DOI: 10.7717/peerj.3692
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Features of the contigs obtained by de novo assembly of mtDNA.
| Contig | Length | Gene content | Poly-A | Notes |
|---|---|---|---|---|
| 1 | 6,794 | Yes | Chimeric assembly. The contiguity between | |
| 2 | 1,884 | No | – | |
| 3 | 1,288 | Yes | – | |
| 4 | 1,663 | ? | The contig ends with just 8 As | |
| 5 | 1,934 | No | – | |
| 6 | 1,831 | Yes | – | |
| 7 | 5,478 | Yes | There is a polyadenylation signal (56 As) after the | |
| 8 | 2,879 | No | – | |
| 9 | 952 | No | – |
Figure 1Ruditapes decussatus mtDNA gene arrangement.
MtDNA gene arrangement of Ruditapes decussatus.
| Name | Type | Start | Stop | Length (bp) | Start | Stop | Anticodon |
|---|---|---|---|---|---|---|---|
| Coding | 1 | 1,716 | 1,716 | ATA | TAG | ||
| tRNA | 1,754 | 1,815 | 62 | TAG | |||
| Coding | 1,822 | 2,739 | 918 | ATA | TAA | ||
| Coding | 2,755 | 3,774 | 1,020 | ATG | TAG | ||
| Coding | 3,780 | 4,052 | 273 | ATA | TAG | ||
| tRNA | 4,125 | 4,190 | 66 | GAT | |||
| Coding | 4,228 | 5,499 | 1,272 | ATA | TAG | ||
| tRNA | 5,553 | 5,616 | 64 | TGG | |||
| Coding | 5,641 | 6,864 | 1,224 | ATA | TAG | ||
| rRNA | 6,865 | 8,385 | 1,521 | ||||
| Coding | 8,386 | 9,123 | 738 | ATG | TAA | ||
| Coding | 9,145 | 9,552 | 408 | ATG | TAA | ||
| Coding | 9,631 | 11,268 | 1,638 | ATG | TAG | ||
| Coding | 13,379 | 13,504 | 126 | ATA | TAA | ||
| Coding | 13,526 | 14,865 | 1,340 | ATA | TA- | ||
| tRNA | 14,866 | 14,928 | 63 | GTG | |||
| tRNA | 14,929 | 14,990 | 62 | TTC | |||
| tRNA | 14,991 | 15,052 | 62 | TGA | |||
| tRNA | 15,081 | 15,140 | 60 | GTA | |||
| tRNA | 15,218 | 15,280 | 63 | GTC | |||
| tRNA | 15,294 | 15,358 | 65 | CAT | |||
| Coding | 15,380 | 15,874 | 495 | ATG | TAA | ||
| tRNA | 15,897 | 15,959 | 63 | TTT | |||
| tRNA | 15,960 | 16,021 | 62 | TAC | |||
| tRNA | 16,030 | 16,092 | 63 | GAA | |||
| tRNA | 16,093 | 16,155 | 63 | TCA | |||
| tRNA | 16,171 | 16,232 | 62 | TCG | |||
| tRNA | 16,233 | 16,295 | 63 | TAA | |||
| tRNA | 16,297 | 16,358 | 62 | TCC | |||
| tRNA | 16,359 | 16,427 | 69 | TTG | |||
| tRNA | 16,435 | 16,497 | 63 | GTT | |||
| tRNA | 16,498 | 16,560 | 63 | TGT | |||
| tRNA | 16,565 | 16,626 | 62 | GCA | |||
| tRNA | 16,632 | 16,696 | 65 | TGC | |||
| tRNA | 16,698 | 16,764 | 67 | TCT | |||
| Coding | 16,765 | 17,730 | 966 | ATG | TAA | ||
| rRNA | 17,731 | 18,995 | 1,265 |
Note:
The anticodon of tRNAs are reported in the 5′-3′ direction.
Unassigned regions (URs).
| UR name | Start | Stop | Length (bp) |
|---|---|---|---|
| UR1 | 1,717 | 1,753 | 37 |
| UR2 | 1,816 | 1,821 | 6 |
| UR3 | 2,740 | 2,754 | 15 |
| UR4 | 3,775 | 3,779 | 5 |
| UR5 | 4,053 | 4,124 | 72 |
| UR6 | 4,191 | 4,227 | 37 |
| UR7 | 5,500 | 5,552 | 53 |
| UR8 | 5,617 | 5,640 | 24 |
| UR9 | 9,124 | 9,144 | 21 |
| UR10 | 9,553 | 9,630 | 78 |
| UR11 (LUR) | 11,269 | 13,378 | 2,110 |
| UR12 | 13,505 | 13,525 | 21 |
| UR13 | 15,053 | 15,080 | 28 |
| UR14 | 15,141 | 15,217 | 77 |
| UR15 | 15,281 | 15,293 | 13 |
| UR16 | 15,359 | 15,379 | 21 |
| UR17 | 15,875 | 15,896 | 22 |
| UR18 | 16,022 | 16,029 | 8 |
| UR19 | 16,156 | 16,170 | 15 |
| UR20 | 16,296 | 16,296 | 1 |
| UR21 | 16,428 | 16,434 | 7 |
| UR22 | 16,561 | 16,564 | 4 |
| UR23 | 16,627 | 16,631 | 5 |
| UR24 | 16,697 | 16,697 | 1 |
Nucleotide composition.
| Name | Length (bp) | T (%) | C (%) | A (%) | G (%) | A+T (%) | T3 (%) | C3 (%) | A3 (%) | G3 (%) | A3+T3 (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1,716 | 35.8 | 15.5 | 25.8 | 22.9 | 61.6 | 39 | 12.1 | 28.0 | 21.3 | 67.0 | |
| 918 | 38.7 | 12.5 | 24.0 | 24.8 | 62.7 | 38 | 10.1 | 30.7 | 21.2 | 68.7 | |
| 1,020 | 38.3 | 11.0 | 24.8 | 25.9 | 63.1 | 35 | 11.5 | 29.4 | 24.4 | 64.4 | |
| 273 | 39.9 | 12.8 | 25.3 | 22.0 | 65.2 | 34 | 14.3 | 30.8 | 20.9 | 64.8 | |
| 1,272 | 29.7 | 14.8 | 29.1 | 26.4 | 58.8 | 30 | 15.3 | 27.4 | 27.6 | 57.4 | |
| 1,224 | 37.4 | 17.2 | 22.7 | 22.6 | 60.1 | 41 | 14.7 | 21.8 | 22.1 | 62.8 | |
| 1,749 | 33.2 | 11.5 | 32.6 | 22.6 | 65.8 | 33 | 10.6 | 33.4 | 23.0 | 66.4 | |
| 510 | 42.0 | 15.7 | 20.8 | 21.6 | 62.8 | 45 | 13.5 | 21.8 | 20.0 | 66.8 | |
| 408 | 39.5 | 11.0 | 24.8 | 24.8 | 64.3 | 33 | 11.0 | 30.1 | 25.7 | 63.1 | |
| 1,638 | 37.6 | 11.7 | 27.7 | 23.0 | 65.3 | 35 | 11.0 | 34.2 | 19.8 | 69.2 | |
| 126 | 44.4 | 11.9 | 19.0 | 24.6 | 63.4 | 45 | 4.8 | 23.8 | 26.2 | 68.8 | |
| 1,340 | 38.9 | 12.9 | 22.1 | 26.1 | 61.0 | 41 | 10.8 | 24.9 | 23.5 | 65.9 | |
| 495 | 39.2 | 12.1 | 23.0 | 25.7 | 62.2 | 38 | 13.9 | 27.9 | 20.0 | 65.9 | |
| 966 | 36.9 | 12.7 | 24.8 | 25.6 | 61.7 | 39 | 9.6 | 28.6 | 23.0 | 67.6 | |
| 1,265 | 32.7 | 12.3 | 32.9 | 22.1 | 65.6 | 35 | 13.5 | 31.6 | 19.5 | 66.6 | |
| All coding | 14,920 | 36.3 | 13.2 | 26.5 | 24.0 | 63.0 | 37 | 12.0 | 28.9 | 22.4 | 65.7 |
| All | 3,014 | 32.9 | 23.8 | 32.7 | 22.3 | 65.7 | |||||
| All | 1,394 | 35.4 | 12.8 | 30.2 | 21.7 | 65.6 | |||||
| All URs | 2,681 | 28.2 | 14.1 | 34.1 | 23.6 | 62.3 | |||||
| All genic DNA | 16,314 | 36.2 | 13.2 | 26.8 | 23.8 | 63.0 | |||||
| All DNA | 18,995 | 35.1 | 13.3 | 27.9 | 23.7 | 63.0 |
Note:
URs, unassigned regions.
Codon usage.
| Amino acid | Codon | # | Frequency | %TOT | Amino acid | Codon | # | Frequency | %TOT |
|---|---|---|---|---|---|---|---|---|---|
| Ala | GCG | 29 | 0.15 | 0.73 | Pro | CCG | 16 | 0.12 | 0.40 |
| 44 | 0.23 | 1.11 | 36 | 0.27 | 0.91 | ||||
| GCT | 85 | 2.14 | CCT | 58 | 1.46 | ||||
| GCC | 30 | 0.16 | 0.76 | CCC | 24 | 0.18 | 0.61 | ||
| Cys | TGT | 94 | 2.37 | Gln | CAG | 25 | 0.44 | 0.63 | |
| 30 | 0.24 | 0.76 | 32 | 0.81 | |||||
| Asp | GAT | 54 | 1.36 | Arg | CGG | 23 | 0.31 | 0.58 | |
| 28 | 0.34 | 0.71 | 21 | 0.28 | 0.53 | ||||
| Glu | GAG | 87 | 2.19 | CGT | 25 | 0.63 | |||
| 58 | 0.4 | 1.46 | CGC | 6 | 0.08 | 0.15 | |||
| Phe | TTT | 269 | 6.78 | Ser | AGG | 69 | 0.19 | 1.74 | |
| 78 | 0.22 | 1.97 | 69 | 0.19 | 1.74 | ||||
| Gly | GGG | 131 | 3.30 | AGT | 55 | 0.15 | 1.39 | ||
| 61 | 0.19 | 1.54 | AGC | 23 | 0.06 | 0.58 | |||
| GGT | 98 | 0.3 | 2.47 | TCG | 18 | 0.05 | 0.45 | ||
| GGC | 36 | 0.11 | 0.91 | 33 | 0.09 | 0.83 | |||
| His | CAT | 37 | 0.93 | TCT | 76 | 1.92 | |||
| 23 | 0.38 | 0.58 | TCC | 22 | 0.06 | 0.55 | |||
| Ile | ATT | 165 | 4.16 | Thr | ACG | 21 | 0.17 | 0.53 | |
| 40 | 0.2 | 1.01 | 30 | 0.24 | 0.76 | ||||
| Lys | AAG | 61 | 0.41 | 1.54 | ACT | 57 | 1.44 | ||
| 87 | 2.19 | ACC | 16 | 0.13 | 0.40 | ||||
| Leu | TTG | 122 | 0.23 | 3.08 | Val | GTG | 113 | 0.3 | 2.85 |
| 210 | 5.29 | 121 | 3.05 | ||||||
| CTG | 43 | 0.08 | 1.08 | GTT | 119 | 0.32 | 3.00 | ||
| 70 | 0.13 | 1.76 | GTC | 23 | 0.06 | 0.58 | |||
| CTT | 75 | 0.14 | 1.89 | Trp | TGG | 58 | 1.46 | ||
| CTC | 20 | 0.04 | 0.50 | 49 | 0.46 | 1.24 | |||
| Met | 86 | 0.36 | 2.17 | Tyr | TAT | 103 | 2.60 | ||
| ATA | 155 | 3.91 | 47 | 0.31 | 1.18 | ||||
| Asn | AAT | 76 | 1.92 | STOP | TAG | 34 | 0.58 | 0.86 | |
| 39 | 0.34 | 0.98 | TAA | 25 | 0.42 | 0.63 |
Note:
The codons corresponding to a tRNA present in the mitochondrial genome are underlined and in bold. The highest frequency among synonymous codons is also underlined and in bold. #, number of codons; Frequency, frequency of each codon among synonymous codons; %TOT, frequency of each codon among all the codons.
Figure 2Principal features of the Largest Unassigned Region (LUR).
(A): map of the lUR; (B): DNA secondary structure predicted in the repeat region (boxed in A); (C): Logos of the two DNA motifs found in the LUR.
LUR length and number of repeats in the 13 female samples analyzed.
| Specimen | Length (bp) | Number of repeats | GenBank Acc. No. |
|---|---|---|---|
| F3 | 2,100–3,500 | 6.5–25 | |
| F5 | 5,000 | 45 | |
| F7 | 3,500 | 25 | |
| F9 | 3,500 | 25 | |
| F10 | 3,000 | 20 | |
| F11 | 3,000 | 20 | |
| F13 | 3,500 | 25 | |
| F15 | 3,000 | 20 | |
| F16 | 3,500 | 25 | |
| F17 | 2,500–3,500 | 8–25 | |
| F19 | 3,500 | 25 | |
| F20 | 2,500 | 8 | |
| F21 | 2,100 | 6.5 |
Note:
F3 and F17 are heteroplasmic with LURs of different length.
Sequence Polymorphism (SP): SPs and small indels called by GATK.
| Feature | Value | Min | Median | Mean | Max |
|---|---|---|---|---|---|
| Depth (all SPs) | – | 6 | 1,357 | 1,521 | 3,880 |
| Phred score (all SPs) | – | 3.30E+01 | 5.76E+03 | 4.18E+07 | 2.15E+09 |
| Depth (SPs in CDS) | – | 222 | 2,038 | 2,150 | 3,880 |
| Phred score (SPs in CDS) | – | 1.18E+02 | 1.01E+04 | 4.45E+07 | 2.15E+09 |
| Total number of SPs | 257 | – | – | – | – |
| Number of mRDI01 private SPs | 151 (58.7% of the total) | – | – | – | – |
| Number of SPs in CDS | 145 (56.4% of the total) | – | – | – | – |
| Number of mRDI01 private SPs in CDS | 103 (71% of the SP in CDS) | – | – | – | – |
| Number of SPs in CDS (excluding mRDI01) | 42 | – | – | – | – |
| Frequency of SPs in CDS | 0.0097 (∼1 every 103 bp) | – | – | – | – |
| Frequency of SPs in CDS (excluding mRDI01) | 0.0028 (∼1 every 355 bp) | – | – | – | – |
| Total number of indels | 18 | – | – | – | – |
| Number of indels in CDS | 6 | – | – | – | – |
| Number of indels causing frameshift | 4 | – | – | – | – |
| Males | 234 (160) | 136 (107) | |||
| Males (no mRDI01) | 84 (15) | 32 (6) | |||
| Females | 97 (23) | 38 (9) |
Note:
CDS, coding sequences; Whole mtDNA, polymorphism in the whole mitochondrial genome; the number in brackets the bottom of the table represent private SPs (e.g., there are 23 female specific SPs in the whole mtDNA and 9 female specific SPs in CDS); p-value, significance of the Fisher’s exact test on number of SPs between sexes (i.e., all males vs females, males except mRDI01 vs females).
Indels located in coding sequences.
| Position | Depth | Qual | Gene | SP | Frameshift | Sample | Allele frequency | Notes |
|---|---|---|---|---|---|---|---|---|
| 1,698 | 3,732 | 1.38E+04 | C/CAAA | No | mRDI02, mRDI03 | 0.089, 0.85 | Insertion of 1 Lysine | |
| 6,364 | 1,929 | 2.15E+09 | CT/C | Yes | fRDI04, mRDI05 | 0.80, 0.81 | Yields a shorter | |
| 10,449 | 1,780 | 2.15E+09 | C/CT | Yes | fRDI01, fRDI04, fRDI05 | 0.11, 0.10, 0.11 | Yields a | |
| 17,619 | 2,272 | 5.98E+03 | AGCG/A | No | mRDI01 | 0.97 | Deletion of one Alanine | |
| 17,621 | 2,188 | 9.99E+04 | CG/C | Yes | mRDI01 | 0.99 | Always combined with SP_17624. Together change the last 35 amino acids | |
| 17,624 | 2,287 | 5.98E+03 | C/CAT | Yes | mRDI01 | 0.99 | Always combined with SP_17621. Together change the last 35 amino acids |
Note:
Depth, sequencing depth; Qual, quality of the called SP expressed in Phred score; Allele frequency, frequency of the alternative allele in each sample indicated in the “Sample” column.
Figure 3BLASTN comparison of Ruditapes decussatus and other Veneridae.
Ruditapes decussatus mtDNA map (external gray circle), and BLASTN identity (colored inner circles) with complete mtDNAs of other 10 venerid species (see list in Table S3).
Figure 4Maximum Likelihood (ML) tree of Veneridae obtained with all mitochondrial coding genes.
ML tree obtained with the MitoPhast pipeline; the complete input and output of this analysis is available on figshare (https://doi.org/10.6084/m9.figshare.4970762.v1).
Figure 5Comparison of gene order in venerid mtDNAs.
Variation in gene order between Ruditapes decussatus and P. euglypta (A) M. lamarckii F-type (B) Ruditapes philippinarum F-type (C) and among all the four species (D).