The Complete Female- and Male-Transmitted Mitochondrial Genome of Meretrix lamarckii.

Stefano Bettinazzi, Federico Plazzi, Marco Passamonti.

Abstract

Bivalve mitochondrial genomes show many uncommon features, such as additional genes, high rates of gene rearrangement, and high A-T content. Moreover, Doubly Uniparental Inheritance (DUI) is a distinctive inheritance mechanism allowing some bivalves to maintain and transmit two separate sex-linked mitochondrial genomes. Many bivalve mitochondrial features, such as gene extensions or additional ORFs, have been proposed to be related to DUI but, up to now, this topic is far from being understood. Several species are known to show this unusual organelle inheritance but, since it is widespread only among Unionidae and Mytilidae, the distribution of DUI is unclear. We sequenced and characterized the complete female- (F) and male-transmitted (M) mitochondrial genomes of Meretrix lamarckii, which is the second species of the family Veneridae where DUI has been demonstrated so far. The two mitochondrial genomes are comparable in length and show roughly the same gene content and order, except for three additional tRNAs found in the M one. The two sex-linked genomes show an average nucleotide divergence of 16%. A 100-aminoacid insertion was found in the M. lamarckii M-cox2 gene; moreover, additional ORFs were found in both the F and M Long Unassigned Regions of M. lamarckii. Even if no direct involvement in the DUI process has been demonstrated so far, the finding of cox2 insertions and supernumerary ORFs in M. lamarckii both strengthens this hypothesis and widens the taxonomical distribution of such unusual features. Finally, the analysis of inter-sex genetic variability shows that DUI species form two separate clusters, namely Unionidae and Mytilidae+Veneridae; this dichotomy is probably due to different DUI regimes acting on separate taxa.

Year:  2016        PMID: 27083010      PMCID: PMC4833323          DOI: 10.1371/journal.pone.0153631

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Doubly Uniparental Inheritance (DUI) [1-4] is an interesting alternative to the common Strict Maternal Inheritance (SMI) of cytoplasmic organelles in Eukaryotes. Species with DUI are characterized by the presence of two different sex-linked mitochondrial lineages, which are transmitted independently by the two sexes. One lineage is called F (Female-transmitted) and is transmitted by females through eggs, while the other is called M (Male-transmitted) and is transmitted by males through sperm. After fertilization, the heteroplasmic zygote contains both F and M mitochondrial lineages. During embryonic development, females become essentially homoplasmic for F, whereas males remain heteroplasmic, with the M lineage localized in the germ line and (often in traces) in the soma, and the F one in the soma only [5]. As a result, in adult DUI bivalves, somatic tissues of both sexes are dominated by the F-mtDNA lineage, while germ-line cells contain the sex-specific mtDNA lineage [6-8]. The two sex-linked mtDNAs are therefore inherited separately, and thus they evolve independently. This results in a high level of sequence divergence between the two genomes, ranging from 10% to 50% (see, e.g., [9, 8, 5, 10]). Up to now, DUI has only been found in ten bivalve families: Arcticidae, Donacidae, Hyriidae, Margaritiferidae, Mactridae, Mytilidae, Nuculanidae, Solenidae, Unionidae and Veneridae ([11, 6, 12, 7, 9, 8, 13, 10, 14–15]; and references therein). With the exception of mytilids and unionids, where several DUI species have been discovered, only a few DUI species have been found in the other families, which limits comparisons between phylogenetically related species. Until very recently, among the Veneridae, DUI had been detected in Venerupis philippinarum only [16]. Although it was suggested for the species Cyclina sinensis [12], this claim was based solely on three GenBank sequences that were recently questioned [10].
In a recent paper, we investigated the presence of DUI in seven species of the subclass Heterodonta and we found DUI only in the venerid clam Meretrix lamarckii [10]. M. lamarckii, also known as the Korean hard clam, is a medium-sized clam widespread along the coasts of the Pacific Ocean, including China, Korea, Japan and South-East Asia [17-18]. This economically important mollusk lives in sandy sediments on subtidal flats [19]. A complete mitochondrial genome of M. lamarckii has already been sequenced from somatic cells by [18]: this is most likely the F-type, as indeed confirmed by phylogenetic analysis [10]. In the present work, we sequenced the complete M and F mitochondrial genomes of M. lamarckii.

Materials and Methods

Sample Collection, DNA Extraction and Dilution

24 individuals of Meretrix lamarckii were commercially purchased at the Tsukiji Wholesale Fish Market (Tokyo, Japan) in June 2012. All specimens were screened alive by microscopic inspection of gonadal extract to confirm sexual maturity and to determine the sex. A standard phenol:chloroform protocol [20] was used to extract total nucleic acid from gametes; samples were re-suspended in TE 1× and are stored in the Mozoo Lab at the Department of Biological, Geological and Environmental Sciences (Bologna, Italy). Total nucleic acid was quantified using a Nanodrop spectrophotometer and further diluted to reach the optimal Long-PCR concentration (125 ng/μL). Two individuals, whose specimen numbers are BES:TKJ:004 (female) and BES:TKJ:009 (male), were selected for further sequencing on the basis of their DNA yield.

PCR Amplifications, Electrophoresis and Sequencing

The two complete genomes were amplified in three large overlapping fragments using the Long-PCR technique paired with primer-walking on a Gene Amp® PCR System 2720 (Applied Biosystems). To perform Long-PCR amplification from 2,000 bp up to 10,000 bp we used the Herculase® II Fusion Enzyme kit (Stratagene). The 50 μL reaction volume was composed as follows: 2 μL DNA template (125 ng/μL), 0.5 μL Herculase II Fusion DNA Polymerase, 10 μL 5× Herculase II reaction Buffer (containing Mg2+ 10 mM), 0.5 μL dNTPs mix (25 mM for each dNTP), 1.25 μL of each primer (10 μM) and 34.5 μL of sterilized distilled water. Reaction conditions (S1 Table) were first set up according to the manufacturer's instructions and further modified whenever necessary. After the initial denaturation step (92°C for 2 min), 30 cycles were used as follows: denaturation at 92°C for 20 s, annealing at 48–56°C for 20–30 s and extension at 68°C for 10 min. The final extension step was carried out at 68°C for 8 min. Primers were designed using the Primer3 online tool [21]. For amplification of fragments < 2,000 bp we used a standard PCR approach with the GoTaq® Flexi DNA Polymerase kit (Promega). The reaction volume was 30 μL, composed of 10.85 μL of sterilized distilled water, 6 μL of 5× Green GoTaq® Flexi Buffer, 2.4 μL of dNTPs (2.5 mM for each dNTP), 3.6 μL of MgCl2 (25 mM), 1.5 μL of each primer (10 μM), 0.15 μL of Taq Polymerase (5 U/μL) and 4 μL of appropriately diluted DNA template. Typically, the cycle (S1 Table) was composed of an initial denaturation step (95°C for 2 min), then 35 cycles as follows: denaturation at 95°C for 1 min, annealing at 48–56°C for 1 min and extension at 72°C for 1–2 min depending on amplicon length. The final extension was carried out at 72°C for 5 min.
PCR results were visualized by electrophoresis on a 1% agarose gel stained with ethidium bromide and then purified through a standard isopropanol protocol, the Wizard® SV Gel and PCR Clean-Up System (Promega) and, in some cases, an empirically modified PEG precipitation protocol [22]. Successfully purified products were sequenced with the Sanger method at the Macrogen Europe facility (Amsterdam, The Netherlands).
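The two reaction mixes above can be checked arithmetically. The following is a minimal sketch, not part of the original protocol: it sums the stated component volumes and derives final concentrations with C_final = C_stock × V / V_total. The dictionary and function names are illustrative assumptions; the volumes and stock concentrations are those given in the text.

```python
# Component volumes in microlitres, as stated in the text.
LONG_PCR = {
    "DNA template": 2.0,
    "Herculase II polymerase": 0.5,
    "5x reaction buffer": 10.0,
    "dNTP mix": 0.5,
    "forward primer": 1.25,
    "reverse primer": 1.25,
    "water": 34.5,
}

STANDARD_PCR = {
    "water": 10.85,
    "5x Green GoTaq Flexi buffer": 6.0,
    "dNTPs": 2.4,
    "MgCl2": 3.6,
    "forward primer": 1.5,
    "reverse primer": 1.5,
    "Taq polymerase": 0.15,
    "DNA template": 4.0,
}


def total_volume(mix):
    """Sum of all component volumes, rounded to 0.01 uL."""
    return round(sum(mix.values()), 2)


def final_concentration(stock, volume, total):
    """Final concentration after dilution, in the same unit as the stock."""
    return stock * volume / total


if __name__ == "__main__":
    print(total_volume(LONG_PCR), total_volume(STANDARD_PCR))
    # Primer in the Long-PCR mix: 10 uM stock, 1.25 uL in 50 uL.
    print(final_concentration(10.0, 1.25, 50.0), "uM primer")
    # MgCl2 added to the standard mix: 25 mM stock, 3.6 uL in 30 uL.
    print(final_concentration(25.0, 3.6, 30.0), "mM MgCl2")
```

Both mixes add up to their nominal volumes (50 μL and 30 μL), which is a quick way to catch a transcription error in a recipe.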

Data analysis and genome annotation

MEGA 6.06 [23] was used to examine and edit electropherograms and to assemble the complete mitochondrial genomes from overlapping sequences. This software was also used to compute codon usage and nucleotide percentages. Nucleotide trends and A-T skew at four-fold degenerate sites were used to identify the Control Region (CR) and the Origin of Replication (OR) of either strand. A dedicated R [24] script was written to (i) compute these statistics over a sliding window, (ii) plot results and (iii) test for significance of the linear correlation. Any size and step for the sliding window can be specified: for the present work, we used a 700-bp sliding window with a step of 300 bp. Autocorrelograms for each nucleotide are also produced in order to evaluate the amount of autocorrelation between sliding windows and therefore the validity of the linear model approach. The user can analyze several genomes at the same time and set any gene as the starting point for the four-fold degenerate sites analysis; the script is available as S1 Script along with test files and a detailed tutorial. A GitHub repository is also available at https://github.com/mozoo/4F.git. Protein Coding Genes (PCGs) were predicted using both MITOS [25] and NCBI's ORF Finder online tool [26], using the invertebrate mitochondrial genetic code and alternative start codons. Potential PCGs were identified through homologous sequence similarity using BLAST [27-28]. F- and M-cox2 sequences were aligned using MUSCLE [29] and the alignment was graphically edited with the TeXshade package [30]. Putative tRNA genes were detected using the MITOS, tRNAScan-SE [31-32] and ARWEN v1.2 [33] online tools with default settings. rRNA sequences were identified by comparison with other rRNAs present in GenBank using BLAST.
In both F and M genomes, the rRNA sequences were then annotated assuming that the first base comes immediately after the last base of the previous gene and that the last base comes immediately before the first base of the following gene. Finally, the two whole-genome maps were created using the GenomeVx online tool [34], conventionally setting cox1 as the starting point of the mtDNA. All putative secondary structures of rRNAs and non-coding regions were predicted using the Mfold server [35] and then graphically edited with VARNA 3.7 [36]. Repeats were identified with the online tool Tandem Repeat Finder [37]. The Phobius [38], InterProScan [39], TMPred [40] and HMMTOP [41] online tools were all used to predict additional Trans Membrane Helices (TMHs). Introns were searched using @TOME2 [42]. The analysis of additional putative ORFs was done using Glimmer3 [43]. The EMBOSS [44] package was used to extract all the possible ORFs from all available bivalve complete mitochondrial genomes (GenBank consulted in August 2014). An alignment was computed for F_ORF141 and M_ORF138 using HHBlits [45] and the last UniProt release. The computed hidden Markov model was used as a query against the database of all bivalve mitochondrial ORFs, which was built using HHBlits and the Pdb70 database.
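The sliding-window statistics described above can be sketched as follows. This is an illustrative Python outline, not the authors' R script (S1 Script): the function names are hypothetical, the sequence is treated as linear rather than circular, and the set of four-fold degenerate codon families (including AGN as serine under the invertebrate mitochondrial code) is an assumption of this sketch.

```python
from collections import Counter

# First two bases of codon families assumed to be four-fold degenerate
# under the invertebrate mitochondrial code (assumption: AGN = Ser).
FOURFOLD_PREFIXES = {"CT", "GT", "TC", "CC", "AC", "GC", "CG", "GG", "AG"}


def fourfold_sites(cds, offset=0):
    """Return (position, base) for third positions of four-fold codons."""
    cds = cds.upper()
    sites = []
    for i in range(0, len(cds) - 2, 3):
        codon = cds[i:i + 3]
        if codon[:2] in FOURFOLD_PREFIXES and codon[2] in "ACGT":
            sites.append((offset + i + 2, codon[2]))
    return sites


def window_composition(sites, length, size=700, step=300):
    """Percent A/C/G/T and A-T skew of the sites falling in each window."""
    rows = []
    for start in range(0, max(length - size, 0) + 1, step):
        counts = Counter(b for pos, b in sites if start <= pos < start + size)
        n = sum(counts.values())
        if n == 0:
            continue  # no four-fold sites in this window
        pct = {b: 100.0 * counts[b] / n for b in "ACGT"}
        a, t = counts["A"], counts["T"]
        skew = (a - t) / (a + t) if a + t else 0.0
        rows.append((start, pct, skew))
    return rows


def linear_fit(xs, ys):
    """Ordinary least-squares slope and intercept of ys against xs."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, my - slope * mx
```

In use, `fourfold_sites` would be applied to each PCG (with `offset` set to its genomic start), the resulting sites pooled, and `linear_fit` run on the window start positions against the percentage of each nucleotide.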

Phylogeny and evolutionary comparisons

A phylogenetic analysis was carried out using other complete mitochondrial genomes from the family Veneridae that were available as of August 2014. Three heterodonts, Coelomactra antiquata (Mactridae), Hiatella arctica (Hiatellidae), and Acanthocardia tuberculata (Cardiidae), were selected as outgroups; the complete dataset is available as S2 Table. The 13 PCGs and the 2 rRNAs were extracted and aligned with PSI-BLAST [46], MUSCLE, ProbconsRNA [47], RNAplfold [48], and MAFFT [49] through the T-Coffee algorithm [50-51], using the pipeline PSI-Coffee > Expresso > accurate for PCGs and the MR-Coffee mode for rRNAs. Alignments were masked using BMGE 1.1 [52]; the best partitioning scheme was selected using PartitionFinder 1.1.0 [53] under the Bayesian Information Criterion and a greedy approach. The final Maximum Likelihood (ML) tree search was carried out with RAxML 8.2.0 [54], performing 1,000 bootstrap replicates. The consensus tree was computed with PhyUtility 2.2 [55] and graphically edited with Dendroscope 3 [56]. Finally, we compared sex-linked divergence in different DUI systems; all the DUI species whose complete mitochondrial genomes were available in January 2015 were selected for this analysis. M and F coding sequences of M. lamarckii and other DUI species were aligned gene by gene using the T-Coffee algorithm: as above, the accurate method was used for PCGs, while MR-Coffee was chosen for rRNAs and tRNAs. To account for multiple substitutions, nucleotide Jin-Nei [57] and aminoacid Kimura [58] corrected distances were then computed using the EMBOSS suite along with uncorrected p-distances. Principal Component Analysis was carried out in R on concatenated Jin-Nei and Kimura distances using the packages FactoMineR [59] for computations and ggplot2 [60] for graphics. We also computed average distances within Unionidae, within Amarsipobranchia sensu [61] (i.e., Pteriomorphia + Heterodonta; in this case, Mytilidae + Veneridae), and within the complete dataset.
Nucleotide single-gene average values were ranked, and the rankings within Unionidae and Amarsipobranchia were compared in R using Spearman's ρ and Kendall's τ.
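The rank comparison in the last step can be illustrated with a small standard-library sketch. The authors worked in R; the functions below are a hypothetical Python equivalent. Spearman's ρ is computed as the Pearson correlation of ranks, and Kendall's τ in its simplest form (tau-a, which assumes no tied values).

```python
def ranks(values):
    """Rank values from 1 (smallest); tied values share the average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average
        i = j + 1
    return result


def spearman_rho(x, y):
    """Pearson correlation computed on the ranks of x and y."""
    rx, ry = ranks(x), ranks(y)
    n = len(rx)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    sx = sum((a - mx) ** 2 for a in rx) ** 0.5
    sy = sum((b - my) ** 2 for b in ry) ** 0.5
    return cov / (sx * sy)


def kendall_tau(x, y):
    """Kendall tau-a: (concordant - discordant) / number of pairs."""
    n = len(x)
    score = 0
    for i in range(n):
        for j in range(i + 1, n):
            product = (x[i] - x[j]) * (y[i] - y[j])
            score += (product > 0) - (product < 0)
    return score / (n * (n - 1) / 2)


if __name__ == "__main__":
    # Made-up per-gene average distances for two groups (not from the paper).
    group_a = [1.0, 2.0, 3.0, 4.0]
    group_b = [1.0, 2.0, 4.0, 3.0]
    print(spearman_rho(group_a, group_b), kendall_tau(group_a, group_b))
```

A value close to 1 for either statistic would indicate that genes rank similarly by divergence in the two groups.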

Results

Overall genomic features

The Meretrix lamarckii complete F and M mitochondrial genomes are 20,025 bp and 19,688 bp long, respectively (Fig 1). Sequences are available in GenBank under the accession numbers KP244451 and KP244452, respectively. All genes are located on the same “+” strand and the two lineages share the same gene order. The only exceptions are three tRNAs, which were found only in M-mtDNA: trnL(AAG), an additional copy of trnQ(UUG), and trnF(AAA). Genome annotations are shown in Table 1.
Fig 1

Meretrix lamarckii sex-linked mitochondrial genomes.

Genomic map of F- (above) and M- (below) mtDNA of Meretrix lamarckii starting from cox1. Genes are all on "+" strand; genome lengths are shown in the middle of each map. Unassigned Regions (URs) are reported in black in the internal circle.

Table 1

Annotation of F and M mitochondrial genomes of Meretrix lamarckii.

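| Name | Genome | Type^a | Start^b | End | Length (bp) | UNs^c | Anticodon | Start Codon | Stop Codon |
|---|---|---|---|---|---|---|---|---|---|
| cox1 | F | PCG | 1 | 1,824 | 1,824 | 32 | | GTG | TAG |
| cox1 | M | PCG | 1 | 1,851 | 1,851 | 46 | | GTG | TAA |
| trnL(NAG) | F | tRNA | 1,857 | 1,918 | 62 | 0 | TAG | | |
| trnL(NAG) | M | tRNA | 1,898 | 1,959 | 62 | 0 | TAG | | |
| nad1 | F | PCG | 1,919 | 2,821 | 903 | 77 | | ATC | TAA |
| nad1 | M | PCG | 1,960 | 2,862 | 903 | 19 | | ATT | TAA |
| trnL(NAG) | M | tRNA | 2,882 | 2,939 | 58 | 0 | AAG | | |
| nad2 | F | PCG | 2,899 | 3,954 | 1,056 | 42 | | GTG | TAA |
| nad2 | M | PCG | 2,940 | 3,995 | 1,056 | 15 | | ATG | TAA |
| nad4L | F | PCG | 3,997 | 4,311 | 315 | 148 | | GTG | TAG |
| nad4L | M | PCG | 4,011 | 4,304 | 294 | 63 | | ATT | TAG |
| trnI | F | tRNA | 4,460 | 4,523 | 64 | 51 | GAT | | |
| trnI | M | tRNA | 4,368 | 4,430 | 63 | 38 | GAT | | |
| trnD | F | tRNA | 4,575 | 4,636 | 62 | 68 | GTC | | |
| trnD | M | tRNA | 4,469 | 4,533 | 65 | 4 | GTC | | |
| trnQ | M | tRNA | 4,538 | 4,604 | 67 | -2 | TTG | | |
| cox2 | F | PCG | 4,705 | 6,090 | 1,386 | 71 | | ATG | TAG |
| cox2 | M | PCG | 4,603 | 6,276 | 1,674 | 70 | | ATG | TAA |
| trnP | F | tRNA | 6,162 | 6,227 | 66 | -1 | TGG | | |
| trnP | M | tRNA | 6,347 | 6,414 | 68 | 29 | TGG | | |
| cytb | F | PCG | 6,227 | 7,492 | 1,266 | 0 | | ATG | TAG |
| cytb | M | PCG | 6,444 | 7,712 | 1,269 | 0 | | TTG | TAG |
| rrnL | F | rRNA | 7,493 | 8,964 | 1,472 | 0 | | | |
| rrnL | M | rRNA | 7,713 | 9,185 | 1,473 | 0 | | | |
| atp8 | F | PCG | 8,965 | 9,117 | 153 | 28 | | ATA | TAA |
| atp8 | M | PCG | 9,186 | 9,320 | 135 | 22 | | ATG | TAG |
| nad4 | F | PCG | 9,146 | 10,480 | 1,335 | 9 | | ATA | TAA |
| nad4 | M | PCG | 9,343 | 10,683 | 1,341 | 7 | | GTG | TAA |
| trnH | F | tRNA | 10,490 | 10,551 | 62 | 0 | GTG | | |
| trnH | M | tRNA | 10,691 | 10,752 | 62 | 0 | GTG | | |
| trnE | F | tRNA | 10,552 | 10,617 | 66 | -3 | TTC | | |
| trnE | M | tRNA | 10,753 | 10,818 | 66 | -3 | TTC | | |
| trnS(NGA) | F | tRNA | 10,615 | 10,679 | 65 | 0 | TGA | | |
| trnS(NGA) | M | tRNA | 10,816 | 10,880 | 65 | 0 | TGA | | |
| atp6 | F | PCG | 10,680 | 11,531 | 852 | 31 | | ATG | TAG |
| atp6 | M | PCG | 10,881 | 11,732 | 852 | 31 | | GTG | TAG |
| nad3 | F | PCG | 11,563 | 11,997 | 435 | 65 | | ATG | TAG |
| nad3 | M | PCG | 11,764 | 12,198 | 435 | 46 | | GTG | TAG |
| nad5 | F | PCG | 12,063 | 13,787 | 1,725 | 76 | | GTG | TAG |
| nad5 | M | PCG | 12,245 | 13,981 | 1,737 | 72 | | GTG | TAA |
| nad6 | F | PCG | 13,864 | 14,385 | 522 | 34 | | ATA | TAA |
| nad6 | M | PCG | 14,054 | 14,584 | 531 | 31 | | ATA | TAA |
| trnW | F | tRNA | 14,420 | 14,487 | 68 | 2 | TCA | | |
| trnW | M | tRNA | 14,616 | 14,682 | 67 | 2 | TCA | | |
| trnM | F | tRNA | 14,490 | 14,559 | 70 | 70 | CAT | | |
| trnM | M | tRNA | 14,685 | 14,754 | 70 | 64 | CAT | | |
| trnV | F | tRNA | 14,630 | 14,696 | 67 | 90 | TAC | | |
| trnV | M | tRNA | 14,819 | 14,883 | 65 | 86 | TAC | | |
| trnK | F | tRNA | 14,787 | 14,856 | 70 | 39 | TTT | | |
| trnK | M | tRNA | 14,970 | 15,038 | 69 | 30 | TTT | | |
| trnF | F | tRNA | 14,896 | 14,969 | 74 | 7 | GAA | | |
| trnF | M | tRNA | 15,069 | 15,140 | 72 | 6 | GAA | | |
| trnL(NAA) | F | tRNA | 14,977 | 15,040 | 64 | 59 | TAA | | |
| trnL(NAA) | M | tRNA | 15,147 | 15,210 | 64 | 31 | TAA | | |
| trnG | F | tRNA | 15,100 | 15,163 | 64 | 1,855 | TCC | | |
| trnG | M | tRNA | 15,242 | 15,305 | 64 | 694 | TCC | | |
| trnF | M | tRNA | 16,000 | 16,066 | 67 | 690 | AAA | | |
| trnQ | F | tRNA | 17,019 | 17,086 | 68 | 125 | TTG | | |
| trnQ | M | tRNA | 16,757 | 16,825 | 69 | 35 | TTG | | |
| trnR | F | tRNA | 17,212 | 17,277 | 66 | 57 | TCG | | |
| trnR | M | tRNA | 16,861 | 16,926 | 66 | 45 | TCG | | |
| trnN | F | tRNA | 17,335 | 17,400 | 66 | 19 | GTT | | |
| trnN | M | tRNA | 16,972 | 17,037 | 66 | 20 | GTT | | |
| trnT | F | tRNA | 17,420 | 17,487 | 68 | 0 | TGT | | |
| trnT | M | tRNA | 17,058 | 17,122 | 65 | 0 | TGT | | |
| rrnS | F | rRNA | 17,488 | 18,680 | 1,193 | 0 | | | |
| rrnS | M | rRNA | 17,123 | 18,310 | 1,188 | 0 | | | |
| trnC | F | tRNA | 18,681 | 18,747 | 67 | 19 | GCA | | |
| trnC | M | tRNA | 18,311 | 18,378 | 68 | 21 | GCA | | |
| trnY | F | tRNA | 18,767 | 18,834 | 68 | 38 | GTA | | |
| trnY | M | tRNA | 18,400 | 18,469 | 70 | 37 | GTA | | |
| trnS(NCT) | F | tRNA | 18,873 | 18,939 | 67 | 0 | TCT | | |
| trnS(NCT) | M | tRNA | 18,507 | 18,573 | 67 | 0 | TCT | | |
| cox3 | F | PCG | 18,940 | 19,848 | 909 | 12 | | ATG | TAA |
| cox3 | M | PCG | 18,574 | 19,497 | 924 | 11 | | ATG | TAA |
| trnA | F | tRNA | 19,861 | 19,933 | 73 | 92 | TGC | | |
| trnA | M | tRNA | 19,509 | 19,579 | 71 | 109 | TGC | | |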

a PCG, Protein Coding Gene.

b All genes are located on the same strand.

c Unassigned nucleotides after the gene (negative values for overlapping nucleotides).
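The coordinates in Table 1 follow two simple rules: a gene's length is End − Start + 1, and the next gene starts at End + UNs + 1 (with negative UNs for overlaps). The sketch below checks both rules on a few rows copied from the table; the list and function names are illustrative assumptions.

```python
# (name, start, end, unassigned nucleotides after the gene), from Table 1.
F_GENES = [
    ("cox1", 1, 1824, 32),
    ("trnL(NAG)", 1857, 1918, 0),
    ("nad1", 1919, 2821, 77),
    ("nad2", 2899, 3954, 42),
]

# M-genome rows around the supernumerary trnQ, which overlaps cox2 by 2 bp.
M_GENES = [
    ("trnD", 4469, 4533, 4),
    ("trnQ", 4538, 4604, -2),
    ("cox2", 4603, 6276, 70),
    ("trnP", 6347, 6414, 29),
]


def gene_length(start, end):
    """Length in bp of a gene spanning start..end (1-based, inclusive)."""
    return end - start + 1


def is_contiguous(genes):
    """True if every gene starts right after the previous end plus its UNs."""
    return all(
        nxt[1] == cur[2] + cur[3] + 1 for cur, nxt in zip(genes, genes[1:])
    )


if __name__ == "__main__":
    print(gene_length(1, 1824))    # F-cox1
    print(is_contiguous(F_GENES))
    print(is_contiguous(M_GENES))
```

The same check applied to the last gene of each genome recovers the total genome length (for example, 19,933 + 92 = 20,025 bp for F-mtDNA).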

Nucleotide composition is reported in Table 2. M. lamarckii A-T content reaches 66.01% in F-mtDNA and 67.17% in M-mtDNA. This value increases considering only the 3rd base nucleotide composition of PCG codons (73.31% for F-mtDNA and 75.08% for M-mtDNA).
Table 2

Nucleotide composition (%) of Meretrix lamarckii F and M genomes.

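| Name | Sex | Length | T | C | A | G | A-T | T3^a | C3^a | A3^a | G3^a | A-T3^a |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| cox1 | F | 1,824 | 43.10 | 12.10 | 22.10 | 22.70 | 65.20 | 56.00 | 4.90 | 18.90 | 19.70 | 74.90 |
| cox1 | M | 1,851 | 44.30 | 11.30 | 22.70 | 21.60 | 67.00 | 58.35 | 4.21 | 20.10 | 17.34 | 78.44 |
| nad1 | F | 903 | 45.96 | 11.52 | 19.27 | 23.26 | 65.23 | 55.81 | 4.98 | 18.94 | 20.27 | 74.75 |
| nad1 | M | 903 | 47.07 | 10.63 | 18.83 | 23.48 | 65.89 | 57.81 | 3.32 | 17.61 | 21.26 | 75.42 |
| nad2 | F | 1,056 | 44.41 | 7.39 | 23.20 | 25.00 | 67.61 | 46.02 | 3.98 | 30.68 | 19.32 | 76.70 |
| nad2 | M | 1,056 | 44.89 | 6.63 | 21.69 | 26.80 | 66.57 | 48.01 | 2.27 | 23.58 | 26.14 | 71.59 |
| nad4L | F | 315 | 50.16 | 8.25 | 19.37 | 22.22 | 69.52 | 52.38 | 3.81 | 17.14 | 26.67 | 69.52 |
| nad4L | M | 294 | 51.70 | 9.52 | 18.71 | 20.07 | 70.41 | 53.06 | 7.14 | 16.33 | 23.47 | 69.39 |
| cox2 | F | 1,386 | 38.89 | 9.52 | 25.69 | 25.90 | 64.57 | 56.28 | 5.63 | 18.83 | 19.26 | 75.11 |
| cox2 | M | 1,674 | 37.48 | 9.15 | 28.21 | 25.16 | 65.69 | 53.23 | 4.84 | 22.40 | 19.53 | 75.63 |
| cytb | F | 1,266 | 43.68 | 13.03 | 21.88 | 21.41 | 65.56 | 54.27 | 7.11 | 21.33 | 17.30 | 75.59 |
| cytb | M | 1,269 | 44.52 | 13.00 | 20.72 | 21.75 | 65.25 | 54.85 | 6.62 | 20.80 | 17.73 | 75.65 |
| atp8 | F | 153 | 45.10 | 12.42 | 22.22 | 20.26 | 67.32 | 43.14 | 3.92 | 27.45 | 25.49 | 70.59 |
| atp8 | M | 135 | 50.37 | 9.63 | 19.26 | 20.74 | 69.63 | 51.11 | 0.00 | 17.78 | 31.11 | 68.89 |
| nad4 | F | 1,335 | 45.39 | 10.49 | 19.78 | 24.34 | 65.17 | 51.24 | 5.62 | 21.80 | 21.35 | 73.03 |
| nad4 | M | 1,341 | 44.74 | 10.07 | 22.22 | 22.97 | 66.96 | 49.66 | 5.37 | 25.28 | 19.69 | 74.94 |
| atp6 | F | 852 | 43.78 | 10.92 | 23.12 | 22.18 | 66.90 | 45.07 | 6.34 | 25.00 | 23.59 | 70.07 |
| atp6 | M | 852 | 44.13 | 10.21 | 23.47 | 22.18 | 67.61 | 46.83 | 3.87 | 29.93 | 19.37 | 76.76 |
| nad3 | F | 435 | 43.45 | 8.97 | 25.75 | 21.84 | 69.20 | 45.52 | 4.83 | 29.66 | 20.00 | 75.17 |
| nad3 | M | 435 | 42.99 | 7.13 | 25.52 | 24.37 | 68.51 | 44.14 | 2.76 | 24.83 | 28.28 | 68.97 |
| nad5 | F | 1,725 | 46.09 | 10.96 | 17.74 | 25.22 | 63.83 | 49.39 | 8.70 | 16.00 | 25.91 | 65.39 |
| nad5 | M | 1,737 | 47.67 | 8.52 | 19.80 | 24.01 | 67.47 | 54.06 | 3.63 | 20.03 | 22.28 | 74.09 |
| nad6 | F | 522 | 47.51 | 10.34 | 18.20 | 23.95 | 65.71 | 56.32 | 6.32 | 16.67 | 20.69 | 72.99 |
| nad6 | M | 531 | 48.96 | 8.10 | 18.27 | 24.67 | 67.23 | 58.76 | 4.52 | 12.99 | 23.73 | 71.75 |
| cox3 | F | 909 | 44.66 | 11.88 | 19.47 | 23.98 | 64.14 | 62.05 | 4.62 | 15.51 | 17.82 | 77.56 |
| cox3 | M | 924 | 46.10 | 9.63 | 19.59 | 24.68 | 65.69 | 62.34 | 1.95 | 15.91 | 19.81 | 78.25 |
| rrnS | F | 1,193 | 36.46 | 10.73 | 28.50 | 24.31 | 64.96 | | | | | |
| rrnS | M | 1,188 | 37.29 | 9.76 | 28.96 | 23.99 | 66.25 | | | | | |
| rrnL | F | 1,472 | 39.27 | 9.71 | 29.82 | 21.20 | 69.09 | | | | | |
| rrnL | M | 1,473 | 39.24 | 9.91 | 30.69 | 20.16 | 69.93 | | | | | |
| All PCGs^b | F | 12,681 | 44.21 | 10.79 | 21.30 | 23.70 | 65.51 | 52.78 | 5.82 | 20.53 | 20.87 | 73.31 |
| All PCGs^b | M | 13,002 | 44.67 | 9.75 | 22.05 | 23.52 | 66.73 | 53.88 | 4.15 | 21.20 | 20.77 | 75.08 |
| All rRNAs | F | 2,665 | 38.01 | 10.17 | 29.23 | 22.59 | 67.24 | | | | | |
| All rRNAs | M | 2,661 | 38.37 | 9.85 | 29.91 | 21.87 | 68.28 | | | | | |
| All tRNAs | F | 1,464 | 37.70 | 10.93 | 29.71 | 21.65 | 67.42 | | | | | |
| All tRNAs | M | 1,653 | 39.21 | 9.76 | 29.45 | 21.58 | 68.67 | | | | | |
| All coding DNA^c | F | 16,810 | 42.66 | 10.70 | 23.29 | 23.35 | 65.95 | | | | | |
| All coding DNA^c | M | 17,316 | 43.18 | 9.77 | 23.97 | 23.08 | 67.15 | | | | | |
| All URs^d | F | 3,216 | 40.14 | 7.37 | 26.18 | 26.31 | 66.32 | | | | | |
| All URs^d | M | 2,374 | 41.29 | 6.59 | 26.20 | 25.91 | 67.50 | | | | | |
| Complete genome | F | 20,025 | 42.26 | 10.17 | 23.75 | 23.83 | 66.01 | | | | | |
| Complete genome | M | 19,688 | 42.95 | 9.39 | 24.23 | 23.43 | 67.19 | | | | | |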

a Computed only on third codon positions.

b PCGs, Protein Coding Genes.

c PCGs + rRNAs + tRNAs

d URs, Unassigned Regions.

Both F-type and M-type mtDNAs contain a large number of Unassigned Regions (URs; 27 in F-mtDNA and 29 in M-mtDNA), which are detailed in S3 Table.

Protein Coding Genes (PCGs)

We found all 13 canonical protein coding genes, including the atp8 gene, reported as missing in several bivalve species [7, 62–63]. The ATG start codon is used in cox2, cytb, atp6, nad3, and cox3 of the F genome and in nad2, cox2, atp8 and cox3 of the M one. Like most invertebrate mitochondrial genomes, the two M. lamarckii mtDNAs show alternative start codons: GTG, ATC, ATA, ATT and TTG (see Table 1). Observed stop codons are TAG and TAA, as expected. Overall, TAA is the most common stop codon, while TAG is used in F- cox1, nad4L, cox2, cytb, atp6, nad3, and nad5 and in M- nad4L, cytb, atp8, atp6, and nad3. Truncated TA-/T-- stop codons ([8, 63]; and references therein) were not found in M. lamarckii mtDNAs. M. lamarckii F and M protein coding genes (PCGs) contain 4,227 codons and 4,333 codons, respectively (Table 3). In both F and M mtDNAs, the most used codon is UUU (410 and 434 hits, respectively). The least used codons are CGC and ACC (both 7 hits) in F-mtDNA and UGC (2 hits) in M-mtDNA. The most common aminoacid in both F- and M-mtDNA is leucine, while the rarest is glutamic acid.
Table 3

Meretrix lamarckii codon count (#) and usage (%).

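| aa | codon^a | F # | F % | M # | M % |
|---|---|---|---|---|---|
| Phe(F) | UUU^b | 410 | 9.70 | 434 | 10.02 |
| Phe(F) | UUC | 42 | 0.99 | 17 | 0.39 |
| Leu(L) | UUA | 230 | 5.44 | 244 | 5.63 |
| Leu(L) | UUG | 190 | 4.49 | 193 | 4.45 |
| Leu(L) | CUU^b | 82 | 1.94 | 91 | 2.10 |
| Leu(L) | CUC | 9 | 0.21 | 3 | 0.07 |
| Leu(L) | CUA | 25 | 0.59 | 11 | 0.25 |
| Leu(L) | CUG | 17 | 0.40 | 17 | 0.39 |
| Ile(I) | AUU | 198 | 4.68 | 235 | 5.42 |
| Ile(I) | AUC | 14 | 0.33 | 9 | 0.21 |
| Met(M) | AUA | 84 | 1.99 | 114 | 2.63 |
| Met(M) | AUG | 117 | 2.77 | 90 | 2.08 |
| Val(V) | GUU | 275 | 6.51 | 266 | 6.14 |
| Val(V) | GUC | 14 | 0.33 | 11 | 0.25 |
| Val(V) | GUA | 71 | 1.68 | 95 | 2.19 |
| Val(V) | GUG | 116 | 2.74 | 120 | 2.77 |
| Ser(S) | UCU | 161 | 3.81 | 158 | 3.65 |
| Ser(S) | UCC | 9 | 0.21 | 11 | 0.25 |
| Ser(S) | UCA | 16 | 0.38 | 21 | 0.48 |
| Ser(S) | UCG | 15 | 0.35 | 18 | 0.42 |
| Ser(S) | AGU | 88 | 2.08 | 103 | 2.38 |
| Ser(S) | AGC | 13 | 0.31 | 11 | 0.25 |
| Ser(S) | AGA | 66 | 1.56 | 52 | 1.20 |
| Ser(S) | AGG | 51 | 1.21 | 67 | 1.55 |
| Pro(P) | CCU | 89 | 2.11 | 95 | 2.19 |
| Pro(P) | CCC | 11 | 0.26 | 8 | 0.18 |
| Pro(P) | CCA | 16 | 0.38 | 18 | 0.42 |
| Pro(P) | CCG | 8 | 0.19 | 11 | 0.25 |
| Thr(T) | ACU | 87 | 2.06 | 90 | 2.08 |
| Thr(T) | ACC | 7 | 0.17 | 6 | 0.14 |
| Thr(T) | ACA | 18 | 0.43 | 13 | 0.30 |
| Thr(T) | ACG | 20 | 0.47 | 16 | 0.37 |
| Ala(A) | GCU | 134 | 3.17 | 122 | 2.82 |
| Ala(A) | GCC | 9 | 0.21 | 7 | 0.16 |
| Ala(A) | GCA | 26 | 0.62 | 26 | 0.60 |
| Ala(A) | GCG | 28 | 0.66 | 21 | 0.48 |
| Tyr(Y) | UAU | 152 | 3.60 | 171 | 3.95 |
| Tyr(Y) | UAC | 25 | 0.59 | 18 | 0.42 |
| STOP(*) | UAA | 6 | 0.14 | 8 | 0.18 |
| STOP(*) | UAG | 7 | 0.17 | 5 | 0.12 |
| His(H) | CAU | 51 | 1.21 | 51 | 1.18 |
| His(H) | CAC | 20 | 0.47 | 16 | 0.37 |
| Gln(Q) | CAA^c | 27 | 0.64 | 25 | 0.58 |
| Gln(Q) | CAG | 30 | 0.71 | 29 | 0.67 |
| Asn(N) | AAU | 102 | 2.41 | 102 | 2.35 |
| Asn(N) | AAC | 24 | 0.57 | 21 | 0.48 |
| Lys(K) | AAA | 85 | 2.01 | 100 | 2.31 |
| Lys(K) | AAG | 47 | 1.11 | 53 | 1.22 |
| Asp(D) | GAU | 93 | 2.20 | 114 | 2.63 |
| Asp(D) | GAC | 19 | 0.45 | 15 | 0.35 |
| Glu(E) | GAA | 46 | 1.09 | 67 | 1.55 |
| Glu(E) | GAG | 78 | 1.85 | 70 | 1.62 |
| Cys(C) | UGU | 84 | 1.99 | 99 | 2.28 |
| Cys(C) | UGC | 10 | 0.24 | 2 | 0.05 |
| Trp(W) | UGA | 58 | 1.37 | 52 | 1.20 |
| Trp(W) | UGG | 66 | 1.56 | 72 | 1.66 |
| Arg(R) | CGU | 40 | 0.95 | 39 | 0.90 |
| Arg(R) | CGC | 7 | 0.17 | 4 | 0.09 |
| Arg(R) | CGA | 23 | 0.54 | 15 | 0.35 |
| Arg(R) | CGG | 13 | 0.31 | 14 | 0.32 |
| Gly(G) | GGU | 185 | 4.38 | 165 | 3.81 |
| Gly(G) | GGC | 13 | 0.31 | 21 | 0.48 |
| Gly(G) | GGA | 71 | 1.68 | 57 | 1.32 |
| Gly(G) | GGG | 79 | 1.87 | 104 | 2.40 |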

a Codons that match a corresponding mtDNA-encoded tRNA are underlined.

b The corresponding tRNA is present only in the M genome.

c Two trnQ(UUG) tRNAs were detected in the M genome.
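The usage percentages in Table 3 are codon counts divided by the total number of codons (4,227 for the F genome, i.e. 12,681 bp of PCGs divided by 3). The sketch below shows how such a table can be produced from a coding sequence. It is an illustration with hypothetical function names, not the MEGA procedure used by the authors.

```python
from collections import Counter


def usage_percent(count, total):
    """Codon usage as a percentage of all codons, rounded to two decimals."""
    return round(100.0 * count / total, 2)


def codon_usage(cds):
    """Map each codon (RNA alphabet) to (count, percent) for one CDS."""
    rna = cds.upper().replace("T", "U")
    usable = len(rna) - len(rna) % 3  # ignore an incomplete trailing codon
    codons = [rna[i:i + 3] for i in range(0, usable, 3)]
    counts = Counter(codons)
    return {c: (n, usage_percent(n, len(codons))) for c, n in counts.items()}


if __name__ == "__main__":
    # UUU in the F genome: 410 hits out of 4,227 codons (values from Table 3).
    print(usage_percent(410, 4227))
    # Toy sequence, not from the paper.
    print(codon_usage("TTTTTTGTG"))
```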

Post-transcriptional cleavage sites could be indicated by the presence of a tRNA between two PCGs [64]. In the absence of a tRNA, the cleavage role can be played by intergenic non-coding sequences that form a stem-loop secondary structure ([8]; and references therein). Accordingly, for each M. lamarckii unassigned region located between a pair of PCGs, the predicted hairpin was determined and reported in S1 Fig. Finally, the M-cox2 gene is significantly different from the F one. More specifically, it includes a 100-aminoacid long region in the middle of the gene, which is not present in F-cox2 (Fig 2).
Fig 2

Meretrix lamarckii cox2 gene alignment.

Aminoacid alignment between female-type (F) and male-type (M) Meretrix lamarckii sequences of the cox2 gene. Identical aminoacids are shaded following their hydropathy (see the legend below the figure for the meanings of the different colors); purple bars show aminoacid similarity. The 100-aminoacid insertion found in M-mtDNA is boxed in red. Sites are numbered above the sequences, conventionally starting at 1.

rRNAs and tRNAs

Standard rRNAs were found in both genomes: rrnS is located between trnT and trnC, while rrnL is located between cytb and atp8. The predicted secondary structures of the F and M rRNAs are reported in S2 Fig. F-mtDNA shows all 22 canonical tRNAs, with two serine-encoding tRNAs and two leucine-encoding tRNAs, which differ from each other in their anticodon. As in many other metazoan taxa (see, e.g., [65, 8, 63]), both F- and M-trnS(UCU) present a shortened DHU arm. In addition, the M genome presents three sex-specific tRNAs, totaling 25 tRNAs: supernumerary trnL(AAG) (between nad1 and nad2), trnQ(UUG) (between trnD and cox2), and trnF(AAA) (within the M Long Unassigned Region). All secondary structures of tRNAs are reported in S3 and S4 Figs. To better understand their origin, M supernumerary tRNAs were compared with the URs mapped in the same position of F-mtDNA. In all cases, a very similar sequence was found, although the canonical cloverleaf structure is essentially unrecoverable (S5 Fig).

Long Unassigned Region (LUR)

A Long Unassigned Region (LUR) is located between trnG and trnQ. The F-LUR measures 1,855 bp, whereas the M-LUR is apparently divided into two regions (LUR1 and LUR2, of 694 bp and 690 bp, respectively) by the putative supernumerary trnF(AAA) (see above). A complex secondary structure was found in the middle of the LUR sequence in both M and F mtDNAs. This highly folded structure spans bases 15,698 to 15,952 in the F-LUR and bases 15,838 to 16,512 in the M-LUR. In the F-LUR two tandem-repeated motifs were also found, both with two tandem copies. The first motif is 15 bp long (positions 15,722–15,736 and 15,737–15,751) and the second one is 109 bp long (positions 16,767–16,875 and 16,880–16,988, at the end of the F-LUR region) (S6 Fig). In M-mtDNA only the 15 bp-long motif was found (positions 15,845–15,858 and 15,860–15,873). A BLAST search of the Termination-Associated Sequence (TAS; [66-67]) element found a significant hit (S7 Fig) in the F-LUR (positions 16,340–16,354), but not in the M-LUR. The first PCG downstream of the LUR is cox3; therefore, we set cox3 as the starting point of the sliding window computing nucleotide composition at four-fold degenerate sites. We found 1,911 degenerate sites in the F M. lamarckii genome (15.01% of PCG sites) and 1,907 in the M one (14.67%). In both cases, four-fold degenerate sites are highly T-rich (59.65% for F and 59.20% for M), and weak but significant trends were uncovered (Fig 3A): while a significant A trend was never found, we detected a significant increase in C in both sexes, albeit with very low R² values. A negative trend for G was found in F-mtDNA, while a positive one for G and T was found in M-mtDNA, again with very low R² values. The autocorrelograms show a significant value of the autocorrelation function (acf) only at lag-1 for F genomes, or at lag-1 and lag-2 (or lag-5) for three M nucleotides (S8 Fig).
Given the T-richness of four-fold degenerate sites, the A-T skew is always negative or equal to 0, but two peaks were found, corresponding to the LUR and to the atp6/nad3 boundary (Fig 3B).
Fig 3

Origins of Replication.

A, nucleotide composition at four-fold degenerate sites, using a sliding window of 700 bp with a step of 300 bp. The starting point is the first PCG after the LUR, i.e. cox3. Equations are as follows, for F/M, respectively. A (green): y = 0.0004x+14.84; R = 0.0681; p = 0.0616 / y = 0.0002x+14.62; R = 0.0409; p = 0.1546. C (blue): y = 0.0003x+2.15; R = 0.2263; p = 0.0004*** / y = 0.0002x+3.17; R = 0.1087; p = 0.0181*. G (black): y = −0.0004x+20.39; R = 0.0796; p = 0.0428* / y = 0.0004x+17.33; R = 0.1448; p = 0.0059**. T (red): y = −0.0004x+62.61; R = 0.0595; p = 0.0815 / y = −0.0008x+64.88; R = 0.2123; p = 0.0007***. B, A-T skew at four-fold degenerate sites, using a sliding window as for (A); the starting point is again cox3.

The structure of the F-LUR is comparable to the LUR of the published genome of M. lamarckii (GenBank Accession Number NC_016174), with some differences. The LUR of the available M. lamarckii mtDNA is found at positions 14,982–18,044; again, a highly folded region can be inferred (15,470–16,433). At the 5' side of the highly folded region there is a sequence very similar to that of the F-ORF (15,242–15,392); this sequence would be a putative ORF located on the reverse strand, were it not for a stop codon (TAA) right after the start one and for an insertion of a G, which triggers a frameshift mutation leading to the loss of the stop codon (S9 Fig). Conversely, at the 3' side of the highly folded region a 100-bp repeated motif was found; the repeat unit shows some similarities with the 109-bp motif of our F genome (S10 Fig). However, in the GenBank M. lamarckii mtDNA it is repeated 13 times; this higher number of repeats accounts for the great difference in length between the two LURs (1,855 against 3,063 bp).

Supernumerary Open Reading Frames (ORFs)

Several additional putative Open Reading Frames (ORFs) were found within the LUR of both F and M M. lamarckii mtDNAs. Among all these sequences, we found only one ORF in each genome (F_ORF141 and M_ORF138) that does not overlap with the highly folded structure revealed in the LUR (see above). To better understand whether these putative ORFs are expressed or not, the prediction software Glimmer3 was used. At first, the software was trained with M. lamarckii standard gene data. All mitochondrial PCGs were given a score ranging from 8.71 to 16.34 (for F) and from 10.47 to 18.08 (for M). According to these values, the two potential supernumerary ORFs should not be considered as expressed, because they showed extremely low scores (i.e., 2.34 for F_ORF141 and 3.36 for M_ORF138; see S4 Table). Homologs of F_ORF141 were also searched for in all available bivalve complete mitochondrial genomes using HHBlits. In all the other available Meretrix species, a homolog was found on the reverse strand, within the LUR. All homologous ORFs have a probability over 90%, while E-value and p-value were always lower than 0.05; this also holds for the F_ORF141/M_ORF138 comparison (S5 Table).

Phylogenetic analysis

The complete dataset consisted of 5,035 aminoacids (PCGs) and 4,554 nucleotides (rRNAs). 3,841 aminoacids and 1,232 nucleotides were left for phylogenetic analysis (69.14% and 27.05%, respectively) after masking with BMGE; the most affected PCG was cox2 (only 29.82% of aminoacids were selected), while the least affected was nad4 (85.57%); rrnL and rrnS were similarly affected by the masking phase (28.20% and 25.57%, respectively). PartitionFinderProtein suggested partitioning the PCGs into two clusters, namely ATP synthase/NADH dehydrogenase subunits (atp6, atp8, nad1, nad2, nad3, nad4, nad4L, nad5, and nad6) and cytochrome c oxidase subunits/cytochrome b (cox1, cox2, cox3, and cytb); conversely, PartitionFinder suggested keeping the two ribosomal genes (rrnL and rrnS) together. Best-fitting molecular evolution models were JTT [68], LG [69], and GTR [70], respectively. The consensus tree computed over 1,000 bootstrap replicates is highly supported (Fig 4), with bootstrap proportions equal to 100% for all nodes, with one exception (the Paphia clade). Veneridae were recovered as monophyletic, with the mactrid Coelomactra antiquata as the sister taxon. Tapetinae and Meretricinae are also monophyletic; within Tapetinae, the deepest split separates the DUI species R. philippinarum from R. decussatus + Paphia; within Meretricinae, only the genus Meretrix is represented in our tree and the available M. lamarckii mtDNA clusters with our F genome.
Fig 4

Phylogenetic analysis.

Maximum Likelihood phylogenetic analysis of the family Veneridae using complete mitochondrial genomes and Acanthocardia tuberculata (Cardiidae), Hiatella arctica (Hiatellidae), and Coelomactra antiquata (Mactridae) as outgroups. Shown is the consensus of ML trees obtained from 1,000 bootstrap replicates; numbers at the nodes are bootstrap proportions. Purple bars mark known DUI species.

Genetic variability

Nucleotide Jin-Nei distances and aminoacid Kimura distances were calculated between M. lamarckii F-mtDNA and M-mtDNA for each PCG, tRNA, rRNA, UR and for concatenations of these, up to the whole genome. Jin-Nei and Kimura distance values are reported in Table 4 and S6 Table (for single tRNAs).
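As an illustration of the aminoacid correction, the sketch below applies Kimura's empirical formula, d = −ln(1 − p − 0.2p²), to the fraction p of differing residues in an aligned pair. It is a minimal sketch under stated assumptions, not the pipeline used in this study: the scaling per 100 sites, the skipping of gapped columns, the function name and the toy sequences are all hypothetical choices made for illustration.

```python
import math


def kimura_protein_distance(seq_a, seq_b, per_100=True):
    """Kimura-corrected distance between two aligned protein sequences.

    Returns None when the correction is undefined, i.e. when the sequences
    are too divergent for the logarithm to exist.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    # Assumption: columns with a gap in either sequence are ignored.
    pairs = [(a, b) for a, b in zip(seq_a, seq_b) if a != "-" and b != "-"]
    if not pairs:
        raise ValueError("no comparable positions")
    p = sum(a != b for a, b in pairs) / len(pairs)  # uncorrected p-distance
    if p == 0:
        return 0.0
    arg = 1.0 - p - 0.2 * p * p
    if arg <= 0:
        return None  # too divergent to correct
    d = -math.log(arg)
    # Assumption: values are reported per 100 sites.
    return 100.0 * d if per_100 else d


if __name__ == "__main__":
    # Toy alignment, not real data.
    print(kimura_protein_distance("MKTLLA-GV", "MKSLLAAGI"))
```

The `None` branch mirrors the situation in which a pair is too divergent for the Kimura distance to be computed.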
Table 4

Bivalves nucleotide and aminoacid (boldface) distances.

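In each pair of rows, the first row (listed with the F accession) reports nucleotide Jin-Nei distances between the F and M genomes, and the second row (listed with the M accession) reports aminoacid Kimura distances (boldface in the original). Empty cells are entries not reported.

| Species | mtDNA | Accession | All coding DNA (a) | PCGs (b) | rRNAs | tRNAs | atp6 | atp8 (c) | cox1 | cox2 | cox3 | cytb | nad1 | nad2 | nad3 | nad4 | nad4L | nad5 | nad6 | rrnL | rrnS |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Anodonta anatina | F | NC_022803 | 92.24 | 90.02 | 98.76 | 112.28 | 114.22 | 109.70 | 80.86 | 88.62 | 78.97 | 85.97 | 90.20 | 100.78 | 90.00 | 96.48 | 48.03 | 88.90 | 119.64 | 98.99 | 98.70 |
| | M | KF030963 | | 81.46 | | | 87.98 | 186.71 | 42.34 | 82.92 | 54.98 | 54.35 | 66.80 | 133.98 | 98.29 | 150.76 | 92.06 | 101.01 | 194.11 | | |
| Hyriopsis cumingii | F | NC_011763 | 10.09 | 11.59 | 5.80 | 4.85 | 10.70 | 4.32 | 7.04 | 14.78 | 9.96 | 12.80 | 13.82 | 16.56 | 5.59 | 10.23 | 5.47 | 15.95 | 7.32 | 5.17 | 6.76 |
| | M | HM347668 | | 1.98 | | | 0.43 | 4.20 | 0.20 | 2.71 | 1.17 | 1.58 | 2.39 | 3.18 | 1.72 | 1.03 | 2.96 | 3.57 | 0.00 | | |
| Hyriopsis schlegelii | F | NC_015110 | 7.52 | 8.18 | 3.51 | 7.17 | 6.94 | 11.77 | 8.58 | 26.10 | 4.57 | 10.03 | 7.32 | 6.79 | 5.46 | 5.79 | 3.26 | 6.26 | 7.32 | 4.53 | 1.93 |
| | M | HQ641407 | | 1.34 | | | 1.29 | 7.81 | 0.39 | 3.63 | 0.39 | 1.05 | 2.04 | 0.94 | 0.00 | 1.03 | 1.12 | 1.40 | 2.51 | | |
| Meretrix lamarckii | F | KP244451 | 49.65 | 55.03 | 37.86 | 27.41 | 61.65 | 43.82 | 45.83 | 59.95 | 45.08 | 51.38 | 41.22 | 52.56 | 51.58 | 58.08 | 40.76 | 70.81 | 62.34 | 37.47 | 38.34 |
| | M | KP244452 | | 19.57 | | | 13.55 | 9.71 | 12.95 | 43.17 | 15.42 | 17.81 | 4.82 | 13.75 | 18.90 | 9.93 | 16.90 | 33.45 | 28.73 | | |
| Musculista senhousia | F | GU001953 | 64.73 | 71.53 | 44.63 | 45.51 | 80.66 | | 54.79 | 71.26 | 74.24 | 36.14 | 72.38 | 94.98 | 76.24 | 82.73 | 63.26 | 90.98 | 73.89 | 61.15 | 24.44 |
| | M | GU001954 | | 23.19 | | | 27.21 | | 5.52 | 43.66 | 17.40 | 6.02 | 25.84 | 39.22 | 23.54 | 21.05 | 29.18 | 34.45 | 31.22 | | |
| Mytilus californianus | F | GQ527172 | 86.82 | 90.97 | 73.94 | 76.00 | 97.13 | | 61.28 | 93.29 | 78.24 | 79.71 | 75.76 | 89.64 | 106.51 | 124.15 | 127.89 | 105.55 | 135.27 | 77.64 | 69.09 |
| | M | GQ527173 | | 39.82 | | | 35.96 | | 11.87 | 35.05 | 21.50 | 26.53 | 35.36 | 72.89 | 46.92 | 43.94 | 49.80 | 79.85 | 79.85 | | |
| Mytilus edulis | F | NC_006161 | 62.25 | 71.85 | 39.72 | 32.36 | 73.33 | | 60.52 | 64.70 | 68.87 | 62.87 | 72.10 | 79.52 | 82.80 | 80.57 | 88.82 | 75.66 | 82.04 | 43.20 | 35.14 |
| | M | AY823623 | | 16.66 | | | 17.47 | | 4.70 | 11.60 | 9.24 | 19.40 | 14.85 | 27.86 | 12.17 | 16.85 | 16.94 | 20.60 | 56.50 | | |
| Mytilus galloprovincialis | F | NC_006886 | 64.68 | 73.22 | 45.18 | 36.47 | 74.60 | | 53.95 | 76.90 | 69.80 | 67.62 | 77.26 | 92.88 | 38.73 | 82.16 | 87.21 | 81.51 | 83.56 | 45.76 | 44.43 |
| | M | AY363687 | | 15.20 | | | 14.86 | | 4.50 | 10.66 | 11.91 | 15.28 | 16.07 | 25.51 | 4.44 | 22.44 | 16.42 | 22.26 | 25.29 | | |
| Mytilus trossulus | F | HM462080 | 65.35 | 75.63 | 36.44 | 36.82 | 67.12 | 99.83 | 55.70 | 70.59 | 75.06 | 66.15 | 71.69 | 78.44 | 89.01 | 85.48 | 97.43 | 91.61 | 90.81 | 37.64 | 34.79 |
| | M | HM462081 | | 19.12 | | | 13.84 | 67.99 | 5.86 | 12.11 | 12.61 | 13.61 | 14.24 | 27.31 | 19.65 | 22.44 | 17.58 | 46.48 | 23.50 | | |
| Pyganodon grandis | F | NC_013661 | 102.42 | 101.92 | 102.53 | 117.28 | 119.56 | 32.19 | 75.66 | 91.70 | 90.26 | 108.94 | 118.28 | 131.56 | 149.81 | 99.09 | 97.67 | 101.23 | 106.48 | 100.56 | 106.33 |
| | M | FJ809755 | | 85.18 | | | 88.68 | 111.47 | 43.24 | 78.68 | 56.89 | 57.66 | 100.86 | 132.61 | 118.81 | 138.89 | 97.42 | 116.94 | 143.55 | | |
| Quadrula quadrula | F | NC_013658 | 113.61 | 115.31 | 100.34 | 113.18 | 83.49 | 377.36 | 76.67 | 83.03 | 84.97 | 108.53 | 97.81 | 112.58 | 87.45 | 80.06 | 120.22 | 92.09 | 125.57 | 109.20 | 88.44 |
| | M | FJ809751 | | 84.23 | | | 88.77 | n/a (d) | 40.23 | 62.47 | 61.38 | 52.44 | 93.92 | 141.42 | 79.85 | 223.27 | 109.69 | 103.39 | 175.18 | | |
| Ruditapes philippinarum | F | NC_003354 | 75.33 | 81.28 | 59.10 | 64.07 | 86.49 | | 63.97 | 101.96 | 77.73 | 74.37 | 65.14 | 99.00 | 80.88 | 44.73 | 50.78 | 84.46 | 85.83 | 62.20 | 54.80 |
| | M | AB065374 | | 47.04 | | | 46.36 | | 12.46 | 87.16 | 47.50 | 39.61 | 25.59 | 66.01 | 38.69 | 259.28 | 44.58 | 46.13 | 99.67 | | |
| Solenaia carinatus | F | NC_023250 | 96.23 | 96.22 | 98.70 | 107.45 | 104.10 | 100.16 | 94.19 | 84.21 | 78.87 | 115.07 | 101.08 | 120.46 | 101.90 | 88.66 | 122.68 | 91.81 | 98.03 | 100.43 | 96.16 |
| | M | KC848655 | | 79.16 | | | 86.13 | 140.88 | 40.45 | 75.17 | 55.77 | 58.77 | 73.80 | 135.10 | 89.38 | 156.15 | 81.05 | 101.35 | 141.96 | | |
| Utterbackia peninsularis | F | HM856636 | 101.63 | 100.76 | 102.52 | 122.15 | 106.38 | 100.69 | 86.15 | 99.73 | 90.59 | 91.34 | 105.68 | 121.58 | 102.85 | 99.22 | 149.86 | 95.98 | 144.58 | 103.13 | 102.13 |
| | M | NC_015477 | | 86.40 | | | 95.57 | 186.71 | 41.77 | 79.85 | 58.33 | 60.88 | 78.05 | 137.17 | 112.12 | 193.71 | 101.15 | 108.45 | 175.68 | | |
| Venustaconcha ellipsiformis | F | FJ809753 | 97.71 | 96.96 | 103.10 | 109.14 | 98.23 | 85.24 | 73.69 | 96.01 | 87.25 | 97.83 | 143.08 | 106.92 | 71.45 | 99.65 | 142.04 | 91.67 | 130.07 | 101.71 | 105.26 |
| | M | NC_013659 | | 83.34 | | | 81.05 | 137.48 | 38.50 | 68.98 | 50.66 | 52.83 | 96.40 | 137.10 | 90.95 | 150.13 | 103.84 | 113.59 | 231.95 | | |
| Unionoidea (e) (N = 6) | | | 100.64 | 100.20 | 100.99 | 113.58 | 104.33 | 134.22 | 81.20 | 90.55 | 85.15 | 101.28 | 109.36 | 115.65 | 100.58 | 93.86 | 113.42 | 93.61 | 120.73 | 102.34 | 99.50 |
| | | | | 83.30 | | | 88.03 | 152.65 | 41.09 | 74.68 | 56.34 | 56.16 | 84.97 | 136.23 | 98.23 | 168.82 | 97.54 | 107.46 | 177.07 | | |
| Amarsipobranchia (f) (N = 7) | | | 66.97 | 74.22 | 48.12 | 45.52 | 77.28 | 71.83 | 56.58 | 76.95 | 69.86 | 62.61 | 67.94 | 83.86 | 75.11 | 79.70 | 79.45 | 85.80 | 87.68 | 52.15 | 43.00 |
| | | | | 25.80 | | | 24.18 | 38.85 | 8.27 | 34.77 | 19.37 | 19.75 | 19.54 | 38.94 | 23.47 | 56.56 | 27.34 | 40.46 | 49.25 | | |
| Overall (g) (N = 15) | | | 72.68 | 76.03 | 63.48 | 67.48 | 78.97 | 96.51 | 59.93 | 74.86 | 67.63 | 71.25 | 76.85 | 86.95 | 76.02 | 75.81 | 83.03 | 78.96 | 90.18 | 65.92 | 60.45 |
| | | | | 45.58 | | | 46.61 | 94.77 | 20.33 | 46.52 | 31.68 | 31.85 | 43.40 | 72.94 | 50.36 | 94.06 | 52.05 | 62.19 | 93.98 | | |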

a Complete genome excluding untranslated regions.

b Protein Coding Genes.

c atp8 gene is not annotated in some species. See text for details.

d The sequences are too divergent to compute the Kimura distance.

e Average values of Unionoidea. Hyriopsis cumingii and H. schlegelii were excluded because we are most probably comparing two female genomes; see text for details.

f Average values of Amarsipobranchia.

g Average values of all species.

Between the F- and the M-mtDNA, the nucleotide Jin-Nei distance of the complete genome (coding + non-coding) is 53.13, corresponding to a 16.19% divergence. Jin-Nei nucleotide distances are 81.53 (25.81%) for non-coding regions and 49.65 (14.89%) for coding genes. The PCG concatenation has an aminoacid Kimura score of 19.57 and, within that, the highest values belong to cox2 (43.17) and nad5 (33.45), while the lowest values are associated with nad1 (4.82) and atp8 (9.71). The average Jin-Nei distance between rRNAs is 37.86; the average Jin-Nei distance between tRNAs is 27.41. We also compared the two M. lamarckii mitochondrial genomes obtained in this paper (F and M) to the already sequenced M. lamarckii mtDNA present in the literature (GenBank Accession Number NC_016174). The uncorrected distance (p-distance) between this genome and our F genome is 9.81% for all the coding DNA, 10.67% for all PCGs, 8.17% for all rRNAs and 5.33% for all tRNAs. On the other hand, the divergence between this genome and our M genome is 14.65% for all coding DNA, 15.76% for all PCGs, 11.43% for all rRNAs and 10.72% for all tRNAs. In both cases, the most divergent gene is cox2 (15.09% and 21.72%, respectively), whereas the least divergent is atp8 (4.17% and 9.17%, respectively).
The aminoacid Kimura distance was also computed to account for synonymous substitutions (S7 Table): again, the NC_016174 sequence is always more similar to the F-mtDNA than to the M-mtDNA and the most divergent gene is cox2 (19.28 and 39.86, respectively), while the least divergent genes are atp8 (0.00 and 8.13) and nad1 (0.67 and 4.12). Generally speaking, with the exception of cox1 and cox2, the Kimura distance from M-mtDNA is always one order of magnitude higher than from F-mtDNA. The F vs M distances were also computed for all known DUI species whose complete mitochondrial genomes have been published (see Table 4). M. lamarckii has lower divergence values when compared to other DUI families such as Unionidae or Mytilidae. The Unionidae show, by far, the highest values, with divergence scores of coding DNA ranging from 92.24 in Anodonta anatina to 113.61 in Quadrula quadrula; Hyriopsis species show abnormally low distance values, not even comparable with average Unionidae family values. Unionidae are followed by Mytilidae, with an all-coding Jin-Nei distance ranging between 62.25 in Mytilus edulis and 86.82 in M. californianus. Within the Veneridae, Venerupis philippinarum (listed as Ruditapes philippinarum in Table 4) has a higher divergence value (75.33) than M. lamarckii (49.65). Across all species, the most divergent PCGs are atp8, nad4, and nad6 with average Kimura distances of 94.77, 94.06, and 93.98, respectively; the most conserved is cox1 (20.33). The resulting PCA plot (Fig 5) uses the first two components, which together explain 73.20% + 6.19% = 79.39% of distance variability (using Jin-Nei and Kimura distances together). Datasets are roughly arranged by overall divergence levels along the first principal component (Hyriopsis spp. < Mytilidae+Veneridae < Unionidae); the second principal component further separates venerids and mytilids from unionids.
Finally, it is impossible to reject the null hypothesis that nucleotide distance rankings among single PCGs in Unionidae and Mytilidae + Veneridae are unrelated, using either Spearman's ρ (p = 0.2392) or Kendall's τ (p = 0.2044). For example, cytb and nad1 are highly divergent for Unionidae, but they are among the least variable for Amarsipobranchia, while the opposite is true for nad4 and nad5.
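The rank comparison described above can be sketched as follows. This is an illustrative reimplementation, not the authors' procedure: it assumes that the compared quantities are the per-gene average nucleotide distances of the Unionoidea and Amarsipobranchia rows of Table 4, and it computes only the two coefficients, not their p-values. All function names are hypothetical.

```python
import math

GENES = ["atp6", "atp8", "cox1", "cox2", "cox3", "cytb", "nad1",
         "nad2", "nad3", "nad4", "nad4L", "nad5", "nad6"]
# Average nucleotide distances per PCG, copied from the average rows of Table 4.
UNIONOIDEA = [104.33, 134.22, 81.20, 90.55, 85.15, 101.28, 109.36,
              115.65, 100.58, 93.86, 113.42, 93.61, 120.73]
AMARSIPOBRANCHIA = [77.28, 71.83, 56.58, 76.95, 69.86, 62.61, 67.94,
                    83.86, 75.11, 79.70, 79.45, 85.80, 87.68]


def average_ranks(values):
    """Rank values from 1 to n; tied values share the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation coefficient of two equally long lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def spearman_rho(x, y):
    """Spearman's rho: Pearson correlation computed on the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


def kendall_tau_b(x, y):
    """Kendall's tau-b from concordant and discordant pairs, with tie correction."""
    conc = disc = ties_x = ties_y = 0
    n = len(x)
    for i in range(n):
        for j in range(i + 1, n):
            dx, dy = x[i] - x[j], y[i] - y[j]
            ties_x += dx == 0
            ties_y += dy == 0
            if dx * dy > 0:
                conc += 1
            elif dx * dy < 0:
                disc += 1
    n0 = n * (n - 1) // 2
    return (conc - disc) / math.sqrt((n0 - ties_x) * (n0 - ties_y))


if __name__ == "__main__":
    rho = spearman_rho(UNIONOIDEA, AMARSIPOBRANCHIA)
    tau = kendall_tau_b(UNIONOIDEA, AMARSIPOBRANCHIA)
    print(f"Spearman rho = {rho:.3f}, Kendall tau-b = {tau:.3f}")
```

A coefficient near zero means that the per-gene divergence ranking of one group carries little information about the ranking in the other; a significance test would be needed to reproduce the reported p-values.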
Fig 5

PCA plot.

Principal Component Analysis (PCA) based on both Jin-Nei and Kimura distances reported in Table 4. Colors refer to different families: blue, Unionidae with the exception of Hyriopsis spp. (brown; see text for details); Indian red, Mytilidae; green, Veneridae. AnAn, Anodonta anatina; HyCu, Hyriopsis cumingii; HySc, Hyriopsis schlegelii; MeLa, Meretrix lamarckii; MuSe, Musculista senhousia; MyCa, Mytilus californianus; MyEd, Mytilus edulis; MyGa, Mytilus galloprovincialis; MyTr, Mytilus trossulus; PyGr, Pyganodon grandis; QuQu, Quadrula quadrula; RuPh, Ruditapes philippinarum; SoCa, Solenaia carinatus; UtPe, Utterbackia peninsularis; VeEl, Venustaconcha ellipsiformis.

Discussion

Comparison with the previously published Meretrix lamarckii mitogenome

The two mitochondrial genomes of Meretrix lamarckii (F and M) sequenced here are slightly shorter than the one previously reported in GenBank: 20,025 (F) and 19,688 (M) bp against 21,209 bp [18]. The published genome was extracted from a somatic tissue (the foot muscle; [18]) and, indeed, it was previously attributed to the female type by Plazzi and colleagues [10]. The phylogenetic analysis of the present work further corroborates this hypothesis (Fig 4). The published genome shares the same gene content and gene order as ours, with the exception of trnL(AAG), trnQ(UUG) and trnF(AAA), which have been found only in M-mtDNA. This, again, strengthens the idea that the previously published genome is from the female lineage. However, the comparison between either of our genomes and the published F genome showed surprisingly high divergence values. These were generally one order of magnitude higher in the comparison with our M genome and, as in the case of our F/M comparison, the divergence ranking is similar: for instance, the highest values are obtained from cox2, while the lowest scores are obtained from atp8. Significant divergence values are still observed at the aminoacid level for both sexes; the distances between our M-mtDNA and the published F genome are comparable to those computed between the two sexes in the present study (Table 4 and S6 Table). It is possible to find similarities in the LUR structure between the two F genomes, with a highly folded region dividing a supernumerary ORF from a region with tandem repeats of about 100 bp in length. However, in the published F genome it was not possible to find a functional ORF (S9 Fig). This may be due to sequencing errors; if the available sequence is confirmed, it is hard to say whether an ORF was originally present in the species and was subsequently pseudogenized in some populations or a novel ORF appeared in some others. Given the widespread presence of supernumerary mitochondrial ORFs in bivalves [71, 62, 72, 63, 5, 73–74], we largely favor the first hypothesis.
On the other hand, it is possible to align the 3' repeated motifs (S10 Fig). The great variation in length between the two LURs is due to the different number of repeats: 2 in our F genome, 13 in the published one. This difference, in turn, accounts for the aforementioned difference in length between the two genomes. Interestingly, intra-specific variability in the number of repeats in a mitochondrial LUR has been reported elsewhere [75-76] for (DUI) bivalves. The two M. lamarckii specimens sampled for this research come from the Tokyo area (Japan), whereas the F genome available in GenBank comes from Zhejiang (China) [18]. Since the Chinese specimen was only tentatively identified as M. lamarckii due to its similar morphology, despite showing some differences in color, shell shape and thickness [18], we cannot completely rule out the hypothesis that these specimens belong to different species; however, it is not inconceivable that the differences found here simply reflect the high degree of taxonomic distinctness between Japanese and Chinese clams belonging to very distant populations.
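The claim that the repeat number accounts for the length difference can be checked with simple arithmetic on figures given in the text. The sketch below assumes that the repeat unit is 109 bp in both F genomes (the length reported for the repeats of our F-LUR); the variable names are illustrative.

```python
# Genome lengths and repeat counts as reported in the text.
f_genome_bp = 20_025          # F genome sequenced here
published_genome_bp = 21_209  # previously published genome
repeats_f = 2
repeats_published = 13
repeat_unit_bp = 109          # assumption: same unit length in both genomes

length_gap = published_genome_bp - f_genome_bp               # 1184 bp
repeat_gap = (repeats_published - repeats_f) * repeat_unit_bp  # 1199 bp
residual = length_gap - repeat_gap                           # -15 bp

print(f"genome length gap: {length_gap} bp")
print(f"expected from repeats: {repeat_gap} bp")
print(f"residual not explained by repeats: {residual} bp")
```

Under this assumption the eleven extra repeats (1,199 bp) match the 1,184-bp difference between the two genomes to within 15 bp.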

Nucleotide composition and codon usage

A-T content in M. lamarckii F and M genomes is slightly higher than the average A-T content in bivalves, but it is comparable with those of other DUI organisms such as V. philippinarum and M. senhousia [63]. The coding strand (+) is G-T rich: this is expected [77-78] and in good agreement with [63], where it was stated that a higher G-T percentage is associated with mtDNAs in which most (if not all) genes are located on the same strand. Codon usage reflects the general nucleotide composition of the two genomes, with a high presence of T in the most used codons. In almost all cases, except for trnL(TAA), trnK(TTT), and trnM(CAT) in F-mtDNA and for trnL(TAA), trnF(AAA) and trnK(TTT) in M-mtDNA, within four-fold or two-fold degenerate codon families the most used codons do not have a complementary anticodon in mitochondrially-encoded tRNAs (Table 3). Moreover, they differ by only one base (the third one) from the synonymous codon for which a complementary tRNA exists in the mitochondrion. The codon usage table demonstrates the presence of high degrees of third-base wobbling in M. lamarckii, as previously seen in other bivalves [8, 63]: a tRNA can have a non-standard base at the first anticodon position, which pairs with more than one base and allows it to bind codons that are not perfectly complementary.
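The codon/anticodon relationship used in this reasoning can be made explicit with a small sketch. It assumes that both codon and anticodon are written 5' to 3' in the DNA alphabet (U is read as T), so that the perfectly matching codon is the reverse complement of the anticodon; the function names are hypothetical.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def perfect_codon(anticodon):
    """Codon (5'->3') that pairs base by base with an anticodon (5'->3')."""
    anticodon = anticodon.upper().replace("U", "T")
    # The first anticodon base pairs with the third codon base, hence the reversal.
    return "".join(COMPLEMENT[base] for base in reversed(anticodon))


def needs_wobble(codon, anticodon):
    """True if the codon matches at positions 1-2 but not at the third position."""
    codon = codon.upper().replace("U", "T")
    exact = perfect_codon(anticodon)
    return codon[:2] == exact[:2] and codon[2] != exact[2]


if __name__ == "__main__":
    print(perfect_codon("AAA"))        # trnF(AAA) pairs exactly with TTT
    print(perfect_codon("TAA"))        # trnL(TAA) pairs exactly with TTA
    print(needs_wobble("TTG", "TAA"))  # TTG can only be read through wobbling
```

A codon flagged by `needs_wobble` is one that differs only at the third base from the codon with a perfectly complementary tRNA, which is the situation described for most of the highly used codons.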

PCGs and the cox2 insertion

With the exception of the supernumerary ORF, all genes are located on the same strand in both F and M mtDNAs of M. lamarckii. This is commonly found in all Amarsipobranchia, while unionids [71, 9, 63] and Solemya velum [63] encode genes on either strand. This finding reinforces the hypothesis that the condition found in M. lamarckii is a derived state, which evolved once in the common ancestor of Pteriomorphia and Heterodonta ([9]; and references therein). The atp8 gene was declared missing in several bivalve species [8], especially in the genus Mytilus [79], even though it was recently found in some bivalves like Solemya velum [63], Musculista senhousia [8], Venerupis philippinarum [80], and now in M. lamarckii. In addition, recent studies [9, 81] found this gene in species in which it was not previously annotated. The use of the Jin-Nei corrected distance to evaluate nucleotide divergence revealed that atp8 is not less conserved than other mitochondrial genes (see Table 4). As suggested elsewhere [5, 82], there is a strong possibility that the absence of this gene is simply due to past annotation difficulties or inaccuracy. Up to now, the presence/absence of ATPase subunit 8 does not appear to be linked with DUI [9], but rather, if confirmed, to phylogeny [63]. It was suggested that the commonest situation in metazoans is for the two ATPase subunits, atp8 and atp6, to be adjacent and overlapping [79]. This especially holds if the co-translation of these genes from a bicistronic transcript (as is the case in mammals; [83]) is confirmed as a widespread rule. In fact, this association is present in the Unionidae [71] and it was recently found in S. velum [63]; however, a disjointed location of atp8 and atp6 has already been highlighted for some heterodont bivalves, like Hiatella arctica [80] and Macoma balthica [82]. Similarly, in M. lamarckii atp8 and atp6 are not neighboring in either F- or M-mtDNA: again, the contiguity of these genes may be an example of an ancestral state that was subsequently lost in derived bivalves.
The M genome of M. lamarckii presents an insertion of 100 codons in the cox2 gene, which is totally absent in the F counterpart (Fig 2). It is not the first time that an M-cox2 gene is found to be longer than F-cox2; generally, however, these extensions map to the 3’ end of the gene. In fact, the M-cox2 3’ tail is present in all three subfamilies of Unionidae [5]. This extension (Mcox2e) has been found only in M mtDNAs and varies in length between 177 and 192 bp [84-85]. Mcox2e has been found in poly-adenylated transcripts of cox2 obtained from male gonads, and was also shown to be translated and localized in both inner and outer mitochondrial membranes [84-86]. The structural analysis of unionid Mcox2e sequences reveals the presence of the two canonical N-terminal trans-membrane helices (TMHs). In addition to that, several additional TMHs were found in Mcox2e [87]. For the above-mentioned reasons, it was proposed that such an extension may be a mitochondrial tag implicated in male mitochondria surviving elimination and in differential segregation during development [87]. Outside the unionid family, the pattern of cox2 variations among DUI M and F lineages is unclear and not easy to unravel. In mytilids, no extensions were found in M genomes of Mytilus [5], but a duplicated cox2 gene (cox2b) is found in M M. senhousia [8], with the duplicated gene being longer than the original one at the 3’ end. A putative TMH of 41 residues was found in the cox2b tail [8], allowing the authors to hypothesize a correlation between the unionid Mcox2e and the cox2b tail of M. senhousia. V. philippinarum and M. lamarckii are the only known DUI species of the family Veneridae: in V. philippinarum, a duplication of the cox2 gene, similar to that of M. senhousia (i.e. longer at the 3’ end), was found but, in contrast, it is located in the F genome [8]. However, additional TMHs (either in insertions or tails) are not detectable in V. philippinarum cox2, nor in M. lamarckii (S11 Fig). Moreover, @TOME analysis did not find any intron, and the coding frame is apparently kept (see Fig 2). In conclusion, it was impossible to properly assign a function in silico to this region, and further analyses are therefore mandatory in this regard. Non-canonical features in the cox2 gene are often coupled with DUI, but a general rule is still not evident and each DUI system seems to follow its own evolutionary pathway. However, although a relationship between cox2 variations and the DUI phenomenon has not been demonstrated yet, the finding of a new M-cox2 gene insertion (albeit differently located in the gene) in another DUI bivalve is an interesting clue.

Supernumerary tRNAs in M-mtDNA

As mentioned above, M and F genomes basically share the same gene arrangement, the only difference being three tRNAs in M-mtDNA. As a consequence of the high variability of their mitochondrial genomes, additional tRNA copies are common in bivalves [88-90]. In fact, when aligning the M additional tRNAs with the region mapped in the same position in the F-mtDNA, high levels of sequence similarity were always detected (see S5 Fig). Therefore, we may hypothesize that the duplication of trnL, trnQ, and trnF took place before the separation of the two sex-linked lineages, and that, afterwards, the F copies either became pseudogenes or remained functional tRNAs that the in silico methods are not able to retrieve. In any case, it should be noted that the anticodon region of the F counterpart of M-trnQ(TTG) would be complementary to the stop codon TAA. The presence of a tRNA in the middle of the M-LUR (trnF(AAA)) is intriguing and deserves further investigation: possibly, the cloverleaf structure of a tRNA was co-opted as part of the signaling structure of the putative control region (see below) and, thus, would not correspond to a functional tRNA. However, it is noteworthy that the anticodon of the middle-LUR tRNA is AAA, which is complementary to TTT, the most used codon in both genomes (see Table 3). The presence of a functional tRNA in the middle of a control region, where it may also work as a signaling sequence, would make the trnF(AAA) gene of the M-mtDNA of M. lamarckii a good example of an evolutionary spandrel [91] and/or a case of molecular exaptation: this region, being a tRNA, necessarily had a complex secondary structure, and this became useful in the wider context of the control region as well (or vice versa, even if the presence of a degenerated tRNA in the F-mtDNA makes us prefer the first hypothesis). The presence of a tRNA-like structure was already reported by [67] in the Mytilus spp. LUR, but in the case of M. lamarckii it seems that the tRNA maintained its functionality. Other expected non-canonical tRNA structures are found in our genomes: for instance, in both F- and M-mtDNA two trnS were found and the DHU arm was not recovered in trnS(UCU). However, as mentioned by [8], this unusual tRNA has been found in several other animal groups and it evolved early in Metazoa [92]. In vitro analysis further confirmed its functionality [93].

Control Region (CR) and the Origin of Replication (OR)

Several parameters have been proposed to identify the mtDNA control region (CR). The most used are the presence of repetitive elements, palindromes, length, high A-T content and secondary structures with T-rich loops [67, 94, 71]. The M. lamarckii Long Unassigned Region (LUR) is the longest UR in both F and M mtDNAs, although the M one is apparently split into two parts by a supernumerary phenylalanine tRNA (as mentioned above). A-T content is roughly the same as in the entire genomes, 64.2% for F and 65.8% for M, even though several poly-T stretches were found during sequencing (and heavily hampered it). The short (15-bp long) repeat is essentially a stretch of G and A and may simply reflect the general G-T-/A-T-richness of both genomes in a region where fewer selective constraints are at work; however, this repeat is conserved in both F and M genomes and it is known that similar G-rich sequences are present in Mytilus and human control regions, where they are related to replication and/or transcription [67]. The 109-bp long repeats are located near the 3' end of the F-LUR sequence and, due to their proximity to the putative origin of replication (see below), they may play a functional role in F mtDNA duplication (but recall that they are not detectable in M). Both M- and F-LUR present a central region which appears to be heavily folded (S6 Fig): again, this secondary structure may play some role in initiating the replication/transcription process [67, 71]. The nucleotide composition at fourfold degenerate sites is related to the duration of the single-strand state during mtDNA replication. As detailed in ([95–98, 71]; and references therein), the longer the heavy (H) strand remains unpaired, the more the spontaneous hydrolytic deamination of C to U and A to hX (hypoxanthine) takes place.
Such an increase of T and hX in the H strand leads to a corresponding increase in the percentages of A and C in the complementary light (L) strand where the H strand remains for a longer time in the single-stranded condition, i.e. near the OR. Moreover, single-stranded guanine may spontaneously oxidize to 8-hydroxyguanine, which basepairs with adenine: thus, in this case, G decreases and T increases on the H strand. In a nutshell, T will only tend to accumulate near the origin of replication of the H strand, while the opposite is true for A and C; finally, G may behave in either way [95, 98]. This asymmetrical composition can leave a neutral signature in fourfold degenerate sites, since they are under no or weak selection. The 700-bp sliding window analysis on these sites is in agreement with this model (Fig 3): with the exception of A (and of T in F-mtDNA), all correlations are significant, even if R² values are very low (<25%). Setting cox3 as the starting point of the pattern, T in M-mtDNA tends to decrease, C tends to increase, and G decreases in the F-mtDNA and increases in the M one, which would be expected if the OR is located upstream of cox3. This also points to the conclusion that the "+" strand is in fact the H strand (as expected, since all genes are located on it). The A-T skew at four-fold degenerate sites is known to be correlated with the position of the ORs as well: extreme (i.e., closer to ±1) values are associated with PCGs located near the OR of the H strand, while balanced (i.e., closer to 0) values are associated with PCGs located near the OR of the L strand [95, 97, 99]. Given the overall high T-richness of these genomes, the A-T skew at four-fold degenerate sites is always negative: however, the lowest values (i.e., closer to −1) are associated with the LUR and with the cox3 gene (Fig 3B), while the highest values (i.e., closer to 0) are associated with the nad2/nad4L and atp6/nad3 regions.
Therefore, we have further evidence that the LUR contains the OR of the H strand; moreover, it is tempting to conclude that either the nad2/nad4L or the atp6/nad3 region is the OR of the L strand. Both regions are neighbored by a two- or three-tRNA cassette, and it has been shown that an array of tRNAs on a strand may act as OR in the opposite one through alternative secondary structure [100]. If the OR of the L strand were located in the atp6/nad3 region, which shows A-T skews closer to 0 (Fig 3B) and is near three tRNAs in either sex (Fig 1), this would leave the OR of the L strand quite distant from the OR of the H strand, a situation very similar (if not more extreme) to that of Mytilus [98] and unionids [71]. However, it is possible that more complex patterns are obscured by the presence of all genes on the same strand and by the high T-richness of both genomes (recall that the A-T skew for the third codon position of PCGs is −0.44 for both genomes; Table 2). In conclusion, we gathered seven pieces of evidence that the F-LUR and the M-LUR are the control regions of M. lamarckii mtDNAs; as detailed above, most of these features are shared with other DUI species, namely Mytilus spp. [67, 98] and Unionidae [71]. First of all, we have (i) a complex secondary structure that, if the supernumerary ORFs are expressed (see below), would involve the complete LUR. Within that, we found, approximately from 5' to 3': (ii) the presence of G-rich elements; (iii) the presence of a tRNA (only in M); (iv) a sequence with some homology to the human TAS element (only in F); (v) the 109-bp long repeats (only in F). Finally, downstream from the LUR, we detected (vi-vii) the two above-mentioned nucleotide composition trends. Since these features are unique to this region, we propose that the LUR acts as the CR and contains the OR of the H strand.
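The A-T skew used in this argument is (A − T)/(A + T), evaluated in windows sliding along the genome. The sketch below is a minimal illustration, not the analysis behind Fig 3: it assumes that the input is a string of the bases found at fourfold degenerate sites, in genome order, and that the window size is counted in such sites; the step size and the function names are arbitrary choices.

```python
def at_skew(bases):
    """A-T skew, (A - T) / (A + T); None if the window holds neither A nor T."""
    a = bases.count("A")
    t = bases.count("T")
    return (a - t) / (a + t) if a + t else None


def sliding_at_skew(sites, window=700, step=100):
    """List of (window start, A-T skew) pairs along a string of sites.

    Assumption: `sites` contains only the bases at fourfold degenerate
    positions, ordered as they occur from the chosen starting gene.
    """
    sites = sites.upper()
    return [(start, at_skew(sites[start:start + window]))
            for start in range(0, len(sites) - window + 1, step)]


if __name__ == "__main__":
    toy_sites = "TTTTTTAT" * 50 + "TATATTAT" * 50  # toy data, T-rich throughout
    for start, skew in sliding_at_skew(toy_sites, window=200, step=200):
        print(start, round(skew, 2))
```

In a T-rich genome every window is negative, as noted above; what matters for locating the origins of replication is where the values lie closest to −1 and where they lie closest to 0.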

Supernumerary ORFs

Many DUI species, like M. senhousia and Mytilus spp., present supernumerary ORFs with no known homologies with other proteins (i.e., ORFans; [101]), which are located in the LUR. M. lamarckii is no exception. For such ORFs a correlation with the DUI phenomenon has been suggested [71–72, 5], even if the opposite was also proposed, interpreting the RNA transcripts as degradation intermediates [102]. Supernumerary ORFs were also found in the basal species S. velum, leading to the hypothesis that they constitute a plesiomorphy among bivalves [63]. Although some of these ORFs have incontrovertibly been shown to be translated [71-73], it is uncertain whether the M. lamarckii ones are even transcribed. This issue can be assessed only by looking at expression data but, currently, without an available transcriptome/proteome of M. lamarckii, we can neither confirm nor disprove the functionality of either ORF. However, a precise homology between F and M ORFans was detected, which would not be expected if these sequences did not share a common ancestor; furthermore, an ORFan with high homology to F_ORF141 has been found in all species belonging to the genus Meretrix. Interestingly, this would make this supernumerary ORF the only gene located on the reverse strand in all the Meretrix genomes (S5 Table).

Sex-linked mtDNA diversification and evolution in DUI bivalves

The two entire M. lamarckii genomes diverge by 16% on average (see also [10]), hence the divergence between F and M mtDNAs is somewhat lower than in other DUI species. On the other hand, the most diversified genomes belong to unionids (around 35%), followed by mytilids (around 25%). The other venerid, V. philippinarum, shows levels of divergence comparable to those of mytilids (26%). In this work, nucleotide Jin-Nei and aminoacid Kimura distance values of all DUI species (whose complete mitochondrial genomes are available in GenBank) were calculated between M- and F-type mitogenomes to estimate divergences and give an idea of the rate of independent evolution between the two sex-linked genomes. We strongly advocate the use of corrected distance methods, like the Jin-Nei and Kimura formulae, over the uncorrected p-distance, because of the high divergence between sex-linked mtDNAs in many DUI species and the overall high variability of the molluscan mitochondrial genome, where significant levels of saturation and multiple-hit events are quite common (see, for instance, [61]). Both Hyriopsis cumingii and H. schlegelii Jin-Nei distance values are surprisingly low, in contrast with all other sequenced unionid (and, in general, DUI) mtDNAs. However, there is a chance that there was an error in assigning the paternal route of transmission to genomes retrieved from males. Actually, as reported in GenBank, the H. cumingii M genome (GenBank Accession Number HM347668) was indeed extracted from mantle tissue, whereas the source of the F genome (GenBank Accession Number NC_011763) is not reported. Conversely, H. schlegelii F and M genomes (GenBank Accession Numbers NC_015110 and HQ641407, respectively) were extracted from gonad tissue, not from gametes, and gonad tissue is known to contain somatic cells carrying the F genome. Furthermore, the study of the cox2 gene reveals that the M mtDNAs of both species do not have the additional putative TMHs typical of all other unionid M-cox2 genes (see above; S11 Fig).
These lines of evidence suggest that both genomes belong to the F type and that their minimal divergence is due to normal intraspecific variability. More interestingly, in the PCA of distance scores (Fig 5), DUI species clustering follows the taxonomic arrangement of bivalves. Two large assemblages are visible: unionid species on one side, and Amarsipobranchia (i.e. Veneridae + Mytilidae) on the other. In fact, the divergence of the two mtDNAs is higher in unionids (Table 4). This may point to the conclusion that DUI is somehow different in these two lineages, leading to distinct patterns of sequence evolution. This is not a new observation, since differences in many aspects of DUI have repeatedly been reported between Unionidae and Mytilidae + Veneridae (see, e.g., [5, 14]; and references therein). In particular, the main difference is that unionids established M- and F-mtDNA lineages before species radiation, leading to a higher divergence between sex-linked lineages and, in turn, to a very strict "gender-joining" phylogenetic pattern [84–85, 5]. Conversely, in Amarsipobranchia, two tentative, perhaps overlapping explanations were given to account for the observed "species-joining" pattern, with the only exception of the fairly recent Mytilus edulis species complex ([5]; see also Fig 4 for Veneridae): (i) multiple role reversal events, as well as reversions to SMI, may have blurred the phylogenetic and diversification pattern [103–104, 5], and/or (ii) DUI and the establishment of the two sex-linked mitogenomes may have happened many times in different lineages. This was a conceivable hypothesis given the model described in [105, 5], where a relatively simple switch, called factor Z, is proposed to trigger DUI/SMI swaps. However, it is also worth pointing out that genetic divergence behaves differently in single-gene pairwise comparisons, and this is not expected if we consider the observed variability as a function of the DUI onset time only.
For example, unionids show high distance values for cytb and nad1, which are among the most conserved genes within Amarsipobranchia, and vice versa for nad4 and nad5 (Table 4). Currently, it is not possible to speculate on the reasons for such divergence patterns, and more comparative and structural analyses are needed.
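The argument made in this section for corrected over uncorrected nucleotide distances can be illustrated with a minimal sketch. It compares the p-distance with the Jin-Nei distance, taken here as the gamma-corrected Kimura two-parameter formula d = (a/2)[(1 - 2P - Q)^(-1/a) + (1/2)(1 - 2Q)^(-1/a) - 3/2], where P and Q are the proportions of transitions and transversions and a is the gamma shape parameter. The function names, the default shape value a = 1 and the toy sequences are assumptions made for illustration only; the shape value used for the analyses is not stated in this section.

```python
import math

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def count_differences(seq_a, seq_b):
    """Return (compared sites, transitions, transversions) for two aligned
    sequences, skipping any site with a gap or ambiguity (pairwise deletion)."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    sites = transitions = transversions = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a not in "ACGT" or b not in "ACGT":
            continue
        sites += 1
        if a == b:
            continue
        # A<->G and C<->T are transitions; everything else is a transversion.
        if {a, b} <= PURINES or {a, b} <= PYRIMIDINES:
            transitions += 1
        else:
            transversions += 1
    if sites == 0:
        raise ValueError("no comparable sites")
    return sites, transitions, transversions


def p_distance(seq_a, seq_b):
    """Uncorrected proportion of differing sites."""
    sites, ts, tv = count_differences(seq_a, seq_b)
    return (ts + tv) / sites


def jin_nei_distance(seq_a, seq_b, shape=1.0):
    """Gamma-corrected Kimura two-parameter distance.
    shape is the gamma parameter a; the default of 1.0 is an assumption."""
    sites, ts, tv = count_differences(seq_a, seq_b)
    p_ts = ts / sites
    q_tv = tv / sites
    term_a = 1 - 2 * p_ts - q_tv
    term_b = 1 - 2 * q_tv
    if term_a <= 0 or term_b <= 0:
        raise ValueError("sequences too divergent for this correction")
    return (shape / 2) * (
        term_a ** (-1 / shape) + 0.5 * term_b ** (-1 / shape) - 1.5
    )


if __name__ == "__main__":
    # Toy alignment (invented): one transition and one transversion in 10 sites.
    toy_f = "AAAAAAAAAA"
    toy_m = "GAAAACAAAA"
    print("p-distance:", p_distance(toy_f, toy_m))
    print("Jin-Nei   :", jin_nei_distance(toy_f, toy_m))
```

On these toy sequences the corrected value exceeds the p-distance, because the correction accounts for multiple hits that the raw proportion of differences cannot see; the gap widens as sequences approach saturation.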

Conclusions

The present phylogenetic reconstruction (Fig 4) corroborates previous evolutionary trees of venerids [106, 10] and, above all, indicates future lines of research: the detection of DUI in other genera of the family Veneridae and/or in other species of the genus Meretrix would lend support to the single DUI origin hypothesis (at least for Heterodonta; [14]), while the direct observation of SMI in those groups would probably lead to a re-evaluation of the parsimony approach to the origin of DUI proposed in [14]. Furthermore, investigating the distribution of DUI within the genus Meretrix would open the field for comparisons with the Mytilus species complex, which is the only known case of a gender-joining pattern among Amarsipobranchia. The great genome variability shown by bivalves at the mitochondrial level may obscure mtDNA similarities between distantly related DUI species, so that comparisons between taxonomically closer DUI species are needed to further characterize and understand the DUI mechanism and the related molecular machinery. This opportunity has so far been unavailable for venerids. Therefore, the sequencing and characterization of M. lamarckii mtDNAs presented here make this species a useful experimental counterpart of V. philippinarum, which in turn has been thoroughly characterized in recent years (see, e.g., [107–111, 74]).

Stem-loop secondary structures of F and M Meretrix lamarckii.

Inferred stem-loop secondary structures of all Unassigned Regions (URs) located between two neighboring protein coding genes (PCGs). The label of each structure is obtained by concatenating "UNs" (Unassigned Nucleotides) and the two PCG names. (PDF)

Meretrix lamarckii F and M rRNA secondary structures.

A, F-rrnL; B, F-rrnS; C, M-rrnL; D, M-rrnS. (PDF)

Meretrix lamarckii F tRNA secondary structures.

All amino acids are reported with their one-letter code; anticodons are highlighted in yellow. (PDF)

Meretrix lamarckii M tRNA secondary structures.

All amino acids are reported with their one-letter code; anticodons are highlighted in yellow. (PDF)

Alignments of M additional tRNAs and corresponding F Unassigned Regions (URs).

The M anticodons are highlighted, while stretches of nucleotides involved in tRNA stem-loop structures are underlined. Only the relevant part of the corresponding F-UR is shown. (PDF)

Meretrix lamarckii F and M Long Unassigned Regions.

F (A) and M (B) M. lamarckii Long Unassigned Region (LUR) inferred secondary structures. The 109-bp tandem repeat that was detected in F-LUR is detailed in the upper-right inset; yellow lines, first repeat; purple lines, second repeat. (PDF)

Alignment of the F Long Unassigned Region (F-LUR) and the human Termination-Associated Sequence (TAS) element.

Numbers refer to the positions on the mitochondrial genomes. The TAS element was taken from [66] and located on the revised Cambridge Reference Sequence (GenBank Accession Number NC_012920). (PDF)

Autocorrelograms.

Autocorrelograms for nucleotide trends shown in Fig 3: the autocorrelation function (acf) is plotted for lags from 0 to 17. Page 1, female mitochondrial genome; page 2, male mitochondrial genome; dashed lines, large-lag 95% standard errors. (PDF)

Female unassigned ORFs.

Alignment between the unassigned ORF found in the LUR of the female mitochondrial genome (MeLaF) and the corresponding region of the published Meretrix lamarckii mitochondrial genome (MeLaNC_016174); numbers refer to positions on the GenBank sequences. Regions of mutations pseudogenizing the putative lost ORF are shaded in black in the published sequence; asterisks mark identical nucleotides. (PDF)

Female repeated motifs.

MUSCLE alignment between the 109-bp repeated motif of the female LUR (MeLaF) and the 100-bp repeated motif of the published LUR (MeLaFNC_016174). Asterisks mark identical nucleotides. (PDF)

Transmembrane helices (TMHs) of F- and M-cox2 of all DUI species.

Phobius predictions of cox2 residue locations for all DUI species used in this work. The cox2 length is reported on the x axis; the y axis shows the posterior probability that a given position is part of a TMH (gray), cytoplasmic (green), non-cytoplasmic (blue), or part of a signal peptide (red). (PDF)

R script used to compute nucleotide composition and A-T content at four-fold degenerate sites over a sliding window.

The script is called 4F; example files and a tutorial are also provided. The same script can be downloaded at the GitHub repository https://github.com/mozoo/4F.git. (GZ)

Primers used to amplify Meretrix lamarckii F and M mitochondrial genomes.

Primers are listed in pairs; for each pair, the column "Strand" indicates the forward (F) and the reverse (R) primer. The column "Sex" specifies whether a given pair was used for the female genome (F), for the male genome (M), or for both (both). For amplicons > 2,000 bp the Herculase enzyme was used (see text for details). Where two annealing temperatures are listed, the first one refers to the female genome and the second one to the male genome. (PDF)

Phylogenetic dataset.

Sequences in boldface were obtained for this study. Taxonomy is taken from GenBank. (PDF)

Meretrix lamarckii F (A) and M (B) Unassigned Regions (URs).

(PDF)

Genes located by Glimmer3 software in F and M mtDNAs.

The canonical 13 PCGs (bold) and all other Open Reading Frames (ORFs) are reported along with their start base, stop base, frame, and Glimmer score. F_ORF141 and M_ORF138 are shown in bold as well. (PDF)

Putative supernumerary ORFs in Meretrix spp.

Homologies of F_ORF141 with ORFs in other Meretrix mitochondrial genomes; the first entry is F_ORF141 itself. (PDF)

Meretrix lamarckii single tRNA Jin-Nei distances.

(PDF)

Present study vs published genome Kimura distances.

The Kimura amino acid distance is listed for each PCG. F, female genome; M, male genome; NC_016174, published Meretrix lamarckii mitochondrial genome. The F-atp8 gene has 11 amino acids at the 5' end that are lacking in the published atp8 gene; as the remaining part of the peptide sequence is identical, pairwise deletion led to a Kimura distance of 0. (PDF)
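
The effect of pairwise deletion noted for atp8 can be reproduced with a minimal sketch of the Kimura amino acid distance, d = -ln(1 - p - 0.2p^2), where p is the proportion of differing residues among the sites compared. The function name and the toy peptides are hypothetical and invented for illustration; they are not the actual atp8 sequences. Sites where either sequence has a gap are skipped, so an 11-residue extension on one sequence contributes nothing to the distance.

```python
import math


def kimura_protein_distance(seq_a, seq_b, gap="-"):
    """Kimura amino acid distance with pairwise deletion:
    alignment columns containing a gap in either sequence are ignored."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = differences = 0
    for a, b in zip(seq_a, seq_b):
        if a == gap or b == gap:
            continue  # pairwise deletion of gapped columns
        compared += 1
        if a != b:
            differences += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    if differences == 0:
        return 0.0
    p = differences / compared
    argument = 1 - p - 0.2 * p * p
    if argument <= 0:
        raise ValueError("sequences too divergent for the Kimura correction")
    return -math.log(argument)


if __name__ == "__main__":
    # Invented toy peptides: the first carries 11 extra N-terminal residues,
    # the second is gapped there and identical over the remaining sites.
    longer = "MPQLSPINWLL" + "FSFWLVVLT"
    shorter = "-" * 11 + "FSFWLVVLT"
    print(kimura_protein_distance(longer, shorter))
```

Under complete deletion the result would be the same here, since only two sequences are compared; the distinction matters when gapped columns are dropped across a larger alignment.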