Literature DB >> 28789640

The purplish bifurcate mussel Mytilisepta virgata gene expression atlas reveals a remarkable tissue functional specialization.

Marco Gerdol1, Yuki Fujii2, Imtiaj Hasan3,4, Toru Koike2, Shunsuke Shimojo2, Francesca Spazzali5, Kaname Yamamoto2, Yasuhiro Ozeki3, Alberto Pallavicini5, Hideaki Fujita2.   

Abstract

BACKGROUND: Mytilisepta virgata is a marine mussel commonly found along the coasts of Japan. Although this species has been the subject of occasional studies concerning its ecological role, growth and reproduction, it has been so far almost completely neglected from a genetic and molecular point of view. In the present study we present a high quality de novo assembled transcriptome of the Japanese purplish mussel, which represents the first publicly available collection of expressed sequences for this species.
RESULTS: The assembled transcriptome comprises almost 50,000 contigs, with a N50 statistics of ~1 kilobase and a high estimated completeness based on the rate of BUSCOs identified, standing as one of the most exhaustive sequence resources available for mytiloid bivalves to date. Overall this data, accompanied by gene expression profiles from gills, digestive gland, mantle rim, foot and posterior adductor muscle, presents an accurate snapshot of the great functional specialization of these five tissues in adult mussels.
CONCLUSIONS: We highlight that one of the most striking features of the M. virgata transcriptome is the high abundance and diversification of lectin-like transcripts, which pertain to different gene families and appear to be expressed in particular in the digestive gland and in the gills. Therefore, these two tissues might be selected as preferential targets for the isolation of molecules with interesting carbohydrate-binding properties. In addition, by molecular phylogenomics, we provide solid evidence in support of the classification of M. virgata within the Brachidontinae subfamily. This result is in agreement with the previously proposed hypothesis that the morphological features traditionally used to group Mytilisepta spp. and Septifer spp. within the same clade are inappropriate due to homoplasy.

Entities:  

Keywords:  Bivalve; Lectins; Mollusk; Mussel; Phylogenomics; RNA-seq; Transcriptome

Mesh:

Substances:

Year:  2017        PMID: 28789640      PMCID: PMC5549309          DOI: 10.1186/s12864-017-4012-z

Source DB:  PubMed          Journal:  BMC Genomics        ISSN: 1471-2164            Impact factor:   3.969


Background

The purplish bifurcate mussel Mytilisepta virgata (Wiegmann, 1837), also known as Septifer virgatus, is a small bivalve mollusk species commonly found in the middle/upper intertidal zone of moderately wave-exposed shores along the coasts of Japan, Taiwan, and South Eastern China [1], between −10 and +70 cm above the mean tide level [2]. M. virgata is usually found in dense mussel beds, whose lower limit of vertical distribution is often determined by the association with the lower-intertidal mussel Hormomya mutabilis [3]. This mussel species developed remarkable morphological adaptations to cope with a wave-exposed environment, including a ventrally flattened, particularly resistant shell and stronger byssal attachment to the substrate compared to other subtidal and lower-intertidal sympatric mussel species [3-5]. Furthermore, M. virgata appears to be much more resistant to high temperature stress and air exposure than M. edulis, another invasive mussel species which preferentially occupies the lower part of the intertidal zone in the same geographical region [2]. Mature individuals are usually 45 mm long and live for approximately 4–5 years, although exceptional cases of 65 mm long specimen surviving up to 12 years have been recorded. Sexual maturity is reached after 12 months and, although fertility is maintained throughout the entire year, spawning events follow a bimodal pattern (the first one occurring in February–March, the second one in September–December) [6]. From a taxonomical point of view, M. virgata has long been considered as a member of the Septifer genus (and therefore named S. virgatus) and placed within the subfamily Septiferinae [7]. However, this clade was later found to be polyphyletic and, based on the revised hierarchical classification of NCBI Taxonomy, M. virgata, along with and most of the other species previously paced within this subfamily, has been moved to Mytilinae (Rafinesque, 1815). However, the current taxonomical classification still does not appear to accurately reflect the evolutionary history of these mussels. Indeed, more than a decade ago, molecular studies based on Cytochrome c oxidase subunit I (COI) first pointed out that M. virgata and the morphologically similar Septifer excisus (Wiegmann, 1837) pertained to two different clades within the order Mytiloida [8]. This observation is strongly supported by a recent study by Trovant and colleagues, which reported that COI and 18S/28S–based phylogeny identified both M. virgata and Mytilisepta bifurcata (Conrad, 1837) as members of the same clade, together with Perumytilus purpuratus (Lamarck, 1819) and Brachidontes rostratus (Dunker, 1857) (both pertaining to the Brachidontinae family), but distantly related to Septifer bilocularis [9]. Based on these results, the authors further suggested that Mytilisepta should be retained as a separate genus within Brachidontinae and that the septum structure involved in the insertion of the adductor muscle and used for the current morphological classification of different species within the Septifer genus is the product of homoplasy. Apart from its disputed taxonomical placement, very limited scientific attention has been so far focused on M. virgata, with only a handful of studies dealing with its morphological adaptations [5], embryonic development [10], reproductive cycle [6] and population dynamics [2]. To date, only 31 nucleotide and 11 amino acid sequences have been deposited in public repositories for this species (data retrieved from NCBI, June 2017). These include several partial sequences used for phylogenetic analyses [9, 11, 12], microsatellites [13], a complete mitochondrial sequence (KX094521.1) and a few unrelated sequences referring to still unpublished manuscripts, highlighting the still nearly non-existing molecular knowledge of M. virgata, in stark contrast with the extensive investigations carried out in Mytilus spp. since the early 2000’s [14-18]. The only reliable data available about the genome of M. virgata is an assessment of its size made by flow cytometry. The calculated genome c-value, 1.08, further confirmed by a very similar estimate for the closely related Mytilisepta keenae (1.06) [19], reveals that the purplish Japanese mussel genome is about 2/3 the size of those of Mytilus spp. and among the smallest known mytiloid genomes, but quite in line with the average genome size for all bivalves (Animal Genome Size Database, http://www.genomesize.com/). Due the narrow scientific interest posed so far on mussel species other than Mytilus spp., deep sea vent mussels (Bathymodiolus spp.) and Perna viridis, M. virgata represents an interesting alternative subject for the study of the evolution of Mytiloida and for the investigation of some peculiar gene family expansion events which specifically occurred in this lineage [20]. The main aim of the present work was to provide the first curated and publicly accessible genomic resource for this species. The de novo assembled and functionally annotated transcriptome, together with gene expression data from five different adult tissues, will serve as a sequence and gene expression database for future genetic and molecular studies. The data we present will provide a substantial contribution in the improvement of scientific studies in this marine mussel. While discussing the potential function of tissue-specific genes, we put a particular emphasis on expanded families encoding lectin-like proteins involved in carbohydrate recognition, a class of molecules with a great biotechnological potential which could find a practical application in many areas of biomedical and biological research [21, 22].

Methods

Collection of samples

The selected sampling site was a natural mussel bed found in a tide pool in Oshima, Saikai city (Nagasaki prefecture, Japan). Based on national and local fishing regulations, no permits were required for the collection of shellfish. Five adult mussels, whose shell size approximately ranged between 4 and 5 cm, were collected, dissected using razor blades and scissors, and tissues were immediately placed in RNAlater (Thermo Fisher Scientific, Waltham, USA). Namely, the following tissues were dissected and stored for subsequent RNA extraction: hemocytes (collected with a syringe), posterior adductor muscle, inner mantle, mantle rim, digestive gland, gills and foot (Fig. 1). The collected tissues were subsequently chopped into smaller parts weighting approximately 20 mg; for each of the sampled tissues, one of these slices of tissue was selected for each of the five specimens, placed in a vial containing TRIzol (Thermo Fisher Scientific, Waltham, USA) and homogenized. The total RNA, thereby representing a pool of five individual mussels, was extracted following the manufacturer’s instructions. The quality and quantity of the extracted RNAs were assessed with an Agilent 2100 Bioanalyzer (Agilent Technologies, Santa Clara, USA). Only samples whose RNA Integrity Number was > = 9 were selected for RNA sequencing. Unfortunately the quality of the RNA extracted from hemocytes and inner mantle was not sufficient to proceed with the preparation of Illumina sequencing libraries.
Fig. 1

Internal anatomy of an adult Mytilisepta virgata specimen with indication of the five main tissues selected for this study and geographical location of the selected sampling site in Saikai city, Nagasaki prefecture (Japan)

Internal anatomy of an adult Mytilisepta virgata specimen with indication of the five main tissues selected for this study and geographical location of the selected sampling site in Saikai city, Nagasaki prefecture (Japan)

Library preparation and sequencing of the samples

The preparation of cDNA libraries compatible with Illumina sequencing was carried out using the Lexogen SENSE mRNA-seq library prep kit v2 (Lexogen, Wien, Austria), aiming at an average fragment size of 400 nt. RNA-sequencing was performed at the DNA Sequencing Center of the Brigham Young University using a 2 × 125 bp paired-end strategy on a single lane of an Illumina HiSeq 2500 in high output mode with v4 reagents.

De novo assembly, quality assessment and annotation

Raw reads were demultiplexed, imported in the CLC Genomics Workbench v.9.5 environment (Qiagen, Hilden, Germany) and trimmed according to base-calling quality scores and presence of residual sequencing adaptors. Resulting reads shorter than 75 bp were discarded. Trimmed reads were used as an input for Trinity v.2.3.2 [23] on the Galaxy platform [24] with default parameters. The minimum contig length was set at 200 bp. As suggested by previous studies [25, 26], in order to remove unreliable sequences likely originated from excessive fragmentation of longer mRNA molecules or residual contamination from exogenous RNA, contigs displaying a very low sequencing coverage were removed; this procedure was based on a threshold of TPM = 1, calculated with the back-mapping of all trimmed reads on the full assembled transcriptome using the software Kallisto v0.43.0 [27]. All the contigs that did not reach this threshold were discarded. Contigs originated from the assembly of mitochondrial and ribosomal RNAs were identified by BLASTn [28] using the sequences KX094521.1 (mitochondrial) and KJ453817.1 (ribosomal) as queries (e-value threshold 1E−10). The quality of the transcriptome was assessed with BUSCO v.2 [29] based on the set of conserved orthologous genes of the Metazoa lineage according to OrthoDB v.9 [30]. For annotation purposes, only the longest transcript per gene was selected and subjected to virtual protein translation with TransDecoder v.3.0.1 (http://transdecoder.github.io), using a minimum predicted protein length of 100 amino acids. All the sequences were annotated with the Trinotate pipeline v.3.0 (https://trinotate.github.io) and assigned Gene Ontology categories [31] based on positive BLASTp and BLASTx [28] matches against the UniProtKB/Swiss-Prot database (e-value threshold, 1E−5). The annotation of Pfam conserved protein domains [32] was based on the identification by Hmmer v.3.1b2 [33]. In specific cases where no annotation could be assigned based on these criteria, remote structural similarities with templates deposited in the Protein DataBank were inspected with HHpred [34].

Gene expression analysis

Trimmed reads for each of the five tissues were mapped on the annotated transcriptome with the RNA-seq mapping tool of the CLC Genomics Workbench v.9.5 (Qiagen, Hilden, Germany) using the following parameters: length fraction: 0.75; similarity fraction: 0.98; maximum number of matching contigs: 10; paired-end reads distance was automatically estimated. The number of matched reads were converted into Transcripts Per Million (TPM) expression values [35], a measure which ensures efficient data normalization and comparability both within and between samples. The obtained TPM values, representing the average expression levels of a pool of five biological replicates, were subjected to a statistical analysis of gene expression with a Kal’s Z-test [36]. In detail, the expression profile of a given tissue was compared with the other four to identify differentially expressed genes (DEGs). The thresholds of fold change and False Discovery Rate-corrected p-value were set at 2 and 0.05, respectively. All DEGs exceeding these thresholds in all the four comparisons were marked as “tissue-specific”. TPM expression were processed by log2 transformation for visualization purpose to generate scatter and volcano plots. A gene expression heat map was created by Euclidean distance-based hierarchical clustering (with average linkage) of a representative set of genes whose expression exceeded 3000 TPM in any of the five tissues taken into account in this study. Heat maps were also similarly generated for genes encoding lectins. Tissue-specific genes were subjected to a functional evaluation, assessed through hypergeometric tests on Gene Ontology and Pfam annotations [37], implemented in the CLC Genomics Workbench v.9.5. Significantly over-represented annotations were detected based on a p-value <0.01 and observed – expected value > = 5.

Validation by real-time PCR

The expression data obtained by in silico analyses as detailed in the section above was validated by Real-Time PCR on three individual adult mussel, collected and dissected as previously described. Extracted RNAs were used to synthetize cDNA with a qScript™ Flex cDNA Synthesis Kit, QuantaBio (Quanta BioSciences Inc., Gaithersburg, MD) following manufacturer’s instructions. Ten target genes (two for each tissue) were selected for validation among those displaying high tissue specificity in RNA-seq experiments. The primer sequences, designed with Primer3Plus (https://primer3plus.com/) to obtain amplicons of 100–150 nt size, are reported in Table 1. In addition, we selected two stable housekeeping genes, the elongation factor 1 alpha and the ribosomal protein S2 (the former from literature data [38, 39], the latter due to its constant TPM values), to normalize gene expression data among samples.
Table 1

Primers used for real-time PCR validation

target geneforward primer (5′ - > 3′)reverse primer(5′ - > 3′)
ChitotriosidaseTACTGCGATTGGCCATACAATGCCTGTGTAGTTCGAGGTG
Meprin AGCTTGGGTACATCGACCACTCTGCTTTGGTTCCAGTGTCA
ParamyosinCCGAACTCGCAGAAAAGAACCGTAGAGCTTCCTCGGTGTC
Myosin, striated muscleCTCTTGTTGCCCCAGGATTACTGGTAGCTCCACCAGCTTC
Acetylcholine receptorGTCAAAGTCGGCCACTCACTGTCTGACCGTCGTTGGTTCT
Glycine-rich secreted proteinCACACGGTCTTACTGGAGCAGTTCGCCTTGTTCACCTTGT
Valine-rich secreted proteinAAAGTGCCATTCGAGACACCGGGCTGGGAACTCTGAATTT
SCP domain containign proteinTCAAACACTGCGTCAAGACCGGATCCACGTTTTCTCTTGC
Serine protease inhibitorAGGCCAACTGCAAAAACGCCGTCAACTCCACACACG
Tyrosinase-like protein 2CAGAGCCCTACCTCCAGATGTGACTGCTCGCTTTGTATGG
EF1alphaCTCTTCGTCTCCCACTCCAGACCAGGGAGAGCTTCAGTCA
Ribosomal protein S2GCCATTGCCAATACCTATGCGCCTGGTTGACGAGTATGGT
Primers used for real-time PCR validation In detail, PCR reactions were prepared as follows: 5 μL of SsoAdvanced SYBR Green Supermix (Bio-Rad, Hercules, CA), 0.2 μL of the 10 μM forward and reverse primers, 1 μL of 1:20 diluted cDNA and water to reach a final reaction volume of 10 μL. Polymerase chain reaction (PCR) amplifications were carried out on a Real-Time C1000-CFX96 platform (Bio-rad), using the following thermal profile: 95° for 30 s, by 40 cycles at 95° (10 s) and 60° (20 s). The absence of non-specific amplification products was assessed with a melting curve analysis (65° to 95°). Gene expression values for the target genes were calculated using the delta Ct method and normalized on the average expression level of the two housekeeping genes. Results are shown as the average plus standard deviation of three technical replicates.

Phylogenomic analysis

The phylogenomic analysis was carried out based on a set of 445 single-copy orthologous genes present in the transcriptomes of M. virgata and 8 other mytiloid bivalve mollusk species, using the same strategy previously used by Biscotti and colleagues [25]. Namely, the species selected for this purpose were: Bathymodiolus azoricus, Bathymodiolus manusensis, Bathymodiolus platifrons, Geukensia demissa, Lithophaga lithophaga, Modiolus kurilensis, Modiolus modiolus, Modiolus philippinarum, Mytilus californianus, Mytilus coruscus, Mytilus galloprovincialis (as a representative of the M. edulis species complex), Perna viridis and Perumytilus purpuratus. The full set of the proteins encoded by the recently published genomes of B. platifrons and M. philippinarum were recovered. For all the other species, publicly available raw sequencing data was downloaded from the NCBI SRA database, imported in the CLC Genomics Workbench 9, trimmed based on quality as described above and assembled with the de novo assembly tool, setting both the word size and bubble size parameters to “automatic”. Coding sequences (CDSs) were then predicted with TransDecoder v.3.0.1 (http://transdecoder.github.io), as described above for M. virgata. Sequence data from the genome of the Pacific oyster Crassostrea gigas v.9 [40] were also included to provide a reliable outgroup species for the analysis, based on the recent identification of Ostreoida as a sister group to Mytiloida [41]. Following this procedure, reciprocal BLASTp searches were performed, between the target species, M. virgata and C. gigas (the outgroup), using an e-value threshold of 1 × 10−10; taking into account only the best BLAST hit and discarding all the sequences lacking significant homology in any of the comparisons (either because of an e-value lower than the threshold or because of sequence identity <50%). This finally allowed the identification of a set of conserved orthologous sequences, which were aligned with MUSCLE [42] and further processed with Gblocks v.0.91b to remove highly divergent regions or fragments missing in one or more species due to the incompleteness of transcriptome data [43]. The resulting trimmed alignments were then concatenated and used as an input for a ProtTest v.3.4 analysis [44] in order to identify the best-fitting model of molecular evolution based on the corrected Akaike Information Criterion [45]. The concatenated multiple sequence alignment file, consisting of 247 orthologous proteins unambiguously detected in all the species taken into account, was subjected to Bayesian phylogenetic inference analysis with MrBayes v.3.2 [46], based on the LG model of molecular evolution, with a Gamma-shaped distribution of rates across sites, a proportion of invariable sites and fixed (empirical) priors on state frequencies (LG + G + I + F), identified by ProtTest as the best-fitting model. Phylogenetic inference was carried out with two independent analyses run in parallel. The convergence of the parameters generated by the two MCMC analyses was evaluated with Tracer v.1.6 (http://beast.bio.ed.ac.uk/Tracer). The analysis was stopped when the Effective Sample Size of each estimated parameter reached a value higher than 200, without considering the initial 25% of the generated trees (removed due to the burnin process). Sampled trees were used to calculate a consensus phylogenetic tree, where poorly supported nodes (those displaying posterior probability <50%) were collapsed.

Results and discussion

The high quality transcriptome of Mytilisepta virgata

Following the application of quality filters, the de novo transcriptome assembly of M. virgata comprised 49,501 contigs with an average length of 679 nucleotides (Table 2). This number, apparently rather high if compared to the number of annotated genes in bivalve genomes (e.g. ~26,000 in C. gigas and ~33,000 in P. fucata) [40, 47], is possibly justified by at least three factors: (i) the poor knowledge of bivalve non-coding RNAs, that are abundant in mussels [14], including in M. virgata, where they account for 71% of the assembled contigs; (ii) the partial fragmentation of mRNAs in smaller contigs; (iii) the high genomic complexity and heterozygosity of the genome of mytiloids, previously pointed out in Mytilus spp. [16].
Table 2

Sequencing, de novo assembly and annotation statistics

Digestive gland trimmed reads48,106,544
Posterior adductor muscle trimmed reads46,456,019
Gills trimmed reads49,760,348
Mantle rim trimmed reads53,492,730
Foot trimmed reads86,035,069
Number of contigs49,501
Longest contigs9778 nt
Average contig length679 nt
N501046
Contigs longer than 5Kb68
Complete BUSCOs66%
Fragmented BUSCOs10%
Absent BUSCOs24%
Predicted protein-coding contigs (>100 aa)29%
GO cellular component annotation rate19%
GO molecular function annotation rate17%
GO biological process annotation rate17%
PFAM annotation rate23%
Mytiloid innovations (protein coding)2%
Sequencing, de novo assembly and annotation statistics N50, the metric most commonly used to assess the quality of a de novo assembly [48] was 1046, in line with those obtained in the de novo assembly of other species of the order Mytiloida using Illumina sequencing technologies [14, 15]. While mean contig length (679 nt) and N50 metrics can only provide a measure of the effectiveness of the de novo assembly pipeline, the completeness of a transcriptome can be only theoretically estimated through the comparison with its reference genome. However, a similar estimate can be performed also when a reference genome is not available, using a set of highly conserved single-copy orthologous sequences. These genes, expected to be found across all the genomes within a target taxa, can be defined as “Benchmarking Universal Single-Copy Orthologs” (BUSCOs) [29]. This analysis revealed that ~66% of the conserved metazoan orthologous genes was present and complete in the assembled transcriptome of M. virgata, whereas only ~10% were present but fragmented, and ~24% were absent (Fig. 2). This result highlights a slightly higher level of completeness of the M. virgata transcriptome compared to other mussel transcriptomes obtained with Illumina sequencing technologies from single tissues [14, 49], much higher than those obtained with more error-prone or lower throughput technologies (454 Life Sciences and Sanger sequencing) [18, 50, 51] and just slightly lower than bivalve genome-scale protein prediction in oysters [40, 47]. This supports our choice of combining RNA-seq from multiple tissues and different specimens to obtain a representative nearly-complete transcriptome. The residual incompleteness of the de novo assembled transcriptome, evidenced by the missing BUSCOs, can be explained by the fact that some genes were not expressed at all in the five tissues taken into account. Specifically, as no transcriptome from larval stages was sequenced in this study, genes related to embryonic development might have been entirely missing. At the same time, since inner mantle (invaded by gonads during the spawning season) and hemocytes could not be analyzed due to insufficient RNA quality, a number of genes displaying high specificity of expression in these two highly specialized tissues are likely to be absent in the assembled transcriptome. Finally, tightly regulated genes whose expression is turned on in response to particular stimuli are also expected to be absent due to naïve condition of the mussel specimens collected.
Fig. 2

Comparison of transcriptome completeness, estimated with Busco v.2, among publicly available de novo assembled transcriptomes from Mytiloida and gene model predictions from the fully sequenced genomes of Crassostrea gigas and Pinctada fucata

Comparison of transcriptome completeness, estimated with Busco v.2, among publicly available de novo assembled transcriptomes from Mytiloida and gene model predictions from the fully sequenced genomes of Crassostrea gigas and Pinctada fucata Overall, poor expression can also lead to contig fragmentation, together with other factors such as transcriptome complexity, heterozygosity and inter-individual variability. Taking into account that mussels pertaining to the genus Mytilus display an astounding level of heterozygosity and are subject to significant genetic introgression [52], the relatively low rate of fragmented/complete BUSCOs (0.15) was somewhat unexpected. Since no information is currently available concerning the genetic diversity of M. virgata populations, these results possibly suggest that the mean level of heterozygosity in this species is significantly lower than that found in Mytilus spp., which would be consistent with a smaller effective population size. The annotation rate by Trinotate was quite low, as only about 25% of the assembled contigs could be associated to a Gene Ontology term or to a Pfam domain. This observation, which is in line with previous results obtained from transcriptome studies in other mytiloids [14], is linked to at least three different factors: (i) partial fragmentation of contigs, as evidenced by BUSCO; (ii) high prevalence of taxonomically restricted genes; (iii) high prevalence of non-coding transcripts. In detail, the presence of gene families restricted to Mytiloida has been previously described [53, 54] and, while this topic should be better investigated once the first complete mussel genome will become available, a brief analysis revealed that ~2% (1078) of the assembled sequences pertained to gene families exclusively found in mytiloids (as evidenced by the lack of BLAST homology with non-mytiloid bivalve genomes and transcriptomes, as opposed to significant homology against the publicly available genomes and transcriptomes of other mytiloids). At the same time, the universe of non-coding RNAs in invertebrates remains to be fully investigated. While genome annotation pipelines, in most cases, currently disregard non-coding genes in newly assembled invertebrate genomes due to lack of homology and experimental evidence, the number of non-coding genes annotated in the genome of the model organism Caenorhabditis elegans has recently surpassed that of coding genes, starting to unveil the important role these RNAs cover in the biology of protostomes. Consistently with this observation, only 29% of the M. virgata contigs were found to contain an ORF longer than 100 codons, suggesting that a high fraction of non-coding sequences was indeed the primary reason of the low annotation rate. Complete annotation data and gene expression levels are provided in Additional file 1.

Phylogenetic relationship with other mytiloids

The phylogenetic position of M. virgata and the classification of this species within the Septifer (Récluz, 1848) or the Mytilisepta (Habe, 1951) genus have been a matter of debate. Currently, although Septifer virgatus and Mytilisepta virgata are considered as synonyms by the World Register of Marine Species and by MolluscaBase [55], only the latter is marked as an accepted species name. Based on the same taxonomy reference databases, the genus Mytilisepta currently includes two other species, M. bifurcata (Conrad, 1837) and M. keenae (Nomura, 1936). On the other hand, nine different species are currently recognized as pertaining to the Septifer genus by this taxonomical authority. However, the Mytilisepta genus is currently not accepted by the NCBI taxonomy authority and M. virgata is therefore still listed as S. virgatus and further placed within Mytilinae subfamily. The traditional classification of both Mytilisepta and Septifer species as members of the same genus is strictly based on morphological features, namely the presence of a septum which is involved in the attachment of the adductor muscle to the shell. However, recent molecular phylogenetic studies based on the analysis of 18S/28S nucleotide sequence have strongly suggested that the presence of this structure in the two mussel genera is the result of convergent evolution. Indeed, M. virgata and S. bilocularis pertain to highly divergent clades, with the former being related to mussel species pertaining to the Brachidontinae subfamily [9]. Taking advantage from the large amount of sequence data generated in the present study and transcriptomic dataset publicly available for other mytiloid species, we inferred the phylogenetic placement of M. virgata within the order Mytiloida by the use of Bayesian methods. Our phylogenomics approach, which took into account a set of 247 single-copy orthologous genes detected in all the studied species, is expected to provide a highly reliable estimate of the evolutionary relationship of M. virgata with other mussel species, as recently demonstrated by a series of phylogenomics studies which have helped to discern the phylogenetic relationship of Mollusca and Bivalvia [41, 56–58]. Before proceeding to the discussion of the placement of M. virgata, it is worth to briefly remind that the classical classification of mytiloids, currently used by NCBI Taxonomy, comprises seven subfamilies: (i) Bathymodiolinae, giant deep see mussels associated with hydrothermal vents; (ii) Brachidontinae (Nordsieck, 1969), small-sized “scorched mussels” adapted to life in the mid-intertidal zone; (iii) Crenellinae (Gray 1840), small-size byssal nets-creating benthic species commonly found in sandy and muddy seabeds; (iv) Lithophaginae (H. Adams & A. Adams, 1857), rock- or coral-boring mussels with a cylindrical, elongated shape; (v) Modiolinae (G. Termier & H. Termier, 1950), presenting a rounded outline and subterminal umbones, which usually live buried in the soft sediments of the subtidal zone; (vi) Mytilinae (Rafinesque, 1815), usually presenting a triangular shape and terminal umbones and commonly creating dense mussel beds in the intertidal zone of rocky shores; (vii) Septiferinae, previously including also M. virgata and other morphologically similar mussel species with am arcuate and convex anterior margin of the shell. The phylogenetic tree obtained was highly supported in all its nodes by posterior probability values higher than 99%, indicating the nearly certain and unambiguous inference of the most likely evolutionary scenario which led to the radiation of mussels (Fig. 3). Remarkably, M. virgata was found to be closely related to P. purpuratus, supporting the results previously published by Trovant and colleagues based on ribosomal RNA sequence data [9], and clearly placed within the Brachidontinae (Nordsieck, 1969) subfamily, together with the ribbed horsemussel G. demissa. While no –omic scale sequence data is available for other species currently classified in the Septifer genus, molecular phylogeny produced by other authors suggest that S. excisus, S. bilocularis and other congeneric species are distantly related to M. virgata and do not belong to Brachidontinae, being more closely related to Mytilus (Linnaeus, 1758) and other species of the subfamily Mytilinae.
Fig. 3

Bayesian phylogeny of Mytiloida. The classical taxonomical classification of mytiloids, according to NCBI Taxonomy, is shown on the right side of the figure. No species pertaining to the subfamilies Crenellinae and Septiferinae could be included due to the absence of available sequence data. Note the placement of M. virgata within the Brachidontinae subfamily. The number indicated on the nodes of the tree indicate posterior probability support values

Bayesian phylogeny of Mytiloida. The classical taxonomical classification of mytiloids, according to NCBI Taxonomy, is shown on the right side of the figure. No species pertaining to the subfamilies Crenellinae and Septiferinae could be included due to the absence of available sequence data. Note the placement of M. virgata within the Brachidontinae subfamily. The number indicated on the nodes of the tree indicate posterior probability support values Based on these results, the current official naming of M. virgata as S. virgatus at the NCBI taxonomy database does not appear to be appropriate, and it should be updated following the official nomenclature already adopted by WoRMS and MolluscaBase. Moreover, Mytilisepta appears to be clearly pertaining to the Brachidontinae subfamily, which at the present time only lists the Brachidontes (Swainson, 1840), Geukensia (Van der Poel, 1956), Ischadium (jules-Brown, 1905) and Perumytilus (Olsson, 1961) genera in the NCBI taxonomy database.

Overview of gene expression profiles

As an overall trend, the five analyzed tissues displayed widely diverse gene expression profiles, characterized by a broad prevalence of tissue-specific genes (Fig. 4). Cumulative expression plots revealed that one of the major differences among tissues can be attributed to the high energetic investment in the synthesis of a very few mRNA species by some tissues (e.g. foot) compared to a more even distribution of the transcriptional efforts by other tissues (e.g. gills). This factor can be efficiently estimated by the relative contribution to global transcriptional activity of the 100 most highly expressed genes in any given sample. We define the reciprocal of this value as Transcriptomic Diversity Index (TDI). Therefore, a high TDI indicates the expression of a broad range of transcripts, whereas a low TDI points out a low diversity in the mRNA population. The TDI values calculated for the Mytilisepta tissues were, from the highest to the lowest: 3.57 (gills), 3.03 (mantle rim), 2.63 (digestive gland), 2.08 (posterior adductor muscle) and 1.59 (foot) (Fig. 4, panel a). As a further confirmation of the remarkable functional specialization of bivalve tissues, just a relatively low number of genes (235), mostly encoding fundamental housekeeping genes, were found to be expressed at relatively high levels (TPM > 100) in all the five tissues analyzed (Fig. 4, panel b).
Fig. 4

Panel (a) cumulative gene expression of the 1000 most expressed genes per tissue. The gene expression level on the Y axis is measured as TPM. Panel (b) Venn diagram displaying the overlap between highly expressed transcripts (TPM > 100) among the five tissues subjected to RNA-seq

Panel (a) cumulative gene expression of the 1000 most expressed genes per tissue. The gene expression level on the Y axis is measured as TPM. Panel (b) Venn diagram displaying the overlap between highly expressed transcripts (TPM > 100) among the five tissues subjected to RNA-seq The functional differentiation of the five mussel tissues appears to be evident from the hierarchical clustering of genes based on their pattern of expression (Fig. 5, panel a). The digestive gland and the foot, in particular, are characterized by two clusters of genes with high tissue specificity, nearly undetectable outside the main site of production. At a first sight, the hierarchical clustering analysis does not permit to clearly identify equally important differences among mantle rim, gills and posterior adductor muscle. However, while these differences are not as evident as those observed for foot and digestive gland, they might influence in a highly significant manner the function of a tissue (see the following sections).
Fig. 5

Panel (a) Euclidean distance-based hierarchical clustering, calculated based on average linkage, of the Mytilisepta virgata transcripts reaching an expression level higher than 3000 TPM in at least on out of the five tissues analyzed. Panel (b) Gene expression scatter plot comparing the expression profiles of digestive and foot in Mytilisepta virgata. The log2 normalized TPM gene expression levels are reported on the X and Y axes. Differentially expressed genes are marked in red

Panel (a) Euclidean distance-based hierarchical clustering, calculated based on average linkage, of the Mytilisepta virgata transcripts reaching an expression level higher than 3000 TPM in at least on out of the five tissues analyzed. Panel (b) Gene expression scatter plot comparing the expression profiles of digestive and foot in Mytilisepta virgata. The log2 normalized TPM gene expression levels are reported on the X and Y axes. Differentially expressed genes are marked in red

Digestive gland transcriptional profile

The digestive gland is the main tissue involved in the digestion of food particles, in the conjugation of nutrients to carriers for their transportation through circulation, in the metabolism of xenobiotics and in the excretion process. In addition, it also covers an important role in the long-term storage of nutrient reserves to be used during gametogenesis or long-lasting stress periods. Two main cell types are responsible for the fundamental functions of this organ, i.e. digestive cells and basophil secretory cells. These cells are located in the blind end of digestive tubules branching from the main digestive ducts. Here, highly abundant columnar digestive cells adsorb food particles by pinocytosis and proceed to their digestion upon the fusion of endocytotic phagosomes with lysosomes. On the other hand, pyramid-shaped secretory cells are thought to be mainly involved in the release of enzymes involved in the extracellular breakdown of major food particles [59]. Mussels feed on a wide variety of food particles found in the water column, including phytoplankton, micro-zooplankton, bacteria and detritus, whose abundance largely varies both seasonally and geographically. For this reason, mussels display limited food preference, mostly based on size selection [60], and they actively produce of a broad range of digestive enzymes required for the assimilation of diverse nutrients, i.e. fat, carbohydrates and proteins found in different food sources. Coherently with these observations, the expression profile of the M. virgata digestive gland was typical of a highly specialized tissue, displaying tissue-specificity for 2043 genes, as evidenced by the expression scatter plots (Fig. 5, panel b) and a low TDI (2.63). The transcriptional profile was dominated, as expected, by digestive enzymes. The gene set enrichment analysis indeed clearly highlighted that a major energetic effort is spent in the synthesis of enzymes involved in the processing of the two most abundant biomasses in nature, cellulose and chitin [61], i.e. cellulases (identified as members of the glycosyl hydrolase families 9 and 10) and chitinases (pertaining to the glycosyl hydrolases family 16) (Table 3). The overwhelming presence of these classes of enzymes among the most highly expressed genes is probably ascribable to the wide availability of micro-zooplankton and microalgae as potential food sources in the oceans.
Table 3

The 25 most highly expressed genes in Mytilisepta virgata digestive gland and representative significantly over-represented annotations by gene set enrichment analysis

Most expressed genesOver-represented annotations
AnnotationTPMTermClass p-value
Vdg343,813.51Extracellular spaceCC0.00
Probable apolipoprotein14,067.11ShK domain-likePFAM0.00
Probable apolipoprotein12,230.17EpendyminPFAM0.00
Alpha tubulin11,375.80Lectin C-type domainPFAM1.11E-16
Elongation factor 1 alpha11,074.06C1q domainPFAM2.22E-16
Ependymin-related protein11,054.83von Willebrand factor type C domainPFAM1.07E-14
Ependymin-related protein8275.80Cellulose catabolic processBP3.83E-12
Vdg38084.66LysosomeCC7.27E-12
Chitotriosidase7926.27Cellulase activityMF4.21E-11
Unknown6439.28Glycosyl hydrolase family 9PFAM8.40E-11
Probable apolipoprotein6230.41alpha/beta hydrolase foldPFAM2.00E-10
Cytoplasmic actin6103.04Carboxylesterase familyPFAM3.77E-10
Calcium binding protein5887.19TrypsinPFAM6.07E-10
Meprin A5695.11Glycosyl hydrolases family 16PFAM8.76E-9
60S ribosomal protein P05364.73Serine-type endopeptidase activityMF1.34E-8
Ependymin-related protein5342.49MAM domain, meprin/A5/muPFAM2.76E-8
Urokinase-like protein5283.92Carbohydrate metabolic processBP5.91E-8
Ganglioside GM2 activator5163.45Lipid transporter activityMF8.20E-8
Putative metalloprotease4892.49Antistasin familyPFAM1.36E-7
Alpha tubulin4698.32SulfatasePFAM1.17E-6
Spondin-like protein4503.80Glycosyl hydrolase family 10PFAM5.83E-6
40S ribosomal protein SA4288.17Peptide metabolic processBP4.26E-6
C1q domain-containing protein4279.26LipasePFAM8.19E-6
Ependymin-related protein3725.69Prolyl oligopeptidase familyPFAM8.39E-5
Calcium binding protein3528.24Xenobiotic metabolic processBP4.56E-4

See the complete list in Additional file 4

TPM Transcript Per Million, PFAM Pfam conserved domains, BP Gene Ontology Biological Process, MF Gene Ontology Molecular Function, CC Gene Ontology Cellular Component

The 25 most highly expressed genes in Mytilisepta virgata digestive gland and representative significantly over-represented annotations by gene set enrichment analysis See the complete list in Additional file 4 TPM Transcript Per Million, PFAM Pfam conserved domains, BP Gene Ontology Biological Process, MF Gene Ontology Molecular Function, CC Gene Ontology Cellular Component At the same time, fat-digesting enzymes (lipases, in particular) and proteases were also found to be expressed at highly significant levels. The latter include trypsin-, meprin- and papain-like proteinases, cathepsins, several serine-type endopeptidases and a large family of putative metalloproteases characterized by the ShK domain [62]. The high expression of protease inhibitors, such as antistasin, targeting trypsin [63], and Kazal-type serine-protease inhibitors [64], is also in line with the strong production of the aforementioned proteolytic enzymes, whose functional dysregulation could lead to serious cellular and tissue damage. Altogether, while some of these annotations are likely linked to extracellular digestive processes, others clearly point towards lysosomes, which are found in high numbers in vacuolated digestive cells. Among these, sulfatases and of ganglioside GM2 activator, highly expressed in the digestive gland, are probably the most relevant. The former is involved in the lysosomal cleavage of sulfated carbohydrates; the latter is a cofactor for the lysosomal digestion of gangliosides. As a result of its pronounced endocytotic activity, the digestive gland is also a main site of bioaccumulation for pollutants, heavy metals and biotoxins [65, 66], which are processed by detoxifying enzymes. The specialization of the digestive gland in this activity is indeed evidenced by the overrepresentation of oxidoreductases and carboxylesterases, phase I detoxifying enzymes which introduce polar groups in xenobiotics to increase their solubility and promote their excretion. Furthermore, consistently with the function homologous to that covered by liver in vertebrates, a few highly expressed genes in the digestive gland of Mytilisepta encode apolipoproteins, devoted to the binding and transportation of lipids in the circulatory system. In particular, the second and the third most highly expressed genes (Table 3), despite lacking detectable primary sequence similarity, display a remarkable predicted structural similarity with apolipoproteins, identified by HHpred as detailed in the materials and methods section. Similarly to what has been previously reported in M. galloprovincialis [14, 67], the most highly expressed gene in the digestive gland of Japanese mussels was vdg3, a developmental marker of the maturation of this organ in juveniles [68], whose precise biological function still remains to be fully elucidated. Vdg3, which is probably encoded by a few highly similar paralogous genes (like in the Mediterranean mussel [69]), accounts for more than 4% of the total transcriptional activity of the M. virgata digestive gland. In addition, it is worth of note that the digestive gland, besides being involved in the secretion of enzymes for extracellular digestion, also releases in the extracellular environment a large number of proteins with potential carbohydrate binding properties, most notably C-type lectins and C1q domain-containing (C1qDC) proteins (Table 3; see the following sections for a detailed overview on these gene families).

Foot transcriptional profile

The foot is an important tissue both in the larval and in the adult phases of mussel life. Although the foot has progressively lost its ancestral function as the main locomotory organ along bivalve evolution, it still retains a limited role in the movement of pediveliger larvae and juvenile mussels. In adult specimens, the foot is the main responsible of the attachment to suitable substrates through the secretion of byssal threads, robust and elastic fibers produced by a specialized secretory glands, which have attracted a considerable interest due to their potential biotechnological applications [70]. The gene expression profile (TDI = 1.59, see Fig. 4 panel a) pointed out that the foot is by far the most specialized tissue among those subjected to RNA-seq and the one producing the lowest variety of transcripts. As evidenced by gene set enrichment analyses, most of the 769 foot-specific transcripts identified in M. virgata were indeed clearly associated with the production of byssus. In particular, ~15% of the total transcriptional activity was used for the production of collagens and ~13.5% for the production of byssal cuticle proteins (Table 4). Precollagens are the major constituents of the fibrous core of byssal threads, contribute to their self-assembly and account for more than 70% of their mass [71, 72]. On the other hand, the astounding resistance of these threads is provided by a stiff coating by proteins rich in modified amino acids, such as trans-4-hydroxyproline, trans-2,3-cis-3,4-dihydroxyproline, and L-3,4-dihydroxyphenylalanine (L-DOPA), which form a 2–5 μm thick highly resistant cuticle [73]. We found a significant sequence similarity between the byssal cuticle proteins of M. virgata, those of the deep sea vent mussel Bathymodiolus thermophilus, and an adhesive polyphenolic foot protein of Septifer bifurcatus. These low-complexity proteins in fact consist of a high number of consecutive repeats of conserved 17-mers, which present, in all the three species, a well detectable Y-X-X-X-Y-K motif required for adhesion [74] (Additional file 2). Besides these two classes of sequences, the gene expression profile of the foot was dominated by other low-complexity proteins, including YGH-rich proteins similar to those previously identified in Mytilus coruscus [75], and proteins containing EGF-like domains, which characterize cross-linking mussel adhesive plaque proteins [76].
Table 4

The 25 most highly expressed genes in Mytilisepta virgata foot and representative significantly over-represented annotations by gene set enrichment analysis

Most expressed genesOver-represented annotations
AnnotationTPMTermClass p-value
Byssal cuticle protein45,262.97Collagen triple helix repeat (20 copies)PFAM0.00
Byssal cuticle protein43,379.14Serine-type endopeptidase inhibitor activityMF0.00
Collagen-like protein42,699.96Negative regulation of coagulationBP1.11E-16
Byssal cuticle protein34,293.81Common central domain of tyrosinasePFAM3.33E-16
Collagen-like protein33,886.43Extracellular regionCC3.33E-16
Serine protease inhibitor30,056.14Negative regulation of endopeptidase activityBP6.77E-15
Low complexity protein26,250.15Animal haem peroxidasePFAM6.77E-14
Collagen-like protein17,017.56Kazal-type serine protease inhibitor domainPFAM4.35E-12
YGH-rich protein-216,065.78Peroxidase activityMF5.69E-12
Collagen-like protein15,351.63Proteinaceous extracellular matrixCC6.47E-12
Collagen-like protein13,389.57Tissue inhibitor of metalloproteinasePFAM6.83E-10
Byssal cuticle protein12,178.71Collagen trimerCC2.14E-8
Collagen-like protein10,965.75Response to oxidative stressBP5.45E-8
Elongation factor 1 alpha10,722.32Regulation of keratinocyte differentiationBP8.93E-8
Collagen-like protein10,461.88Hydrogen peroxide catabolic processBP1.05E-7
YGH-rich protein-110,019.89Transforming growth factor beta bindingMF1.63E-7
Precollagen NG9810.75Metal ion bindingMF8.77E-7
YGH-rich protein-18119.26Extracellular spaceCC9.20E-7
Serine protease inhibitor7406.30Heme bindingMF2.55E-6
Serine protease inhibitor7362.32Oxidoreductase activityMF6.45E-6
Collagen-like protein7349.68WAP-type ‘four-disulfide core’PFAM1.01E-5
Collagen-like protein7307.02Quinone bindingMF1.26E-5
Unknown7104.95Human growth factor-like EGFPFAM1.78E-5
Kazal-type serine proteinase inhibitor7095.67EGF-like domainPFAM6.84E-5
Low complexity protein6445.09Kunitz/Bovine pancreatic trypsin inhibitor domainPFAM7.62E-5

See the complete list in Additional file 4

TPM Transcript Per Million, PFAM Pfam conserved domains, BP Gene Ontology Biological Process, MF Gene Ontology Molecular Function, CC Gene Ontology Cellular Component

The 25 most highly expressed genes in Mytilisepta virgata foot and representative significantly over-represented annotations by gene set enrichment analysis See the complete list in Additional file 4 TPM Transcript Per Million, PFAM Pfam conserved domains, BP Gene Ontology Biological Process, MF Gene Ontology Molecular Function, CC Gene Ontology Cellular Component Interestingly, tyrosinases were the most important class of enzymes expressed in this tissue: 13 out of the 18 tyrosinase sequences found in the purplish mussel transcriptome were indeed specifically expressed in the foot. This data is in agreement with the importance of this enzymatic activity in the conversion of tyrosine residues to L-DOPA and, consequently, to dopaquinone in Tyr-rich byssal cuticle proteins [77, 78]. At the same time, the high observed activity of antioxidant enzymes (peroxidases in particular) is explained by the need of maintaining a redox balance to allow the modification of L-DOPA-rich adhesion proteins [79, 80]. Consistently with previous reports, many proteinase inhibitors were selectively expressed at high levels in the foot. Among these, the strong expression of serine-type endopeptidases and alpha-2 macroglobulins is certainly worth of a note. Moreover, the foot-specificity of six WAP-like proteins suggest that they may act as suppressors of elastase-type serine proteases, preventing laminin degradation [81]. These enzyme inhibitors may altogether constitute an efficient protecting system that prevents the degradation of byssal threads by extracellular proteases [75].

Gills transcriptional profile

In all non-protobranch bivalves, gills are large organs whose structure follows the curvature of the shell, dividing the mantle cavity between inhalant and exhalant chambers and providing a large surface area exposed to the incoming water flow. Their lamellar structure, strengthened by collagen, makes them well suited for both gas exchange and filter-feeding. Gills are highly vascularized by vessels which bring deoxygenated hemolymph to the filaments, where the gas exchange takes place, and subsequently recirculate oxygenated fluid to the whole body through an efferent vein. The feeding function on the other hand is carried out by specialized cilia that cover the entire branchial surface and convey food particles carried by inflowing water towards the labial palps. Overall, gills are very complex organs that combine functionally and structurally diverse cell types, including neuronal cells that depart from a visceral ganglion. Dissimilarly from other highly specialized tissues (i.e. foot and digestive gland), gills do not show the expression of a well-evident group of highly tissue-specific genes in the scatter plots (Additional file 3). As mentioned above, this is possibly linked to the diversity of cell types present in this complex organ, resulting in a partial overlap with the transcriptomes of the other four tissues analyzed. As a matter of fact, gills represent the richest out of the five tissues in terms of transcriptional diversity (TDI = 3.57) (Fig. 4, panel a). In any case, the number of genes whose expression was statistically significantly higher in this tissue compared to the others was rather high (2877). These genes were, for the most part, linked to the movement of motile cilia, the main players in the filter-feeding process, present at very high densities on the whole surface of the gills. In particular the expression of axonemal dyneins, that drive ciliary movement by regulating the relative sliding of microtubule doublets [82], was very prominent, as well as that of tektins, fundamental for ciliar assembly and functionality [83] (Table 5). As the function of dyneins require ATP, mitochondria-eating proteins, significantly overexpressed, may be used to improve mitochondrial function, enhancing energy production [84]. The strong production of several genes encoding neuronal acetylcholine receptor subunits [85] was also likely related to the coordinated movement of cilia. In this context, acetylcholine could lead to a strong enhancement of ciliary beat frequency and Shisa-like proteins, also over-represented, may act as regulators of its trafficking [86].
Table 5

The 25 most highly expressed genes in Mytilisepta virgata gills and representative significantly over-represented annotations by gene set enrichment analysis

Most expressed genesOver-represented annotations
AnnotationTPMTermClass p-value
Alpha tubulin55,938.25Dynein heavy chain and region D6 of dynein motorPFAM1.43E-13
40S ribosomal protein SA16,641.12MicrotubuleCC0.00
Elongation factor 1 alpha11,612.37Microtubule motor activityMF6.20E-14
Alpha tubulin8853.84Microtubule-based movementBP2.52E-10
Beta tubulin8710.17Motile ciliumCC3.06E-10
Actin8589.14Axoneme assemblyBP1.57E-9
Alpha tubulin8158.42Cilium movementBP2.22E-9
Low complexity glycine-rich protein7335.45Acetylcholine-activated cation-selective channel activityMF5.94E-8
Beta tubulin5932.28C1q domainPFAM3.23E-7
Actin5614.26ATP-binding dynein motor region D5PFAM1.32E-6
Beta tubulin5144.94Tektin familyPFAM2.15E-6
Beta tubulin5004.75Axonemal dynein complexCC2.63E-6
Actin4296.64Neurotransmitter-gated ion-channel ligand binding domainPFAM6.18E-6
60s ribosomal protein P04296.41Dynein complexCC1.06E-5
non coding RNA4137.29ATPase activityMF1.56E-5
Ferritin3614.02Shisa/Wnt and FGF inhibitory regulatorPFAM3.71E-5
Ribosomial Protein S103367.63Microtubule-binding stalk of dynein motorPFAM4.65E-5
Cyclophilin B3364.42Collagen trimerCC5.40E-5
Transcription factor ATF43351.91Immunoglobulin C1-set domainPFAM7.45E-5
Beta tubulin3296.13Fibrinogen beta and gamma chains, C-terminal globular domainPFAM1.44E-4
non coding RNA3201.67LITAF-like zinc ribbon domainPFAM1.60E-4
Cytochrome c oxidase polypeptide3059.53TIR domainPFAM2.28E-4
Outer dense fiber protein 33038.49Mitochondria-eating proteinPFAM2.42E-4
Ribosomal Protein L152921.58Neurotransmitter-gated ion-channel transmembrane regionPFAM6.71E-4
Ribosomal Protein S42769.43Cadherin domainPFAM7.75E-4

See the complete list in Additional file 4

TPM Transcript Per Million, PFAM Pfam conserved domains, BP Gene Ontology Biological Process, MF Gene Ontology Molecular Function, CC Gene Ontology Cellular Component

The 25 most highly expressed genes in Mytilisepta virgata gills and representative significantly over-represented annotations by gene set enrichment analysis See the complete list in Additional file 4 TPM Transcript Per Million, PFAM Pfam conserved domains, BP Gene Ontology Biological Process, MF Gene Ontology Molecular Function, CC Gene Ontology Cellular Component The high expression of a number of genes encoding extracellular matrix components is certainly connected with the lamellar organization of branchial filaments, which facilitates gaseous exchanges for the respiratory function. Collagens, cadherins and the over-represented “secreted acidic and rich in cysteine Ca binding region proteins”, corresponding to osteonectin-like glycoproteins, appear to be the most important players in the establishment of these complex morphological structures. No specific domain or annotation related to oxygen binding could be identified as unequivocally linked to gills, probably due to the fact that this function is carried out by circulating cells which are also found in other tissues. Immunity markers certainly represent one of the most noticeable characterizing gene expression signatures of gills. This may be correlated with the large surface of contact with the external environment and, consequently, with potentially pathogenic microorganisms. Both the gene families of C1qDC and FREPs (discussed in detail in the following sections), lectin-like proteins potentially involved in Pathogen Associated Molecular Patterns (PAMPs) recognition [87, 88], were over-represented in the gills transcriptome of the Japanese purplish mussel. At the same time, some TIR-domain containing proteins were also preferentially expressed in the gills. The TIR domain is found in a number of proteins linked to the detection of foreign ligands and to the ransduction of immune signals inside the cell. In detail, the expression of the cytosolic gene products MyD88, STING and ecTIR-DC families 6, 11 and 13 was particularly elevated in this tissue [89]. Moreover, a number of LITAF-like transcription factors, which may positively regulate the production of pro-inflammatory cytokines [90], were also selectively expressed in the gills. While these signatures could be, at least in part, linked to the presence of hemocytes carried by hemolymph vessels, these data suggest that the gills of M. virgata might play an important role as a first line of defense towards pathogens, specifically by releasing soluble factors in the mucus that covers most of the branchial surface.

Mantle rim transcriptional profile

In mussels, the mantle is a two-lobed tissue that covers the entire inner face of the shells and hosts gonadal development. During the non-reproductive season however, the mantle becomes a very thin and nearly-transparent tissue, whose main functions are the storage of nutrients and the deposition of the shell. The region close to the edges, external to the pallial line of attachment to the shell, is known as mantle rim; this modified part of the mantle tissue is darkly pigmented and possesses tentacles with sensorial function which can be extruded from the shell when the valves are open. Furthermore, the mantle rim is rich in muscle fibers and nerves that permit the retraction of tentacles upon chemical, physical or light stimuli. The portion of tissue sampled in the present experiment corresponds to the large inhalant syphon that regulates the flux of water to the mantle cavity. In mussels, this modified region of mantle is less specialized than other bivalves, as it does not form an exhalant tube like in burrowing bivalve species. The gene expression profile of the mantle rim only displayed little specialization, as evidenced by the expression scatter plots (Additional file 3). It was indeed characterized by a remarkable overlap with those of the other tissues and a relatively low number of genes (694) were specifically expressed in this tissue and just a very few out of these were expressed at very high levels (Table 6). Possibly owing to the combination of multiple cell types with widely diverse functions, the mantle rim produced a very broad range of transcripts and was only second to the gills for transcriptional diversity (TDI = 3.03, Fig. 4, panel a).
Table 6

The 25 most highly expressed genes in Mytilisepta virgata mantle rim and representative significantly over-represented annotations by gene set enrichment analysis

Most expressed genesOver-represented annotations
AnnotationTPMTermClass p-value
Actin40,034.02Proteinaceous extracellular matrixCC1.84E-7
Elongation factor 1 alpha17,159.08Extracellular regionCC1.87E-7
alpha tubulin16,259.80Chitin binding Peritrophin-A domainPFAM4.58E-7
60s ribosomal protein L2315,108.07MatrixinPFAM1.14E-6
Actin15,102.97Calcium ion bindingMF2.65E-6
40S ribosomal protein SA11,849.66WAP-type (Whey Acidic Protein) ‘four-disulfide core’PFAM3.65E-6
Actin11,356.99Extracellular matrix structural constituentMF3.66E-6
Alpha tubulin8547.04Immunoglobulin C1-set domainPFAM1.51E-5
Ferritin7899.68Serine-type endopeptidase inhibitor activityMF3.27E-5
Myosin7382.76Collagen catabolic processBP1.09E-4
low complexity protein - collagen-like6579.48Homeobox domainPFAM1.24E-4
60s ribosomal protein P05142.34CD80-like C2-set immunoglobulin domainPFAM1.42E-4
Beta tubulin4972.09Metalloendopeptidase activityMF1.58E-4
Beta tubulin4461.74Cell junctionCC8.89E-4
Superoxide Dismutase4220.54Cadherin domainPFAM9.51E-4
Actin4066.75Extracellular spaceCC1.77E-3
small heat shock protein 24.13933.60Immunoglobulin V-set domainPFAM3.06E-3
Paramyosin3898.32Basement membraneCC3.57E-3
non coding RNA?3796.60Immunoglobulin I-set domainPFAM7.51E-3
non coding RNA?3710.19Integral component of membraneCC7.74E-3
Myosin3690.69///
non coding RNA?3538.31///
Cyclophilin3464.80///
Ribosomal protein S43459.95///
Adenine nucleotide translocator3431.98///

See the complete list in Additional file 4

TPM Transcript Per Million, PFAM Pfam conserved domains, BP Gene Ontology Biological Process, MF Gene Ontology Molecular Function, CC Gene Ontology Cellular Component

The 25 most highly expressed genes in Mytilisepta virgata mantle rim and representative significantly over-represented annotations by gene set enrichment analysis See the complete list in Additional file 4 TPM Transcript Per Million, PFAM Pfam conserved domains, BP Gene Ontology Biological Process, MF Gene Ontology Molecular Function, CC Gene Ontology Cellular Component The most enriched annotations were linked to the extracellular matrix building and remodeling. The included matrixins, Zinc-dependent extracellular proteases involved in the degradation of components of the extracellular matrix [91], metalloendopeptidases and several proteinase inhibitors (e.g. WAP-like proteins, serine-type endopeptidase inhibitors, etc.), which might regulate their enzymatic activity. Consistently with the high prevalence of connective tissue-related genes, annotations linked to cell-cell and cell-extracellular matrix (e.g. immunoglobulin and cadherin domains) were also over-represented. Another class of enriched proteins was that encoded by homeobox genes; while most of these only displayed limited sequence similarity to homeobox genes whose function has been previously characterized, one of these was clearly homologous to Mohawk, a critical regulator of tendon differentiation [92].

Posterior adductor muscle transcriptional profile

In bivalves, valve closure is controlled by the contraction of the anterior and posterior adductor muscles. However, in all byssus-producing bivalves the anterior muscle is generally much smaller than the posterior one, to the point that in some mussel species (i.e. Perna spp.) the anterior muscle scar is not visible anymore. The large posterior adductor muscle is also closely associated to a sinus (the pericardial sac), which is part of the mussel open circulatory system and is commonly used in laboratory practice for the withdrawal of hemolymph with a syringe [93]. The transcriptional profile of the M. virgata posterior adductor muscle was particularly poor in terms of transcriptional diversity (TDI = 2.08), being only second to the foot (Fig. 4, panel a). Although the muscle can be certainly considered as a highly specialized tissue, it was characterized by a relatively small number (1089) of tissue-specific genes, probably due to the presence of muscular tissue also in the foot and mantle rim (Fig. 4, panel b; Fig. 5, panel a). As expected, the most highly expressed genes encoded structural components of muscle fibers, including actin and myosin, approximately accounting for 2 and 0.7% of total transcriptional activity, respectively. The enriched annotations contained conserved protein domains commonly associated with muscle adhesion proteins, such as immunoglobulins and fibronectin type III domains, and typical muscle components, such as myofibrils, sarcomere, Z disc, A, I and M bands, and functions, such as telethonin and actinin binding, contraction, response to calcium and sarcomere morphogenesis (Table 7). Curiously, “condensed nuclear chromosome” and “mitotic chromosome condensation” were also listed among the most significantly enriched GO terms. This is linked to the high expression of many contigs corresponding to titin, one of the longest known proteins in metazoans, with over 30,000 amino acids, which mRNA was fragmented in the Mytilisepta de novo assembly. Besides its main function as a structural component of the sarcomere, titin has been indeed linked to chromosome stabilization [94].
Table 7

The 25 most highly expressed genes in Septifer virgatus posterior adductor muscle and representative significantly over-represented annotations by gene set enrichment analysis

Most expressed genesOver-represented annotations
AnnotationTPMTermClass p-value
Actin150,779.49Condensed nuclear chromosomeCC4.11E-15
60s ribosomal protein L2369,416.79Skeletal muscle myosin thick filament assemblyBP1.09E-14
Actin44,602.27Immunoglobulin I-set domainPFAM1.89E-14
Paramyosin39,709.12Sarcomere organizationBP1.42E-13
Myosin28,949.91M bandCC2.14E-13
RS-rich protein 127,745.06I bandCC4.35E-13
Myosin16,064.48Structural constituent of muscleMF2.31E-12
40S ribosomal protein SA15,155.63Muscle alpha-actinin bindingMF2.64E-12
Myosin12,525.17Z discCC3.45E-12
Actin11,992.63A bandCC7.25E-12
Myosin9275.62Mitotic chromosome condensationBP3.29E-11
Alpha tubulin8768.82MyofibrilCC3.65E-10
Ferritin8517.40Myosin filamentCC5.51E-10
Elongation factor 1 alpha8411.27Structural molecule activity conferring elasticityMF5.85E-10
PDZ domain containign protein7804.25Telethonin bindingMF5.85E-10
Non coding RNA5844.46SarcomereCC5.99E-10
Ribosomal Protein S105212.90Striated muscle thin filamentCC1.73E-9
Actin4986.83Muscle myosin complexCC1.73E-9
Non coding RNA4561.28Actinin bindingMF1.73E-9
Alpha tubulin4344.93Myosin tailPFAM5.07E-7
Unknown4209.22Fibronectin type III domainPFAM5.33E-6
60s ribosomal protein P04157.43Skeletal muscle thin filament assemblyBP4.27E-9
Small heat shock protein 24.13846.08SarcomerogenesisBP4.27E-9
Beta tubulin3725.55Response to calcium ionBP4.86E-7
Superoxide dismutase3712.29Muscle contractionBP6.84E-7

See the complete list in Additional file 4

TPM Transcript Per Million, PFAM Pfam conserved domains, BP Gene Ontology Biological Process, MF Gene Ontology Molecular Function, CC Gene Ontology Cellular Component

The 25 most highly expressed genes in Septifer virgatus posterior adductor muscle and representative significantly over-represented annotations by gene set enrichment analysis See the complete list in Additional file 4 TPM Transcript Per Million, PFAM Pfam conserved domains, BP Gene Ontology Biological Process, MF Gene Ontology Molecular Function, CC Gene Ontology Cellular Component

Validation of gene expression patterns

Taking into account previous reports of high heterozygosity [69] and high inter-individual variability of expression [95] in marine mussels, and considering the fact that RNA sequencing was carried out on a pool of five specimens, we proceeded to a validation of gene expression levels on individual mussels. This was achieved by Real-Time PCR experiments, performed on three biological replicates, which targeted ten tissue-specific genes (Table 1). Overall, the results obtained fully confirmed the observations collected by the in silico analysis of RNA-seq data, supporting our experimental approach for the detection of significant differences of expression among tissues (Fig. 6). In particular, chitotriosidase and meprin A, typical digestive enzymes, displayed high specificity to the digestive gland. Paramyosin and striated muscle myosin, encoding structural components of muscles, were highly expressed only in the posterior adductor muscle. A gills-specific acetylcholine receptor possibly involved in coordinating the movement of cilia and a glycine-rich secreted protein of unknown function were confirmed to be specifically expressed in this multifunctional tissue. A serine protease inhibitor and a tyrosinase-like protein, both important in byssogenesis, were expressed as high levels in the foot, as expected. Finally, an SCP domain-containing protein and a valine-rich protein of unknown function were confirmed as mantle rim-specific gene products.
Fig. 6

Gene expression levels of 10 selected genes in M. virgata. FO: foot; MR: mantle rim; GI: gills; PM: posterior adductor muscle; DG: digestive gland. “1”, “2” and “3” indicate the three biological replicates. Histogram bars represent the expression relative to housekeeping genes elongation factor 1 alpha and ribosomal protein S2. Error bars represent the standard deviation of three technical replicates

Gene expression levels of 10 selected genes in M. virgata. FO: foot; MR: mantle rim; GI: gills; PM: posterior adductor muscle; DG: digestive gland. “1”, “2” and “3” indicate the three biological replicates. Histogram bars represent the expression relative to housekeeping genes elongation factor 1 alpha and ribosomal protein S2. Error bars represent the standard deviation of three technical replicates It is worth noticing that, despite the maintenance of tissue specificity, some of the target genes displayed relevant inter-individual differences in their expression levels. While these variations are not surprising in the light of previous reports [95], they reveal that a number of unpredictable factors of difficult monitoring might influence gene expression of M. virgata in the marine environment. As an example, the two foot-specific genes displayed an expression level more than 10 times lower in specimen 1 compared to the two other biological replicates (Fig. 6). This difference might indicate a lower byssogenic activity, a process which is well known to be influenced by multiple environmental parameters [96], including the intensity of water currents and exposure to waves, factors which could not be controlled in the sites of sampling. In summary, the results obtained by RT-PCR confirmed that the pooling approach we used was appropriate to detect major differences in gene expression among tissues, permitting to pinpoint gene products which cover an important physiological function in the different body districts of mussels. At the same time however, the non-negligible inter-individual differences observed point out that extra care should be taken into account in future experiments aimed at assessing the individual response of M. virgata to abiotic and biotic stress.

The high prevalence of lectins in the Mytilisepta virgata transcriptome

Bivalve mollusks are known to produce a very broad range of secreted proteins containing carbohydrate-binding domains (CRDs). It has been suggested that these lectin-like molecules might function as Pattern Recognition Receptors (PRRs), acting as a first line of defense towards pathogens. Indeed, due to their high sequence diversity, bivalve lectins could potentially broaden the spectrum of recognition for Microbe Associated Molecular Patterns (MAMPs) or PAMPs, depending on whether the whole microbiota or just its pathogenic fraction are taken into account. However, alternative functions have been suggested. Specifically, the lectins found in the mucus that covers digestive organs have been linked to the feeding process, thanks to their ability to recognize carbohydrates exposed on the surface of food particles [97]. Others might be involved in establishing genetic barriers between species by regulating the species-specific recognition of congenetic gametes [98-100]. Many different types of bivalve lectins have been isolated since the early ‘90 through classical biochemical methods [101-103] and many more have been discovered in recent years in different bivalve species. Although most of these pertain to four distinct families, i.e. C-type lectins, fibrinogen-related proteins (FREPs), galectins and C1qDC proteins, others have been also found, including SUEL-type lectins, [104], lipopolysaccharide and β-1,3-glucan binding proteins [105], mytilectins [106] and F-type lectins [107]. Besides covering fundamental roles in bivalve physiology, some bivalve lectins also possess highly interesting binding or cytotoxic properties which might warrant future studies aimed at developing potential biotechnological applications. For example, the recently identified MytiLec is cytotoxic against some cancer cell types [108], and it has been suggested that a hemolytic C-type lectin from the freshwater clam Villorita cyprinoides could be potentially used as a clot lysing molecule [109]. Some bivalve lectin-encoding gene families are extremely expanded and comprise several hundred members, as suggested by proteomic [110] and transcriptome studies [20, 111, 112] and later confirmed on a whole genome scale in oyster [113]. The purplish Japanese mussel is no exception, as some important lectin-like gene families were also found among the most abundant ones in the de novo assembled transcriptome (Table 8).
Table 8

The 25 most highly represented PFAM conserved domains in the Mytilisepta virgata transcriptome

Domain namePFAM codedomain count
Protein kinase domainPF00069200
Immunoglobulin domainPF00047190
Protein tyrosine kinasePF07714183
Ankyrin repeats (3 copies)PF12796182
Ankyrin repeats (many copies)PF13857175
EF-hand domain pairPF13833161
C1q domain PF00386161
Immunoglobulin I-set domainPF07679161
Ankyrin repeatPF00023160
EF handPF00036157
Lectin C-type domain PF00059154
EGF-like domainPF00008149
EF-hand domainPF13405147
AAA domainPF00004139
Zinc finger, C3HC4 type (RING finger)PF00097129
50S ribosome-binding GTPasePF01926118
Leucine rich repeatPF13855117
7 transmembrane receptor (rhodopsin family)PF00001117
WD domain, G-beta repeatPF00400110
Immunoglobulin V-set domainPF07686109
B-box zinc fingerPF00643108
von Willebrand factor type A domainPF00092106
Leucine Rich repeats (2 copies)PF12799103
Tetratricopeptide repeatPF00515103
Zinc finger, C2H2 typePF00096102
F5/8 type C domain PF0075442
Fibrinogen C-terminal globular domain PF0014737
Galactose binding lectin domain PF0214023
Ricin-type beta-trefoil lectin domain-like PF142006
H-type lectin domain PF094586
Galactoside-binding lectin PF003372

Domains with carbohydrate-binding properties are marked in bold. A few additional less abundant lectin-like domains are listed in the bottom part of the table

The 25 most highly represented PFAM conserved domains in the Mytilisepta virgata transcriptome Domains with carbohydrate-binding properties are marked in bold. A few additional less abundant lectin-like domains are listed in the bottom part of the table Similarly to other mussel species [38], C1qDC proteins were found in very high number in M. virgata (161), suggesting the presence of several hundred genes in the genome of this species, in line with the previously demonstrated massive lineage-specific C1q gene family expansion event which occurred in the class Bivalvia [87]. C1qDC proteins have been implicated on multiple occasions in the innate immune response of different mollusks, where they have been often designed as sialic acid-binding lectins [114-116]. The expression profile of C1qDC genes in Mytilisepta closely resembles that of the Pacific oyster [87], since the most of such genes are selectively expressed in the digestive gland or in the gills. However, ~25% of mussel C1qDC proteins are produced ubiquitously by all tissues (Fig. 7).
Fig. 7

Heat maps summarizing gene expression patterns of C1q domain-containing proteins, C-type lectins, Fibrinogen-related proteins and Galactose-binding lectins in Mytilisepta virgata. Heat maps were built with log2 transformed gene expression levels; genes and tissues were hierarchically clusterized based on Euclidean distances, estimated with average linkage method. DG: digestive gland; G: gills; PAM: posterior adductor muscle; F: foot; MR: mantle rim

Heat maps summarizing gene expression patterns of C1q domain-containing proteins, C-type lectins, Fibrinogen-related proteins and Galactose-binding lectins in Mytilisepta virgata. Heat maps were built with log2 transformed gene expression levels; genes and tissues were hierarchically clusterized based on Euclidean distances, estimated with average linkage method. DG: digestive gland; G: gills; PAM: posterior adductor muscle; F: foot; MR: mantle rim C-type lectins represent another large family of carbohydrate-binding proteins, comprising 154 members in M. virgata. These proteins, which structurally resemble the mannose-binding lectin (MBL) involved in the lectin pathway of the vertebrate complement system, have been often linked to bivalve immunity as PRRs [117-119]. However, coherently with their remarkable structural diversification, they could also be involved in other functions, such as particle capture [120] and gamete recognition [98]. Overall, the pattern of expression of C-type lectins was similar to that of C1qDC proteins, with two well-distinct groups of digestive gland- and gills- specific genes and others found at similar levels in all tissues (Fig. 7). The role of FREPs, first described in gastropod mollusks [88], has later been demonstrated also in bivalves [121, 122]. Despite inconclusive phylogeny, molluscan FREPs structurally resemble vertebrate ficolins which, like MBL, can activate the complement system through the lectin pathway. FREPs possess an astounding sequence diversity in Mytilus spp., where they are though to provide a broad array of PRRs [123]. A total of 37 different FREPs could be identified in the purplish mussel, where they appear to be expressed in a broad range of tissues, but in particular in the gills (Fig. 7). A remarkable number of proteins bearing a galactose-binding domain (23) was also found to be encoded by Mytilisepta mRNAs. This domain, first identified in SUEL lectins from sea urchin eggs, allows the preferential binding of L-rhamnose, in addition to D-galactose [124]. Although most of them are expressed in all the main tissues of mussels, some are clearly foot-specific. On the other hand, the galactoside-binding domain, which characterizes galectins, was only identified in two sequences encoding two structurally different galectins with two and four consecutive CRDs, respectively. Consistently with previous reports in M. galloprovincialis [20], the F5/8 type C domain, typical of fucose-binding (F-type) lectins, was also detected in high abundance (42) in the Mytilisepta transcriptome. However, the function of F-type lectins has been only marginally investigated in bivalves so far and, taking into account that the F5/8 type C domain is also found in many non-lectin proteins (e.g. coagulation factors and bindins), the data currently available is insufficient to classify the M. virgata sequences as bona fide F-type lectins. Six different proteins bearing a ricin-type beta-trefoil lectin-like domain are present in the de novo assembled transcriptome. However, none of these can be classified as mytilectins, despite the fact these novel carbohydrate binding proteins have been first characterized and described as a multigenic family in M. galloprovincialis [106, 108]. This unexpected absence may either reflect a lack of expression in physiological conditions or a highly specific expression to one of the tissues which could not be sampled in the current experiment (e.g. hemocytes or inner mantle). The H-type lectin domain is capable of binding N-acetylgalactosamine and it has been described in the agglutinin of land snails, proteins which are part of the innate immune system, protecting fertilized eggs from bacteria [125, 126]. Although to date no H-type lectin has ever been characterized in bivalve mollusks, six assembled transcripts pertain to this family in M. virgata. Altogether these data confirm the high prevalence of transcripts encoding lectins in bivalve transcriptomes. Although the large majority of the putative carbohydrate-binding proteins encoded by the transcripts identified in M. virgata either pertaining to the C1qDC or to the C-type lectin families, many others, with distinct binding properties and potential biotechnological applications, are also present. Overall, gene expression data point out the digestive gland as the most lectin-rich tissue, followed by the gills, while on the other hand the mantle rim, foot and posterior adductor muscle only produce a limited set of lectin-like molecules. While this remarkable expression might be linked to food particle selection, it is also consistent with the emerging role of peripheral tissues in the bivalve innate immune system, coherently with the extensive contact of gills and digestive tract with the external environments and microbes associated with sea water, sediment and food particles.

Conclusions

As more and more bivalve species become the subject of –omic studies thanks to the development of cost-effective sequencing technologies, most transcriptomic studies are usually either focused on single tissues or on samples obtained from whole body, and just a very few have been so far dedicated to the investigation of the highly specialized function of tissues in a comparative way. With the present study, we tried to fill a gap in the genetic knowledge of M. virgata, providing a detailed snapshot of the gene expression profile of most of the main tissues of this marine mussel species. Besides revealing the high tissue-specificity of genes fundamental for mussel immune defense, feeding and attachment to the substrate, we also provide compelling evidence bimolecular phylogeny that the Japanese purplish mussel is not closely related to Septifer (Récluz, 1848) and that is should be instead considered as part of the Brachidontinae subfamily. Annotations and gene expression levels of Mytilisepta virgata contigs. (XLSX 5900 kb) Multiple sequence alignment of the consensus of the 17-mer repeated units present in the byssal cuticle proteins of Mytilisepta virgata, Septifer biforcatus and Bathymodiolus thermophilus. (PDF 122 kb) Scatter plots representing paired comparisons of gene expression profiles between tissues. (PDF 1316 kb) Complete results of gene set enrichment analyses. (XLSX 29 kb)
  110 in total

1.  Bioaccumulation of metals (Cd, Cu, Zn) by the marine bivalves M. galloprovincialis, P. radiata, V. verrucosa and C. chione in Mediterranean coastal microenvironments: association with metal bioavailability.

Authors:  Aikaterini Sakellari; Sotirios Karavoltsos; Dimitrios Theodorou; Manos Dassenakis; Michael Scoullos
Journal:  Environ Monit Assess       Date:  2012-08-09       Impact factor: 2.513

2.  RNA sequencing and de novo assembly of the digestive gland transcriptome in Mytilus galloprovincialis fed with toxinogenic and non-toxic strains of Alexandrium minutum.

Authors:  Marco Gerdol; Gianluca De Moro; Chiara Manfrin; Anna Milandri; Elena Riccardi; Alfred Beran; Paola Venier; Alberto Pallavicini
Journal:  BMC Res Notes       Date:  2014-10-14

3.  Near-optimal probabilistic RNA-seq quantification.

Authors:  Nicolas L Bray; Harold Pimentel; Páll Melsted; Lior Pachter
Journal:  Nat Biotechnol       Date:  2016-04-04       Impact factor: 54.908

4.  Early expression of a collagenase-like hatching enzyme gene in the sea urchin embryo.

Authors:  T Lepage; C Gache
Journal:  EMBO J       Date:  1990-09       Impact factor: 11.598

5.  Potassium channel modulation by a toxin domain in matrix metalloprotease 23.

Authors:  Srikant Rangaraju; Keith K Khoo; Zhi-Ping Feng; George Crossley; Daniel Nugent; Ilya Khaytin; Victor Chi; Cory Pham; Peter Calabresi; Michael W Pennington; Raymond S Norton; K George Chandy
Journal:  J Biol Chem       Date:  2009-12-04       Impact factor: 5.157

6.  New features of desiccation tolerance in the lichen photobiont Trebouxia gelatinosa are revealed by a transcriptomic approach.

Authors:  Fabio Candotto Carniel; Marco Gerdol; Alice Montagner; Elisa Banchi; Gianluca De Moro; Chiara Manfrin; Lucia Muggia; Alberto Pallavicini; Mauro Tretiach
Journal:  Plant Mol Biol       Date:  2016-03-18       Impact factor: 4.076

7.  Structural studies of Helix aspersa agglutinin complexed with GalNAc: A lectin that serves as a diagnostic tool.

Authors:  Agnieszka J Pietrzyk; Anna Bujacz; Paweł Mak; Barbara Potempa; Tomasz Niedziela
Journal:  Int J Biol Macromol       Date:  2015-09-28       Impact factor: 6.953

8.  Expression of Mytilus immune genes in response to experimental challenges varied according to the site of collection.

Authors:  Hui Li; Paola Venier; María Prado-Alvárez; Camino Gestal; Mylène Toubiana; Rosita Quartesan; Fabio Borghesan; Beatriz Novoa; Antonio Figueras; Philippe Roch
Journal:  Fish Shellfish Immunol       Date:  2010-01-01       Impact factor: 4.581

9.  Unexpected diversity in Shisa-like proteins suggests the importance of their roles as transmembrane adaptors.

Authors:  Jimin Pei; Nick V Grishin
Journal:  Cell Signal       Date:  2011-11-18       Impact factor: 4.315

10.  The Pfam protein families database.

Authors:  Marco Punta; Penny C Coggill; Ruth Y Eberhardt; Jaina Mistry; John Tate; Chris Boursnell; Ningze Pang; Kristoffer Forslund; Goran Ceric; Jody Clements; Andreas Heger; Liisa Holm; Erik L L Sonnhammer; Sean R Eddy; Alex Bateman; Robert D Finn
Journal:  Nucleic Acids Res       Date:  2011-11-29       Impact factor: 16.971

View more
  8 in total

1.  Myticalins: A Novel Multigenic Family of Linear, Cationic Antimicrobial Peptides from Marine Mussels (Mytilus spp.).

Authors:  Gabriele Leoni; Andrea De Poli; Mario Mardirossian; Stefano Gambato; Fiorella Florian; Paola Venier; Daniel N Wilson; Alessandro Tossi; Alberto Pallavicini; Marco Gerdol
Journal:  Mar Drugs       Date:  2017-08-22       Impact factor: 5.118

2.  AmpuBase: a transcriptome database for eight species of apple snails (Gastropoda: Ampullariidae).

Authors:  Jack C H Ip; Huawei Mu; Qian Chen; Jin Sun; Santiago Ituarte; Horacio Heras; Bert Van Bocxlaer; Monthon Ganmanee; Xin Huang; Jian-Wen Qiu
Journal:  BMC Genomics       Date:  2018-03-05       Impact factor: 3.969

3.  A GM1b/asialo-GM1 oligosaccharide-binding R-type lectin from purplish bifurcate mussels Mytilisepta virgata and its effect on MAP kinases.

Authors:  Yuki Fujii; Marco Gerdol; Sarkar M A Kawsar; Imtiaj Hasan; Francesca Spazzali; Tatsusada Yoshida; Yukiko Ogawa; Sultana Rajia; Kenichi Kamata; Yasuhiro Koide; Shigeki Sugawara; Masahiro Hosono; Jeremy R H Tame; Hideaki Fujita; Alberto Pallavicini; Yasuhiro Ozeki
Journal:  FEBS J       Date:  2019-12-24       Impact factor: 5.542

4.  Organ transcriptomes of the lucinid clam Loripes orbiculatus (Poli, 1791) provide insights into their specialised roles in the biology of a chemosymbiotic bivalve.

Authors:  Benedict Yuen; Julia Polzin; Jillian M Petersen
Journal:  BMC Genomics       Date:  2019-11-07       Impact factor: 3.969

5.  Host-Endosymbiont Genome Integration in a Deep-Sea Chemosymbiotic Clam.

Authors:  Jack Chi-Ho Ip; Ting Xu; Jin Sun; Runsheng Li; Chong Chen; Yi Lan; Zhuang Han; Haibin Zhang; Jiangong Wei; Hongbin Wang; Jun Tao; Zongwei Cai; Pei-Yuan Qian; Jian-Wen Qiu
Journal:  Mol Biol Evol       Date:  2021-01-23       Impact factor: 16.240

6.  First Insights into the Repertoire of Secretory Lectins in Rotifers.

Authors:  Marco Gerdol
Journal:  Mar Drugs       Date:  2022-02-09       Impact factor: 5.118

7.  Development and Interrogation of a Transcriptomic Resource for the Giant Triton Snail (Charonia tritonis).

Authors:  A H Klein; C A Motti; A K Hillberg; T Ventura; P Thomas-Hall; T Armstrong; T Barker; P Whatmore; S F Cummins
Journal:  Mar Biotechnol (NY)       Date:  2021-06-30       Impact factor: 3.619

8.  Molecular Diversity of Mytilin-Like Defense Peptides in Mytilidae (Mollusca, Bivalvia).

Authors:  Samuele Greco; Marco Gerdol; Paolo Edomi; Alberto Pallavicini
Journal:  Antibiotics (Basel)       Date:  2020-01-19
  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.