| Literature DB >> 32811897 |
Angela Bertel-Sevilla1, Juan F Alzate2, Jesus Olivero-Verbel3.
Abstract
Mugil incilis (lisa) is an important commercial fish species in many countries, living along the coasts of the western Atlantic Ocean. It has been used as a model organism for environmental monitoring and ecotoxicological investigations. Nevertheless, available genomic and transcriptomic information for this organism is extremely deficient. The aim of this study was to characterize M. incilis hepatic transcriptome using Illumina paired-end sequencing. A total of 32,082,124 RNA-Seq read pairs were generated utilizing the HiSeq platform and subsequently cleaned and assembled into 93,912 contigs (N50 = 2,019 bp). The analysis of species distribution revealed that M. incilis contigs had the highest number of hits to Stegastes partitus (13.4%). Using a sequence similarity search against the public databases GO and KEGG, a total of 7,301 and 16,967 contigs were annotated, respectively. KEGG database showed genes related to environmental information, metabolism and organismal system pathways were highly annotated. Complete or partial coding DNA sequences for several candidate genes associated with stress responses/detoxification of xenobiotics, as well as housekeeping genes, were employed to design primers that were successfully tested and validated by RT-qPCR. This study presents the first transcriptome resources for Mugil incilis and provides basic information for the development of genomic tools, such as the identification of RNA markers, useful to analyze environmental impacts on this fish Caribbean species.Entities:
Mesh:
Year: 2020 PMID: 32811897 PMCID: PMC7435268 DOI: 10.1038/s41598-020-70902-5
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Summary of the assembled transcriptome of M. incilis.
| Item | Number |
|---|---|
| Total length of sequence | 91,495,553 bp |
| Total number of sequences | 93,912 |
| Average contig length | 974 bp |
| Largest contig | 17,447 bp |
| Shortest contig | 201 bp |
| N25 stats | 25% of total sequence length is contained in the 4,329 sequences > = 3,634 bp |
| N50 stats | 50% of total sequence length is contained in the 12,855 sequences > = 2,019 bp |
| N75 stats | 75% of total sequence length is contained in the 30,528 sequences > = 786 bp |
| Total GC count | 43,537,515 bp |
| GC % | 47.58% |
| Number of Ns | 0 |
| Ns % | 0.00% |
Ns, ambiguous bases.
Figure 1Function classifications of Gene Ontology (GO) terms of M. incilis transcriptome data. GO terms were annotated at level 2 of classification according to three main categories (Biological process, Cellular component and Molecular function). The x-axis indicates the subcategories, and the y-axis registers the percentages of genes (number of a particular gene divided by total gene number).
Figure 2Top 20 pathways with the largest numbers of contigs based on KEGG annotation analysis.
Primer sequences for real-time PCR.
| Gene name | Gene abbreviation | Gene ID | Forward (5′ to 3′) | Reverse (5′ to 3′) | Amplicon size (pb) |
|---|---|---|---|---|---|
| Superoxide dismutase 1 | SOD1 | MH716024 | TGGAGATAACACGAACGGGTG | ATTGGGGCCAGCGTGATTC | 76 |
| Catalase | CAT | MH716025 | AAAGAGTGGTGCATGCCAAG | TGCCGACATGTTCAAACACC | 102 |
| Proteína de shock térmico de 70 KDa | HSP70 | MH716026 | CTGGATGTGACCCCCTTGTC | GTTGGGATGGTGGTGTTCCT | 86 |
| Factor Inducible por Hipoxia-1, Subunidad Alfa | HIF1A | MH716027 | TAGTTGAGAGCAGCCAACCC | CAGCTCTGCACACTTGTCCT | 80 |
| Cytochrome P4501A | CYP1A1 | MH716028 | TGTTCTTGGCCATCCTCGTC | CAGTGGTCATCCCTCACTCG | 142 |
| Cytochrome P450, family 3, subfamily A | CYP3A | MK110662 | GACTCTCCTGGTTGCGTTCA | CCCGTACTTCTTGTGGCACT | 172 |
| Retinoid X receptor alpha isoformX2 | RXRA X2 | MK110663 | TTCAACCCAGACTCCAAAGG | CGGGATACTTGTGTTTGCAG | 103 |
| Aryl hydrocarbon receptor | AHR | MH716029 | CTCGGCTTTCACCAGACTGA | CAAGATGAGGTGGACTCGGC | 161 |
| Apoptosis Regulator | Bcl-X | MH716030 | GAGTTTCAAGAAGTGGCTGCTG | ACAGTCGTTTCTGAGCGAAG | 83 |
| Apoptosis Regulator BAX | BAX | MH716031 | AGGCGATCAAGGAAATGCCA | GTGCGAGCTTCTTGTGGTTG | 167 |
| Caspase-3 | CASP3 | MK183035 | TCAGAGAAATGGCACAGACG | TCATTTGCTCGACTGACTGG | 104 |
| Growth arrest and DNA damage inducible alpha | Gadd45A | MK183036 | TTCACTTCACCCTCATCCAG | TTCACCCCCACTCTGTTTTC | 125 |
| Tumor necrosis factor alpha induced protein 3 | TNF-α | MH716032 | ATGCTAGAGGGCTACTGCGA | ATAACCAGAGGTGGGGCGA | 80 |
| Interleukin-6 | IL6 | MK183037 | GAAAAACAAGAAGCGGGTCA | GCTTCTCCCTTCTGTCGATG | 173 |
| Major Histocompatibility Complex, Class II, alpha chain | MHC- class II alpha | MK183039 | TTGGTCTGACTCTGGGTTTG | AGCTGCACTCGTTTCCTTTG | 72 |
| Peroxisome proliferator-activated receptor alpha | PPARA | MK183038 | GAGGACTCGGTGTTGGACAG | CGTCGATGGACTGGGAGATG | 91 |
| Peroxisome proliferator-activated receptor gamma | PPARG | MH716033 | CGGAGACAACATGCCTTTCG | CCTGAAACTCCGTGTGCAGT | 138 |
| Ribosomal Protein L13a (RP-L13) | L13a | MH712490 | TCCTGCGTAAGAGGATGAACAC | TGGTTTTATGGGGCAGCATG | 108 |
| Elongation Factor-1 alpha | EF1 | MH712491 | CGTGAAGCTGATCCCACAGA | CCAGAGACCTCCTTGTACTCG | 141 |
| Glyceraldehyde 3-phosphate dehydrogenase | GAPDH | MH716023 | AAGTCGGTATCAATGGATTTGGC | TCCACCTTCTTGGAGACGAA | 73 |
PCR primer efficiencies used in Real-time qPCR assays.
| Genes and pathways | Amplification efficiency (%) | R2 |
|---|---|---|
| SOD1 | 107.8 | 0.99 |
| CAT | 93.5 | 1.00 |
| HSP70 | 112.8 | 0.98 |
| HIF1A | 100.3 | 1.00 |
| CYP1A1 | 99.7 | 1.00 |
| CYP3A | 90.1 | 1.00 |
| RXRA X2 | 93.7 | 1.00 |
| AHR | 116.6 | 0.99 |
| Bcl-X | 100.9 | 1.00 |
| BAX | 116.1 | 1.00 |
| CASP3 | 101.8 | 1.00 |
| Gadd45A | 106.0 | 0.95 |
| TNF-α | 113.3 | 0.98 |
| IL6 | 89.6 | 0.99 |
| HLA-DRA | 91.8 | 0.99 |
| PPARA | 94.2 | 1.00 |
| PPARG | 91.2 | 1.00 |
| L13a | 93.0 | 1.00 |
| EF1 | 96.0 | 1.00 |
| GAPDH | 91.7 | 1.00 |
Figure 3Gel electrophoresis of the RT-PCR products of genes associated with stress responses/xenobiotic detoxification. The lanes correspond to blank (1), cDNA from M. incilis (2–9), P. magdalenae (11–12), H. malabaricus (13–14) and M. musculus (15). All the gels were run under the same experimental conditions and are presented using cropped images. The entire gel photos of all genes evaluated can be found in Figure S5.