| Literature DB >> 36180578 |
Abstract
Ecologically, Halophila beccarii Asch. is considered as a colonizing or a pioneer seagrass species and a "tiny but mighty" seagrass species, since it may recover quickly from disturbance generally. The use of transcriptome technology can provide a better understanding of the physiological processes of seagrasses. To date, little is known about the genome and transcriptome information of H. beccarii. In this study, we used single molecule real-time (SMRT) sequencing to obtain full-length transcriptome data and characterize the transcriptome structure. A total of 11,773 of the 15,348 transcripts were successfully annotated in seven databases. In addition, 1573 long non-coding RNAs, 8402 simple sequence repeats and 2567 transcription factors were predicted in all the transcripts. A GO analysis showed that 5843 transcripts were divided into three categories, including biological process (BP), cellular component (CC) and molecular function (MF). In these three categories, metabolic process (1603 transcripts), protein-containing complex (515 transcripts) and binding (3233 transcripts) were the primary terms in BP, CC, and MF, respectively. The major types of transcription factors were involved in MYB-related and NF-YB families. To the best of our knowledge, this is the first report of the transcriptome of H. beccarii using SMRT sequencing technology.Entities:
Mesh:
Substances:
Year: 2022 PMID: 36180578 PMCID: PMC9525579 DOI: 10.1038/s41598-022-20988-w
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Summary of PacBio SMRT sequencing in Halophila beccarii Asch.
| Category | Dataset |
|---|---|
| Read bases of CCS | 325,064,946 |
| Number of CCS | 272,028 |
| Mean Read Length of CCS | 1194 |
| Number of full-length non-chimeric reads | 213,301 |
| Full-length non-chimeric percentage (FLNC%) | 78.41% |
| Average consensus isoforms read length | 1011 |
| Number of polished high-quality isoforms | 21,264 |
CCS circular consensus; SMRT single nucleotide real-time.
Figure 1FLNC length distribution. FNLC, full-length non-concatemer.
Figure 2(a) Length distribution of CDS and (b) Type distribution of TFs. CDS, coding sequence; TFs, transcription factors.
The top 20 mapped pathways annotated by the KEGG database.
| Pathways | Pathway ID | Gene number | Percentage |
|---|---|---|---|
| Protein processing in endoplasmic reticulum | ko04141 | 468 | 8.48% |
| Ribosome | ko03010 | 433 | 7.85% |
| Carbon metabolism | ko01200 | 292 | 5.29% |
| Biosynthesis of amino acids | ko01230 | 254 | 4.60% |
| Glycolysis / Gluconeogenesis | ko00010 | 232 | 4.21% |
| Endocytosis | ko04144 | 162 | 2.94% |
| Spliceosome | ko03040 | 161 | 2.92% |
| Oxidative phosphorylation | ko00190 | 153 | 2.77% |
| Plant-pathogen interaction | ko04626 | 141 | 2.56% |
| RNA transport | ko03013 | 137 | 2.48% |
| Epstein-Barr virus infection | ko05169 | 136 | 2.47% |
| Phagosome | ko04145 | 128 | 2.32% |
| Carbon fixation in photosynthetic organisms | ko00710 | 116 | 2.10% |
| Flavonoid biosynthesis | ko00941 | 115 | 2.08% |
| Phenylpropanoid biosynthesis | ko00940 | 110 | 1.99% |
| PI3K-Akt signaling pathway | ko04151 | 108 | 1.96% |
| Plant hormone signal transduction | ko04075 | 107 | 1.94% |
| Starch and sucrose metabolism | ko00500 | 106 | 1.92% |
| Arginine and proline metabolism | ko00330 | 104 | 1.89% |
| Photosynthesis—antenna proteins | ko00196 | 102 | 1.85% |
KEGG, Kyoto Encyclopedia of Genes and Genomes.
Figure 3(a) GO annotation and (b) KOG annotation of Halophila. beccarii transcriptome. GO, Gene Ontology; KOG, EuKaryotic Orthologous Group.
Figure 4Candidate lncRNAs identified by PLEK, CPC2.0, CPAT and Pfam. lncRNAs, long non-coding RNAs.
The top 20 DEGs in the rhizomes compared with the leaves.
| DEGs | Fold change | p value |
|---|---|---|
| Alcohol dehydrogenase 2 (ADH2) | 2.461847516 | 1.45E−28 |
| Protein DMR6-like oxygenase 2 (DLO2) | 2.870885051 | 3.59E−15 |
| Naringenin,2-oxoglutarate 3-dioxygenase | 2.016622651 | 5.86E−14 |
| L-type lectin-domain containing receptor kinase IV.1 | 2.884287285 | 1.17E−11 |
| Probable ATP synthase | 0.384616453 | 4.69E−11 |
| Transcription factor bHLH123 | 2.451808363 | 6.07E−11 |
| Transcription factor MYB4 | 2.417753479 | 2.43E−10 |
| Cellulose synthase-like protein | 2.121630125 | 6.65E−10 |
| Probable cinnamyl alcohol dehydrogenase 1 (CAD1) | 3.378190526 | 1.25E−09 |
| Zinc finger protein 10 (ZFP10) | 2.428728573 | 1.73E−09 |
| PGR5-like protein 1A (PGRL1A) | 0.338663384 | 1.83E−09 |
| Probable apyrase 1 (APY1) | 2.023614949 | 2.04E−09 |
| Heavy metal-associated isoprenylated plant protein 39 (HIPP39) | 2.212188767 | 2.24E−09 |
| Respiratory burst oxidase homolog protein C (RBOHC) | 2.113301068 | 3.83E−09 |
| Cytochrome b5 | 2.232792949 | 5.67E−09 |
| NifU-like protein 3 (NIFU3) | 0.198341181 | 1.23E−08 |
| Serine/threonine-protein kinase SAPK6 | 2.173676296 | 2.00E−08 |
| RNA replication polyprotein | 3.324898518 | 2.12E−08 |
| Ferredoxin-dependent glutamate synthase | 0.418434506 | 3.78E−08 |
| Nudix hydrolase 1 | 2.015395898 | 7.84E−08 |
DEGs, differentially expressed genes.
Figure 5(a) Schematic diagram of the correlation between samples and (b) heatmap of a hierarchical cluster analysis of the patterns of gene expression in the leaves and rhizomes of Halophila. beccarii.
Figure 6KEGG enrichment analysis Bubble Diagram of upregulated gene expression in the leaf tissues of Halophila beccarii. The size of circle reflects the number of differentially expressed genes. KEGG, Kyoto Encyclopedia of Genes and Genomes.