Héctor Osorio1, Patricio Tapia-Reyes2, Daniela Espinoza1, Daniel Laporte3, Alberto González1, Eduardo Castro-Nallar4,5, Alejandra Moenne1.
Abstract
The genome of the marine alga Ulva compressa was assembled using long and short reads. The genome assembly was 80.8 Mb in size and encoded 19,207 protein-coding genes. Several genes encoding antioxidant enzymes and a few genes encoding enzymes that synthesize ascorbate and glutathione were identified, showing similarity to plant and bacterial enzymes. Additionally, several genes encoding signal transduction protein kinases, such as MAPKs, CDPKs, CBLPKs, and CaMKs, were detected, showing similarity to plant, green microalgal, and bacterial proteins. Regulatory transcription factors, such as ethylene- and ABA-responsive factors, MYB, WRKY, and HSTF, were also present and showed similarity to plant and green microalgal transcription factors. Genes encoding enzymes that synthesize ACC and ABA-aldehyde were also identified, but oxidases that synthesize ethylene and ABA, as well as enzymes that synthesize other plant hormones, were absent. Interestingly, genes involved in plant cell wall synthesis and genes encoding proteins related to the animal extracellular matrix were also detected. Genes encoding cyclins and CDKs were also found, and the CDKs showed similarity to animal and fungal CDKs. A few genes encoding voltage-dependent calcium channels and ionotropic glutamate receptors were identified, showing similarity to animal channels. Genes encoding Transient Receptor Potential (TRP) channels were not identified, even though TRPs have been experimentally detected, indicating that the genome assembly is not yet complete. Thus, protein-coding genes present in the genome of U. compressa showed similarity to plant and green microalgal genes, but also to animal, bacterial, and fungal genes.
Keywords: Ulva compressa; genome; marine alga; protein-coding genes; structural RNA genes
Year: 2022 PMID: 35806287 PMCID: PMC9266709 DOI: 10.3390/ijms23137279
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 6.208
Summary statistics of U. compressa genome and comparison with U. mutabilis.
| Genome | U. compressa | U. mutabilis |
|---|---|---|
| Genome size | 80.8 Mb | 98.5 Mb |
| Number of scaffolds | 2601 | 318 |
| Scaffolds N50 | 0.46 Mb | 0.6 Mb |
| Scaffolds L50 | 48 | 46 |
| Percentage of GC content | 57.3% | 57.2% |
| Number of protein coding genes | 19,207 | 12,924 |
| Gene density | 238 genes/Mb | 131 genes/Mb |
| Average introns per gene | 3.6 | nd |
| Average exon length | 322 bp | nd |
| Average intron length | 479 bp | nd |
| Number of exons in CDS | 88,404 | nd |
| Number of introns in CDS | 69,134 | nd |
| Percentage of core genes | 80% | 92% |
| Repetitive elements (RE) | 18.9% | 35% |
| LINE | 1.77% | 15.1% |
| LTR | 1.52% | 9.4% |
| DNA elements | 0.61% | |
| Unknown RE | 15.04% | 10.5% |
| Number of rRNAs | 18 | nd |
| Number of tRNAs | 109 | nd |
| Number of snoRNAs | 27 | nd |
| Number of snRNAs | 11 | nd |
| Number of primary miRNAs | 87 | nd |
nd: not determined.
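The derived rows of the table (gene density and average introns per gene) follow from the raw counts in the same table. As a minimal sketch, the arithmetic can be reproduced as below; the dictionary layout and the function names `gene_density` and `introns_per_gene` are illustrative assumptions, not part of the original analysis, and only values printed in the table are used.

```python
# Values copied from the summary table; the data layout is an assumption
# made for illustration, not the authors' pipeline.
genomes = {
    "U. compressa": {"size_mb": 80.8, "genes": 19207, "introns": 69134},
    "U. mutabilis": {"size_mb": 98.5, "genes": 12924, "introns": None},  # nd in the table
}


def gene_density(genes, size_mb):
    """Protein-coding genes per megabase of assembly."""
    return genes / size_mb


def introns_per_gene(introns, genes):
    """Average introns per gene, or None when the intron count is not determined."""
    if introns is None:
        return None
    return introns / genes


for name, g in genomes.items():
    density = gene_density(g["genes"], g["size_mb"])
    ipg = introns_per_gene(g["introns"], g["genes"])
    ipg_text = "nd" if ipg is None else f"{ipg:.1f}"
    print(f"{name}: {density:.0f} genes/Mb, {ipg_text} introns per gene")
```

Rounded to the precision used in the table, this gives 238 and 131 genes/Mb and 3.6 introns per gene for U. compressa, matching the reported values.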
Figure 1. Classification into molecular function (A) and biological process (gene ontology) (B) of protein-coding genes in the genome of U. compressa.
Figure 2. Scheme of protein-coding genes in the U. compressa genome: superoxide dismutase (SOD), ascorbate peroxidase (AP), glutathione reductase (GR), and peroxiredoxin (PRX); L-galactono-lactone dehydrogenase (L-GLDH), L-galactose dehydrogenase, glutamate-cysteine synthase (GCS), and glutathione synthase (GS); metallothioneins UcMT1, UcMT2, and UcMT3; phenylalanine ammonia lyase (PAL); pyruvate dehydrogenase (PDH), isocitrate dehydrogenase (IDH), and 2-oxoglutarate dehydrogenase (OGDH); mitogen-activated protein kinases (MAPKs); calcium-dependent protein kinases (CDPKs); calcineurin B-like protein kinases (CBLPKs) and calcium/calmodulin protein kinases (CaMKs); transient receptor potential (TRP) channels, voltage-dependent calcium channels (VDCCs), phospholipase D (PLD), calcium/ryanodine-dependent channel (RyC), IP3-dependent channel (IP3C), NAADP-dependent channel (NAADPC), and ionotropic glutamate receptors (GluR); cAMP-dependent protein kinase (PKA) and cGMP-dependent protein kinase (PKG). Proteins that have been identified in the U. compressa genome and have also been experimentally detected are depicted in black; those that have been experimentally detected but are not present in the U. compressa genome are depicted in gray with a question mark.