| Literature DB >> 32900905 |
Mayuko Hamada1,2,3, Noriyuki Satoh4, Konstantin Khalturin4.
Abstract
Various Hydra species have been employed as model organisms since the 18th century. Introduction of transgenic and knock-down technologies made them ideal experimental systems for studying cellular and molecular mechanisms involved in regeneration, body-axis formation, senescence, symbiosis, and holobiosis. In order to provide an important reference for genetic studies, the Hydra magnipapillata genome (species name has been changed to H. vulgaris) was sequenced a decade ago (Chapman et al., 2010) and the updated genome assembly, Hydra 2.0, was made available by the National Human Genome Research Institute in 2017. While H. vulgaris belongs to the non-symbiotic brown hydra lineage, the green hydra, Hydra viridissima, harbors algal symbionts and belongs to an early diverging clade that separated from the common ancestor of brown and green hydra lineages at least 100 million years ago (Schwentner and Bosch 2015; Khalturin et al., 2019). While interspecific interactions between H. viridissima and endosymbiotic unicellular green algae of the genus Chlorella have been a subject of interest for decades, genomic information about green hydras was nonexistent. Here we report a draft 280-Mbp genome assembly for Hydra viridissima strain A99, with a scaffold N50 of 1.1 Mbp. The H. viridissima genome contains an estimated 21,476 protein-coding genes. Comparative analysis of Pfam domains and orthologous proteins highlights characteristic features of H. viridissima, such as diversification of innate immunity genes that are important for host-symbiont interactions. Thus, the H. viridissima assembly provides an important hydrozoan genome reference that will facilitate symbiosis research and better comparisons of metazoan genome architectures.Entities:
Keywords: Hydra viridissima A99; de novo assembly; green hydra; symbiosis; whole genome sequencing
Mesh:
Year: 2020 PMID: 32900905 PMCID: PMC7642931 DOI: 10.1534/g3.120.401411
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Figure 1Phylogeny and morphology of green hydra Hydra viridissima. (A) Phylogenetic position of H. viridissima (red) within the phylum Cnidaria. (B) Relationship of Hydra viridissima strain A99 (red) with other H. viridissima strains and brown hydra species, based on phylogenetic analysis with the NJ method using cytochrome c oxidase subunit I (COI) gene sequences. The genomic region in H. viridissima A99 and Genbank IDs in other strains used in the phylogenetic analysis are indicated. (c) Photographs of H. viridissima (left) and H. vulgaris (right). H. viridissima is smaller than H. vulgaris, and green due to symbiotic Chlorella in its endodermal epithelial cells.
Comparison of the genome assembly statistics of cridarians
| Hydrozoa | Scyphozoa | Cubozoa | Anthozoa | |||||
|---|---|---|---|---|---|---|---|---|
| Species | ||||||||
| Geographical origin | Laboratory strain A99 | Laboratory strain 105 | Villefranche (Atlantic) | Baltic Sea (Atlantic) | Seto Inland Sea (Pacific) | Okinawa (Pacific) | Laboratory strain | |
| Genome size (Mbp) | 284 | 852 | 854 | 445 | 377 | 952 | 420 | 457 |
| Number of Scaffolds | 2,677 | 20,914 | 5,525 | 7,644 | 2,710 | 4,538 | 2,420 | 10,804 |
| Longest scaffold (Mbp) | 5.1 | 0.91 | 4.4 | 2.9 | 4.4 | 14.5 | 2.5 | 3.3 |
| Scaffold N50 (Mbp) | 1.1 | 0.1 | 1.0 | 0.4 | 1.0 | 2.2 | 0.5 | 0.5 |
| GC content (%) | 24.7 | 25.4 | 25.4 | 35 | 34.7 | 31.4 | 39 | 39 |
| Repeats (%) | 37.5 | 57 | 60.3 | 41 | 44.7 | 37.4 | 12.9 | 26 |
| Gap rate (%) | 16.8 | 7.82 | 8.0 | 16.6 | 6.63 | 11.9 | 15.2 | 16.6 |
| Number of genes | 21,476 | 31,452 | 33,820 | 26,727 | 28,625 | 24,278 | 23,668 | 27,273 |
| Mean gene length (bp) | 7,637 | 6,873 | 12,378 | 5,848 | 10,215 | 21,444 | 8,727 | 4,500 |
| Mean exon length (bp) | 209 | 247 | N/A | 281 | 368 | 350 | 316 | 208 |
| Mean intron length (bp) | 838 | N/A | N/A | N/A | 1391 | 3572 | 1057 | 800 |
| BUSCO (complete) % | 83.9 | 76.8 | 80.2 | 86.4 | 79.8 | 81.5 | 74.5 | 91.4 |
This study. b Leclère , cKhalturin . dShinzato . ePutnam , fGenome assembly version 1, Chapman , gGenome assembly version 2, Hydra2.0 Web Portal (https://research.nhgri.nih.gov/hydra/).
The number of NACHT/NB-ARC domain-containing proteins in cnidarians and the combination of repeat domains
| 264 | 89 | 101 | 56 | 58 | 24 | 6 | 489 | |
| 103 | 38 | 29 | 3 | 8 | 4 | 2 | 57 | |
| 2 | 4 | 2 | 3 | 2 | 3 | 1 | 8 | |
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 125 | |
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 11 |
Hvir: Hydra viridissima A99, Hvul v1: Hydra vulgaris (Chapman ),Hvul v2: Hydra vulgaris (Hydra2.0), Ch: Clytia hemisphaerica, Mv: Morbakka virulenta, Aa: Aurelia aurita, Nv, Nematostella vectensis, Ad: Acropora digitifera,
Number of homeodomain-containing genes in the Hydra viridissima genome.
| Class | Subclass | Familiy | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| ANTP | 0 | 0 | 0 | 1 | 2 | 1 | 0 | 0 | ||
| 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | |||
| 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | |||
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |||
| 1 | 1 | 2 | 1 | 1 | 1 | 1 | 1 | |||
| 0 | 0 | 0 | 0 | 0 | 0 | 3 | 1 | |||
| 2 | 3 | 3 | 1 | 4 | 5 | 3 | 1 | |||
| 1 | 1 | 1 | 0 | 1 | 1 | 4 | 1 | |||
| 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | |||
| 0 | 0 | 0 | 1 | 1 | 1 | 1 | 1 | |||
| 1 | 1 | 1 | 0 | 1 | 1 | 4 | 2 | |||
| 0 | 0 | 0 | 0 | 1 | 1 | 2 | 1 | |||
| 1 | 2 | 2 | 2 | 1 | 2 | 1 | 1 | |||
| 0 | 0 | 0 | 1 | 1 | 1 | 2 | 1 | |||
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |||
| 0 | 0 | 0 | 0 | 0 | 1 | 3 | 2 | |||
| 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | |||
| 1 | 1 | 1 | 1 | 0 | 0 | 1 | 1 | |||
| 0 | 0 | 0 | 1 | 1 | 1 | 2 | 0 | |||
| 0 | 0 | 0 | 0 | 0 | 0 | 2 | 2 | |||
| 1 | 1 | 1 | 1 | 2 | 1 | 1 | 1 | |||
| 2 | 1 | 1 | 1 | 1 | 3 | 8 | 4 | |||
| 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | |||
| 0 | 0 | 0 | 1 | 1 | 0 | 1 | 1 | |||
| 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | |||
| 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | |||
| 1 | 1 | 1 | 2 | 1 | 1 | 6 | 2 | |||
| 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | |||
| 0 | 0 | 0 | 1 | 1 | 1 | 0 | 0 | |||
| PRD | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | ||
| 1 | 2 | 1 | 0 | 0 | 0 | 1 | 1 | |||
| 1 | 0 | 1 | 1 | 1 | 0 | 7 | 2 | |||
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |||
| 1 | 1 | 1 | 1 | 2 | 1 | 1 | 1 | |||
| 2 | 2 | 2 | 1 | 1 | 1 | 1 | 1 | |||
| 3 | 3 | 2 | 3 | 4 | 7 | 3 | 4 | |||
| 0 | 0 | 0 | 0 | 0 | 0 | 2 | 2 | |||
| 2 | 1 | 1 | 2 | 2 | 1 | 2 | 1 | |||
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |||
| 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | |||
| 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | |||
| 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | |||
| 1 | 1 | 1 | 1 | 1 | 1 | 2 | 2 | |||
| 2 | 0 | 1 | 1 | 2 | 1 | 1 | 0 | |||
| LIM | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 0 | ||
| 1 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | |||
| 0 | 0 | 0 | 1 | 0 | 1 | 1 | 1 | |||
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |||
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |||
| POU | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | ||
| 0 | 0 | 0 | 0 | 1 | 1 | 1 | 0 | |||
| 0 | 0 | 0 | 0 | 1 | 1 | 3 | 2 | |||
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |||
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |||
| SINE | 0 | 0 | 0 | 1 | 2 | 2 | 2 | 2 | ||
| 1 | 1 | 1 | 1 | 1 | 2 | 1 | 1 | |||
| 1 | 1 | 1 | 1 | 1 | 1 | 2 | 1 | |||
| TALE | 1 | 1 | 1 | 1 | 2 | 1 | 1 | 1 | ||
| 1 | 1 | 1 | 1 | 5 | 1 | 1 | 1 | |||
| 1 | 1 | 2 | 1 | 1 | 1 | 1 | 1 | |||
| 1 | 0 | 0 | 1 | 1 | 1 | 1 | 1 | |||
| 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | |||
| CERS | 2 | 0 | 1 | 2 | 1 | 1 | 1 | 1 | ||
| CUT | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | ||
| HNF | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 |
Hvir: Hydra viridissima A99, Hvul v1: Hydra vulgaris (Chapman ), Hvul v2: Hydra vulgaris (Hydra2.0), Ch: Clytia hemisphaerica, Mv: Morbakka virulenta, Aa: Aurelia aurita, Nv, Nematostella vectensis, Ad: Acropora digitifera.
Figure 2Interspersed Repeat Landscape in Hydra. Components and proportions of repetitive sequences in the genome of (A) Hydra viridissima A99 and (B) H. vulgaris are shown. Classes of repeat are shown in the right column.
Number of genes with Pfam domains enriched in the Hydra viridissima genome and comparison of their number in the other cnidarian genomes. A. Domains enriched in Hydra viridissima. B. Domains enriched in Hydra species
| A. Domain | Chi test* | |||||||
|---|---|---|---|---|---|---|---|---|
| 161 | 75 | 42 | 23 | 45 | 39 | 458 | 1E-53 | |
| 106 | 28 | 18 | 4 | 20 | 6 | 220 | 5E-53 | |
| 64 | 19 | 28 | 13 | 14 | 19 | 36 | 7E-35 | |
| 49 | 11 | 19 | 15 | 24 | 17 | 49 | 7E-16 | |
| 47 | 13 | 16 | 0 | 12 | 4 | 5 | 2E-66 | |
| 46 | 9 | 3 | 5 | 1 | 1 | 0 | 1E-189 | |
| 42 | 13 | 1 | 0 | 4 | 1 | 0 | 2E-160 | |
| 41 | 7 | 3 | 11 | 15 | 12 | 36 | 1E-22 | |
| 23 | 7 | 6 | 6 | 7 | 6 | 2 | 8E-19 | |
| 21 | 6 | 4 | 0 | 0 | 4 | 3 | 6E-40 | |
| 15 | 5 | 1 | 1 | 1 | 1 | 0 | 4E-40 | |
| 14 | 5 | 2 | 3 | 3 | 1 | 3 | 5E-15 | |
| 10 | 4 | 2 | 0 | 3 | 3 | 6 | 4E-06 | |
| 5 | 1 | 1 | 1 | 2 | 0 | 0 | 6E-07 |
Hvir: Hydra viridissima A99, Hvul: Hydra vulgaris (Hydra2.0), Ch: Clytia hemisphaerica, Mv: Morbakka virulenta, Aa: Aurelia aurita, Nv, Nematostella vectensis, Ad: Acropora digitifera, *Chi-test: evalue of Chi-square test.
Top 10 overrepresented orthologs in the Hydra viridissima genome and comparison of their gene number in the other cnidarian genomes. A. Orthologs enriched in the Hydra viridissima genome. B Orthologs enriched in Hydra species
| A. Ortholog ID | Chi-test* | Annotation | |||||||
|---|---|---|---|---|---|---|---|---|---|
| 152 | 51 | 30 | 19 | 0 | 0 | 0 | 8E-128 | Nod-like receptor like | |
| 98 | 48 | 11 | 4 | 15 | 26 | 1 | 2E-58 | Uncharacterized protein | |
| 73 | 29 | 0 | 0 | 0 | 1 | 0 | 3E-79 | Uncharacterized protein | |
| 29 | 13 | 11 | 0 | 0 | 8 | 0 | 3E-17 | Uncharacterized protein | |
| 24 | 4 | 0 | 8 | 2 | 0 | 0 | 7E-21 | Uncharacterized protein | |
| 23 | 6 | 2 | 0 | 0 | 0 | 0 | 3E-24 | Uncharacterized protein | |
| 23 | 1 | 1 | 0 | 0 | 0 | 1 | 2E-30 | TIR-only protein | |
| 19 | 8 | 1 | 0 | 0 | 0 | 0 | 2E-18 | Uncharacterized protein | |
| 18 | 1 | 0 | 0 | 0 | 0 | 0 | 1E-25 | Uncharacterized protein | |
| 14 | 6 | 2 | 0 | 0 | 0 | 1 | 1E-11 | DDE superfamily endonuclease |
Hvir: Hydra viridissima A99, Hvul: Hydra vulgaris (Hydra2.0), Ch: Clytia hemisphaerica, Mv: Morbakka virulenta, Aa: Aurelia aurita, Nv, Nematostella vectensis, Ad: Acropora digitifera, *Chi-test: evalue of Chi-square test.
Figure 3Schematic representation of domain structures of NACHT/NB-ARC or TIR-domain-containing proteins identified in Hydra. The domain structures and the number of NACHT/NB-ARC or TIR-domain-containing proteins in Hydra viridissima A99 (Hvir) and H. vulgaris (Hvul) are shown.
Number of putative transcription factor genes (A) and signaling molecule genes (B) in the Hydra viridissima genome.
| A. Domain | ||||||||
|---|---|---|---|---|---|---|---|---|
| 8 | 7 | 9 | 10 | 10 | 8 | 5 | 8 | |
| 0 | 0 | 0 | 1 | 2 | 0 | 0 | 0 | |
| 26 | 26 | 30 | 26 | 22 | 25 | 36 | 29 | |
| 25 | 22 | 23 | 31 | 24 | 27 | 32 | 17 | |
| 1 | 0 | 1 | 3 | 1 | 1 | 2 | 1 | |
| 6 | 6 | 5 | 7 | 9 | 11 | 12 | 7 | |
| 9 | 11 | 11 | 13 | 21 | 14 | 16 | 12 | |
| 17 | 17 | 16 | 19 | 15 | 26 | 34 | 22 | |
| 4 | 4 | 5 | 3 | 7 | 7 | 4 | 5 | |
| 0 | 0 | 0 | 1 | 4 | 2 | 6 | 7 | |
| 33 | 36 | 34 | 44 | 52 | 50 | 72 | 53 | |
| 30 | 33 | 33 | 31 | 30 | 29 | 33 | 27 | |
| 50 | 44 | 49 | 70 | 88 | 82 | 153 | 96 | |
| 9 | 9 | 9 | 12 | 12 | 9 | 20 | 9 | |
| 2 | 1 | 3 | 2 | 1 | 2 | 3 | 3 | |
| 61 | 23 | 57 | 3 | 4 | 23 | 9 | 8 | |
| 2 | 3 | 2 | 3 | 4 | 4 | 6 | 4 | |
| 2 | 1 | 3 | 5 | 2 | 2 | 3 | 2 | |
| 2 | 2 | 2 | 4 | 3 | 2 | 3 | 1 | |
| 6 | 7 | 7 | 11 | 10 | 9 | 14 | 10 | |
| 1 | 1 | 1 | 2 | 3 | 2 | 1 | 2 | |
| 105 | 123 | 121 | 244 | 233 | 118 | 169 | 90 | |
| 1 | 2 | 2 | 3 | 2 | 3 | 3 | 4 | |
| 9 | 8 | 8 | 11 | 8 | 9 | 19 | 12 |
Hvir: Hydra viridissima A99, Hvul v1: Hydra vulgaris (Chapman ),Hvul v2: Hydra vulgaris (Hydra2.0), Ch: Clytia hemisphaerica, Mv: Morbakka virulenta, Aa: Aurelia aurita, Nv, Nematostella vectensis, Ad: Acropora digitifera.
Number of genes for the subclass of homeodomain-containing proteins in cnidarians
| Class | Medusozoa | Anthozoad | ||||||
|---|---|---|---|---|---|---|---|---|
| Hydrozoa | ||||||||
| ANTP-HOXL | 5 | 7 | 7 | 6 | 13 | 13 | 17 | 9 |
| 9 | 8 | 11 | 20 | 21 | 20 | 65 | 33 | |
| 21 | 16 | 16 | 18 | 25 | 28 | 43 | 31 | |
| 4 | 4 | 5 | 5 | 5 | 5 | 5 | 4 | |
| 4 | 3 | 4 | 5 | 4 | 10 | 6 | 5 | |
| 2 | 2 | 2 | 4 | 5 | 4 | 6 | 4 | |
| 2 | 2 | 2 | 3 | 4 | 4 | 6 | 4 | |
| 1 | 2 | 1 | 1 | 1 | 1 | 1 | 1 | |
| 0 | 0 | 0 | 0 | 0 | 0 | 1 | 2 | |
| 0 | 0 | 0 | 0 | 1 | 0 | 1 | 1 | |
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| 48 | 44 | 48 | 62 | 79 | 85 | 151 | 94 | |
Hvir: Hydra viridissima A99, Hvul v1: Hydra vulgaris (Chapman ), Hvul v2: Hydra vulgaris (Hydra2.0), Ch: Clytia hemisphaerica, Mv: Morbakka virulenta, Aa: Aurelia aurita, Nv, Nematostella vectensis, Ad: Acropora digitifera.
Figure 4ParaHox, Hox and NK genes in cnidarians. The putative Homeobox megacluster in the last common ancestor of cnidarians (top) and homeobox genes and their cluster structures in extant cnidarians are represented. ParaHox genes (green boxes); Hox genes (pink boxes); NK genes (blue boxes). Empty boxes indicate lost genes. Horizontal lines (black) indicate chromosome fragments.