| Literature DB >> 30830203 |
Anna Marcionetti1, Victor Rossier1, Natacha Roux2, Pauline Salis2, Vincent Laudet2, Nicolas Salamin1.
Abstract
Clownfishes are an iconic group of coral reef fishes, especially known for their mutualism with sea anemones. This mutualism is particularly interesting as it likely acted as the key innovation that triggered clownfish adaptive radiation. Indeed, after the acquisition of the mutualism, clownfishes diversified into multiple ecological niches linked with host and habitat use. However, despite the importance of this mutualism, the genetic mechanisms allowing clownfishes to interact with sea anemones are still unclear. Here, we used a comparative genomics and molecular evolutionary analyses to investigate the genetic basis of clownfish mutualism with sea anemones. We assembled and annotated the genome of nine clownfish species and one closely related outgroup. Orthologous genes inferred between these species and additional publicly available teleost genomes resulted in almost 16,000 genes that were tested for positively selected substitutions potentially involved in the adaptation of clownfishes to live in sea anemones. We identified 17 genes with a signal of positive selection at the origin of clownfish radiation. Two of them (Versican core protein and Protein O-GlcNAse) show particularly interesting functions associated with N-acetylated sugars, which are known to be involved in sea anemone discharge of toxins. This study provides the first insights into the genetic mechanisms of clownfish mutualism with sea anemones. Indeed, we identified the first candidate genes likely to be associated with clownfish protection form sea anemones, and thus the evolution of their mutualism. Additionally, the genomic resources acquired represent a valuable resource for further investigation of the genomic basis of clownfish adaptive radiation.Entities:
Keywords: zzm321990 Amphiprionzzm321990 ; anemonefish; coral reef fish; key-innovation; positive selection
Mesh:
Year: 2019 PMID: 30830203 PMCID: PMC6430985 DOI: 10.1093/gbe/evz042
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
. 1.—(A) Phylogenetic relationship of the nine selected clownfish species, Amphiprion frenatus (available from Marcionetti et al. 2018), and the outgroup species Pomacentrus moluccensis. Circles represent the sea anemones species with whom each clownfish can interact (Fautin and Allen 1997). Closely related species with divergent host usages were selected. (B) and (C) show, respectively, A. nigripes and A. ocellaris in their host sea anemone Heteractis magnifica.
. 2.—Examples of gene trees for 1-to-1 OG (A), clownfish-specific duplicated genes (B), and overall multicopy HOGs (C). Mutualism with sea anemones appeared on the ancestral basis of clownfishes (in red in A). Gene were tested for positive selection (ω > 1) on branches specific to all clownfishes (in red in A, B, and C). Gene duplication events are visualized with blue stars.
Genome Assembly and Annotation Statistics for the Nine Assembled Clownfish Species and Pomacentrus moluccensis
| De Novo Assembly | Reference-Guided Assembly | |||||
|---|---|---|---|---|---|---|
| Total assembly size (Mb) | 744 | 798 | 799 | 800 | 800 | 799 |
| Number of scaffolds | 27,951 | 16,543 | 16,953 | 16,995 | 17,050 | 16,941 |
| N50 (bp) | 136,417 | 246,482 | 246,127 | 246,124 | 246,119 | 245,870 |
| non-ATGC characters (%) | 4.6 | 3.6 | 2.8 | 2.7 | 2.7 | 2.9 |
| Paired-ends mapping rate (%) | 95.3 | 98.2 | 98.9 | 98.9 | 99.0 | 99.0 |
| Number of genes | 24,383 | 29,913 | 28,891 | 28,558 | 28,640 | 28,727 |
| Number of proteins | 27,606 | 33,845 | 33,219 | 32,905 | 33,128 | 33,271 |
| Functional annotated proteins (%) | 94.0 | 92.7 | 93.2 | 93.1 | 92.9 | 92.9 |
| CEGMA genes in assembly (%) | 97.2 | 99.6 | 99.6 | 99.6 | 99.6 | 100 |
| BUSCOs genes in annotation (%) | 87 | 93 | 94 | 95 | 95 | 95 |
Reference-guided assembly | ||||||
| Total Assembly Size (Mb) | 801 | 801 | 803 | 797 | 794 | |
| Number of scaffolds | 17,172 | 17,212 | 17,399 | 16,164 | 15,505 | |
| N50 (bp) | 246,052 | 246,037 | 245,703 | 247,121 | 246,470 | |
| non-ATGC characters (%) | 1.9 | 2.0 | 1.4 | 2.9 | 7.9 | |
| Paired-ends mapping rate (%) | 99.0 | 99.0 | 97.4 | 96.9 | 81.2 | |
| Number of genes | 28,730 | 29,014 | 29,408 | 28,170 | 28,885 | |
| Number of proteins | 33,120 | 33,320 | 33,768 | 32,385 | 32,027 | |
| Functional annotated proteins (%) | 93.1 | 92.9 | 92.7 | 93.8 | 94.0 | |
| CEGMA genes in assembly (%) | 99.6 | 99.6 | 99.6 | 99 | 99.6 | |
| BUSCOs genes in annotation (%) | 95 | 94 | 94 | 95 | 89 | |
Note.—For A. ocellaris, statistics of both de novo and reference-guided assemblies are reported. Reference-guided assemblies were obtained using A. frenatus (Marcionetti et al. 2018) as reference genome. N50 index indicates the shortest scaffold length above which 50% of the genome is assembled. CEGMA and BUSCOs genes represent the completeness of the genome assemblies and annotations, respectively.
Results for the Positive Selection Analysis on 1-to-1 OG
| HOG Name | log | log | LRT | Positively Selected Sites (%) | ω | |
|---|---|---|---|---|---|---|
| HOG11195 | −41,547.05 | −41,526.90 | 2.19E-010 | 2.28E-007 | 0.8 | 233.5 |
| HOG16495 | −13,655.66 | −13,642.13 | 1.96E-007 | 1.53E-004 | 0.5 | 999.0 |
| HOG1437 | −16,835.23 | −16,825.39 | 9.19E-006 | 4.79E-003 | 0.3 | 248.0 |
| HOG9295 | −4,960.03 | −4,950.14 | 8.71E-006 | 4.79E-003 | 1.0 | 102.8 |
| HOG5827_3b | −2,138.88 | −2,129.50 | 1.48E-005 | 6.61E-003 | 1.1 | 999.0 |
| HOG11468 | −14,064.81 | −14,055.92 | 2.47E-005 | 7.85E-003 | 0.4 | 760.6 |
| HOG4335_1a | −23,361.23 | −23,352.35 | 2.51E-005 | 7.85E-003 | 0.5 | 340.4 |
| HOG11290 | −10,498.06 | −10,489.15 | 2.42E-005 | 7.85E-003 | 0.4 | 999.0 |
| HOG14257 | −23,503.25 | −23,495.89 | 1.24E-004 | 3.53E-002 | 0.1 | 999.0 |
| HOG16500 | −11,287.91 | −11,280.90 | 1.79E-004 | 3.87E-002 | 0.2 | 999.0 |
| HOG21171 | −69,291.90 | −69,284.86 | 1.75E-004 | 3.87E-002 | 1.3 | 25.3 |
| HOG4262 | −31,942.98 | −31,935.94 | 1.76E-004 | 3.87E-002 | 2.0 | 27.8 |
| HOG16343 | −6,212.65 | −6,205.67 | 1.86E-004 | 3.87E-002 | 0.5 | 359.5 |
Note.—The 13 positively selected genes are reported here, with information on the log-likelihood of the null model (no positive selection) and alternative model (positive selection on the branch leading to clownfishes, fig. 2). Likelihood-ratio test (LRT) P values, multiple-testing corrected q values, the proportion of sites under positive selection on the tested branch (ω classes 2a and 2b) and the corresponding ω values are reported for each gene.
Annotation of the Positively Selected 1-to-1 OG
| HOG Name | SwissProt ID | SwissProt Name |
|---|---|---|
| HOG11195 | P0C5E4 | Phosphatidylinositol phosphatase PTPRQ |
| HOG16495 | Q5RFL7 | Vezatin |
| HOG1437 | Q90953 | Versican core protein |
| HOG9295 | Q3UHZ5 | Leiomodin-2 |
| HOG5827_3b | Q803L0 | Protein lin-28 homolog A |
| HOG11468 | Q9D805 | Calpain-9 |
| HOG4335_1a | Q0ZM14 | Protocadherin-15 |
| HOG11290 | Q92581 | Sodium/hydrogen exchanger 6 |
| HOG14257 | Q8WXG6 | MAP kinase-activating death domain protein |
| HOG16500 | Q9EQQ9 | Protein O-GlcNAcase |
| HOG21171 | Q9TU53 | Cubilin |
| HOG4262 | Q9BYE9 | Cadherin-related family member 2 |
| HOG16343 | P37892 | Carboxypeptidase E |
Results for the Positive Selection on Clownfish-Specific Duplicated Genes
| Node | LRT | Corrected | ω1 | ω2 | |
|---|---|---|---|---|---|
| HOG4655 | |||||
| Node172 | 26.5139 | 0.0001 | 0.0681 (97%) | 46.2 (2.9%) | |
| Node119 | 24.3395 | 0.0004 | 1.00 (98%) | 10,000 (2.2%) | |
| Node70 | 19.7309 | 0.0041 | 0.00 (100%) | 10,000 (0.21%) | |
| HOG5344 | |||||
| Node89 | 23.3766 | 0.0006 | 0.00 (85%) | 15.9 (15%) | |
| Node204 | 20.9201 | 0.002 | 0.0401 (87%) | 11.1 (13%) | |
| Node142 | 17.5192 | 0.0109 | 0.00 (95%) | 10,000 (5.1%) | |
| AMPSE31855 | 15.4114 | 0.0314 | 0.00 (92%) | 92.5 (7.6%) | |
| Node120 | 14.6422 | 0.046 | 0.00 (98%) | 10,000 (2.4%) | |
| HOG5488 | |||||
| ENSDARG00000098394 | 20.6963 | 0.001 | 0.184 (81%) | 111 (19%) | |
| Node63 | 20.8642 | 0.001 | 0.00 (67%) | 9,410 (33%) | |
| Node32 | 15.7701 | 0.0121 | 0.00 (92%) | 10,000 (8.4%) | |
| HOG19886 | |||||
| Node7 | 21.4722 | 0.001 | 0.484 (93%) | 47.7 (7.1%) | |
| Node26 | 17.837 | 0.0061 | 0.439 (95%) | 21.3 (4.9%) |
Note.—We report the nodes with inferred positive selection, the Likelihood Ratio Test (LRT) statistic for selection, the corrected P value and the value of the inferred ω classes, with the proportion of sites in each class. The reported nodes correspond to nodes from the inferred gene trees (supplementary fig. S9, Supplementary Material online).
Annotation of the Clownfish-Specific Duplicated Genes
| HOG Name | SwissProt ID | SwissProt Name |
|---|---|---|
| HOG4655 | P33267 | Cytochrome P450 2F2 |
| HOG5344 | P30568 | Glutathione S-transferase A |
| HOG5488 | P04437 | T-cell receptor alpha chain V |
| HOG19886 | P30122 | Bile salt-activated lipase |