| Literature DB >> 32163422 |
Bruce R Southey1, Sandra L Rodriguez-Zas1,2,3, Justin S Rhodes2,4, Jonathan V Sweedler2,5.
Abstract
The Amphiprion (anemonefish or clownfish) family of teleost fish, which is not a common model species, exhibits multiple unique characteristics, including social control of body size and protandrous sex change. The social changes in sex and body size are modulated by neuropeptide signaling pathways. These neuropeptides are formed from complex processing from larger prohormone proteins; understanding the neuropeptide complement requires information on complete prohormones sequences. Genome and transcriptome information within and across 22 teleost fish species, including 11 Amphiprion species, were assembled and integrated to achieve the first comprehensive survey of their prohormone genes. This information enabled the identification of 175 prohormone isoforms from 159 prohormone proteins across all species. This included identification of 9 CART prepropeptide genes and the loss of insulin-like 5B and tachykinin precursor 1B genes in Pomacentridae species. Transcriptome assemblies generally detected most prohormone genes but provided fewer prohormone genes than genome assemblies due to the lack of expression of prohormone genes or specific isoforms and tissue sampled. Comparisons between duplicate genes indicated that subfunctionalization, degradation, and neofunctionalization may be occurring between all copies. Characterization of the prohormone complement lays the foundation for future peptidomic investigation of the molecular basis of social physiology and behavior in the teleost fish.Entities:
Year: 2020 PMID: 32163422 PMCID: PMC7067429 DOI: 10.1371/journal.pone.0228562
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Evolutionary relationship of 16 Percomorphaceae species studied.
Number and protein sequence similarity of prohormone isoforms identified from genome and transcriptome assemblies.
| Series | Family | Species | Subgenus | Assembly | Deg. Sim. | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| T | BioProject | Version | ||||||||
| E | C | P | M | |||||||
| G | PRJNA407816 | AmpOce1.0 | 35 | 63 | 72 | 5 | ||||
| H | PRJNA515163 | 42 | 34 | 63 | 72 | 6 | ||||
| I | PRJNA515163 | 42 | 27 | 52 | 87 | 9 | ||||
| T | PRJNA374650 | de novo | 26 | 53 | 79 | 17 | ||||
| G | PRJNA436093 | Nemo_v1.1 | 35 | 65 | 70 | 5 | ||||
| T | PRJEB27750 | de novo | 33 | 53 | 78 | 11 | ||||
| G | PRJNA515163 | 42 | 34 | 63 | 72 | 6 | ||||
| T | PRJNA294760 | de novo | 33 | 55 | 61 | 26 | ||||
| T | PRJEB27750 | de novo | 28 | 46 | 70 | 31 | ||||
| G | PRJNA433458 | 43 | 33 | 63 | 72 | 7 | ||||
| G | PRJNA515163 | 42 | 34 | 63 | 72 | 6 | ||||
| T | PRJNA398732 | de novo | 7 | 9 | 53 | 106 | ||||
| G | PRJNA515163 | 42 | 34 | 63 | 72 | 6 | ||||
| G | PRJNA515163 | 42 | 34 | 62 | 72 | 7 | ||||
| T | PRJNA285007 | de novo | 30 | 53 | 60 | 32 | ||||
| G | PRJNA515163 | 42 | 34 | 62 | 71 | 8 | ||||
| G | PRJNA515163 | 42 | 34 | 63 | 72 | 6 | ||||
| G | PRJNA515163 | 42 | 35 | 62 | 71 | 7 | ||||
| G | PRJNA515163 | 42 | 35 | 62 | 71 | 7 | ||||
| G | PRJNA515163 | 42 | 34 | 60 | 73 | 8 | ||||
| G | PRJNA311159 | ASM210954v1 | 35 | 61 | 73 | 6 | ||||
| G | PRJEB12469 | de novo | 33 | 64 | 69 | 9 | ||||
| T | PRJEB27750 | de novo | 27 | 41 | 75 | 32 | ||||
| G | PRJNA89147 | Stegastes_partitus-1.0.2 | 38 | 65 | 67 | 5 | ||||
| G | PRJEB12469 | 47 | 26 | 48 | 91 | 10 | ||||
| G | PRJNA60363 | AstBur1.0 | 32 | 65 | 72 | 6 | ||||
| G | PRJEB5837 | Nfu_20140520 | 24 | 66 | 63 | 22 | ||||
| G | PRJNA345597 | ASM164080v1 | 36 | 75 | 57 | 7 | ||||
| G | PRJNA245366 | ASM74293v1 | 28 | 74 | 59 | 14 | ||||
a Assembly where T: Type of sequence data denoting if the sequence is from the genome (G), alternative genome assembly (H), de novo genome assembly (I) or transcriptome (T); BioProject: NCBI BioProject identifier; Version: provides the assembly identifier, reference for the assembly or denotes that a de novo assembly was used.
b Deg. Sim: Degree of Similarity of sequences across species where Exact (E)–sequences aligned without gaps; Close (C)–sequences aligned with less than 5 consecutive gaps; Partial (P)–more than 5 consecutive gaps within the alignment; Missed (M)–No prohormone isoform annotated.
Fig 2Phylogenetic tree of the fish CART prepropeptide (CARTPT) prohormone protein sequences.
Comparison between different assemblies from the same species.
| Species 1 | Species 2 | Sequence | Gap | Missing | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Name | At | Name | At | Identity | Variant | Both | Sp1 | Sp2 | Both | Sp1 | Sp2 |
| G | H | 50 | 81 | 4 | 7 | 27 | 5 | 0 | 1 | ||
| G | I | 54 | 66 | 6 | 3 | 37 | 5 | 0 | 4 | ||
| H | I | 79 | 27 | 14 | 16 | 30 | 6 | 0 | 3 | ||
| G | T | 80 | 38 | 3 | 14 | 14 | 6 | 0 | 20 | ||
| G | T | 8 | 11 | 3 | 2 | 45 | 6 | 0 | 100 | ||
| G | T | 79 | 32 | 3 | 5 | 38 | 4 | 1 | 13 | ||
| H | T | 37 | 54 | 6 | 22 | 37 | 4 | 2 | 13 | ||
| I | T | 42 | 44 | 13 | 25 | 29 | 4 | 5 | 13 | ||
| G | T | 61 | 66 | 2 | 3 | 32 | 5 | 0 | 6 | ||
| G | T | 76 | 31 | 4 | 11 | 20 | 6 | 1 | 26 | ||
a Sequence: Sequences aligned without gaps with either complete sequence identity or presence of amino acid variants.
b Gap: Sequence aligned with at least one gap in both sequences (Both), only in the first species (Sp1) or only in the second species.
c Missing: No sequence recovered from corresponding assembly from both species (Both), only in the first species (Sp1) or only in the second species.
d Assembly molecular type: G–genome assembly; H–[42] alternative genome assembly; I–[42] de novo genome assembly; T–de novo transcriptome assembly.
Number of prohormone isoforms recovered from the different Amphiprion de novo transcriptome assemblies.
| Complete | Partial | Missing | Total | ||
|---|---|---|---|---|---|
| 0 | 1–2 | 3+ | |||
| 3+ | 0 | 14 | 23 | 3 | 40 |
| 3+ | 1 | 27 | 21 | 1 | 49 |
| 3+ | 3+ | 10 | 11 | 0 | 21 |
| 2 | 0 | 0 | 0 | 2 | 2 |
| 2 | 3+ | 5 | 4 | 7 | 16 |
| 1 | 0 | 0 | 0 | 6 | 6 |
| 1 | 1 | 0 | 0 | 5 | 5 |
| 1 | 2 | 0 | 0 | 5 | 5 |
| 1 | 3+ | 5 | 5 | 6 | 16 |
| 0 | 0 | 0 | 0 | 6 | 6 |
| 0 | 3+ | 4 | 2 | 3 | 9 |
| Total | 65 | 66 | 44 | 175 | |
a Number of prohormone isoforms that were completely recovered from 0, 1, 2 or 3 or more species.
b Number of prohormone isoforms that were partially recovered from 0, 1, 2, or 3 or more species.
c Number of prohormone isoforms that were not recovered from 0 species, 1 or 2 species, or 3 or more species.
Average Mean Protein Evolutionary Distance (MeaPED), normalized Robinson–Foulds metric (nRF) and proportion of edges shared between phylogenetic trees (Pedges) for prohormone families with 6 or more prohormones for Amphiprion, Pomacentridae and All species.
| Family | N isoforms | MeaPED | nRF | Pedges | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| All | All | All | ||||||||
| Calcitonin | 10 | 0.012 | 0.032 | 0.106 | 0.88 | 0.84 | 0.83 | 0.61 | 0.61 | 0.61 |
| CARTPT | 9 | 0.011 | 0.036 | 0.116 | 0.91 | 0.88 | 0.86 | 0.60 | 0.59 | 0.59 |
| Corticotrophin | 5 | 0.007 | 0.036 | 0.085 | 0.88 | 0.87 | 0.83 | 0.61 | 0.59 | 0.61 |
| Endothelin sarafotoxin | 6 | 0.011 | 0.053 | 0.173 | 0.89 | 0.81 | 0.80 | 0.60 | 0.62 | 0.62 |
| Glucagon | 10 | 0.024 | 0.042 | 0.107 | 0.88 | 0.91 | 0.88 | 0.61 | 0.58 | 0.58 |
| Granin | 9 | 0.004 | 0.012 | 0.037 | 0.82 | 0.71 | 0.71 | 0.64 | 0.67 | 0.67 |
| Hepcidin | 7 | 0.047 | 0.189 | 0.605 | 0.85 | 0.90 | 0.87 | 0.63 | 0.59 | 0.60 |
| Insulin | 6 | 0.008 | 0.034 | 0.119 | 0.84 | 0.85 | 0.81 | 0.63 | 0.61 | 0.62 |
| Natriuretic peptide | 6 | 0.007 | 0.044 | 0.141 | 0.87 | 0.83 | 0.78 | 0.61 | 0.62 | 0.63 |
| Neuromedin U (NmU) | 6 | 0.011 | 0.036 | 0.124 | 0.85 | 0.83 | 0.76 | 0.63 | 0.62 | 0.64 |
| Opioid | 8 | 0.006 | 0.024 | 0.071 | 0.82 | 0.78 | 0.74 | 0.64 | 0.64 | 0.65 |
| Parathyroid | 7 | 0.015 | 0.055 | 0.165 | 0.88 | 0.82 | 0.79 | 0.61 | 0.62 | 0.63 |
| PDGF/VEGF growth factor | 16 | 0.004 | 0.016 | 0.054 | 0.88 | 0.83 | 0.78 | 0.61 | 0.62 | 0.63 |
| Relaxin | 6 | 0.031 | 0.071 | 0.253 | 0.85 | 0.86 | 0.74 | 0.63 | 0.60 | 0.66 |
| Somatostatin/Urotensin | 9 | 0.013 | 0.046 | 0.142 | 0.87 | 0.88 | 0.82 | 0.61 | 0.59 | 0.61 |
| Tachykinin | 7 | 0.018 | 0.061 | 0.250 | 0.84 | 0.83 | 0.76 | 0.63 | 0.61 | 0.66 |
| Total | 175 | 0.015 | 0.046 | 0.143 | 0.87 | 0.84 | 0.80 | 0.62 | 0.61 | 0.62 |
Fig 3Alignment of Proprotein Convertase Subtilisin/Kexin Type 1 Inhibitor (PCSK1N) prohormone protein sequences containing the mouse KEP and Little SAAS peptides with mouse cleavage sites highlighted.
Fig 4Alignment of Proprotein Convertase Subtilisin/Kexin Type 1 Inhibitor (PCSK1N) prohormone protein sequences containing the conserved chaperone region, the mouse PEN and Little Len peptides, and mouse cleavage sites highlighted.