| Literature DB >> 27916957 |
Harald M I Kerkkamp1, R Manjunatha Kini2, Alexey S Pospelov3, Freek J Vonk4, Christiaan V Henkel5, Michael K Richardson6.
Abstract
Snake genome sequencing is in its infancy-very much behind the progress made in sequencing the genomes of humans, model organisms and pathogens relevant to biomedical research, and agricultural species. We provide here an overview of some of the snake genome projects in progress, and discuss the biological findings, with special emphasis on toxinology, from the small number of draft snake genomes already published. We discuss the future of snake genomics, pointing out that new sequencing technologies will help overcome the problem of repetitive sequences in assembling snake genomes. Genome sequences are also likely to be valuable in examining the clustering of toxin genes on the chromosomes, in designing recombinant antivenoms and in studying the epigenetic regulation of toxin gene expression.Entities:
Keywords: Malayan pit viper; genome; genomics; king cobra; reptile; snake
Mesh:
Substances:
Year: 2016 PMID: 27916957 PMCID: PMC5198554 DOI: 10.3390/toxins8120360
Source DB: PubMed Journal: Toxins (Basel) ISSN: 2072-6651 Impact factor: 4.546
Snake genome projects published or in progress.
| Trivial Name | Scientific Name | Family | Notes |
|---|---|---|---|
| Prong-snouted blind snake | Typhlopidae | F.J. Vonk et al., in progress | |
| Texas blind snake | Leptotyphlopidae | T.A. Castoe et al., in progress | |
| Boa constrictor | Boidae | Ref. [ | |
| Boa constrictor | Boidae | Ref. [ | |
| Burmese python | Pythonidae | Published [ | |
| Garter snake | Colubridae | GenB: LFLD00000000 | |
| Colubridae | Ref. [ | ||
| Corn snake | Colubridae | Ref. [ | |
| Corn snake | Colubridae | Targeted sequencing: 5′ hox genes [ | |
| King cobra | Elapidae | Published [ | |
| Malayan pit viper | Viperidae | F.J. Vonk et al., in progress | |
| Five-pacer viper | Viperidae | Ref. [ | |
| European adder | Viperidae | Baylor College of Medicine, Human Genome Sequencing Center; GenB: JTGP00000000 | |
| Habu | Viperidae | H. Shibata et al., in progress | |
| Brown spotted pit viper | Viperidae | A.S. Mikheyev et al., in progress; GenB: PRJDB4386 | |
| Prairie rattlesnake | Viperidae | T.A. Castoe et al., in progress | |
| Western diamond-backed rattlesnake | Viperidae | Ref. [ | |
| Timber rattlesnake | Viperidae | GenB: LVCR00000000.1 | |
| Speckled rattlesnake | Viperidae | Ref. [ | |
| Western Diamondback rattlesnake, Mojave rattlesnake and Eastern Diamondback rattlesnake | Viperidae | Targeted sequencing of bacterial artificial chromosome (BAC) clones containing phospholipase A2 genes. | |
| Pygmy rattlesnake | Viperidae | Ref. [ | |
| Temple pit viper | Viperidae | R.M. Kini et al., in progress |
This list is not necessarily exhaustive. Abbreviation: GenB, GenBank accession number. Taxonomy according to the Pubmed Taxonomy database [19].
Selected data from the Burmese python and king cobra draft genomes and comparison with genomes of other species.
| Species | Coding Genes (k) | Genome Size (Gb) | Repeats (%) |
|---|---|---|---|
| Burmese python | 25 [ | 1.44 [ | 31.8–59.4 [ |
| King cobra | 21.19 [ | 1.36–1.59 [ | 35.2–60.4 [ |
| Chicken | 20–23 * [ | 1.05 [ | 4.3–8.0 [ |
| Human | 20.4 ¶; 19 [ | 3.54 ¶ | >66–69 [ |
| Anolis | 18.5 † | 1.70 † | 30% ‡ [ |
* v. 85.4 in ensembl.org gives the number of coding genes in the chicken genome as 15,508; ¶ Human genome, build 38; ensembl.org; † GenBank Assembly ID GCA_000090745.1; ‡ Mobile elements.
Number of copies (paralogues) of toxin genes in the king cobra (Ophiophagus hannah) genome; data from Ref. [3].
| Venom Toxin or Toxin Family | Number of Paralogues |
|---|---|
| 3FTx (three-finger toxin) * | 21 |
| PLA2 (phospholipase A2) * | 12 |
| Lectin * | 11 |
| Kunitz * | 10 |
| Waprins * | 6 |
| Cystatin | 5 |
| CRISP (cysteine-rich secretory protein) | 3 |
| CVF (cobra venom factor) | 3 |
| Kallikrein | 3 |
| SVMP (snake venom metalloproteinase) | 3 |
| LAAO (L-amino acid oxidase) | 2 |
| NGF (nerve growth factor) | 2 |
| NP (natriuretic peptide) * | 2 |
| Acetylcholinesterase | 1 |
| Hyaluronidase | 1 |
| PLB (phospholipase-B) | 1 |
| VEGF (vascular endothelial growth factors) | 1 |
| Vespryn | 1 |
Key: (*) estimated number of paralogues; the current genome assembly is not sufficiently well-scaffolded to allow the number of paralogues to be determined with certainty.
Figure 1Syntenic comparisons of venom genes in the king cobra with other vertebrates revealing toxin recruitment by hijacking/modification and gene duplication. (A) Modification of PLBD1 gene found in the green anole lizard (Anolis carolinensis) and the chicken (Gallus gallus) results in the venom gland expressed phospholipase-B (PLB). Note that PLB is found split across two king cobra genome scaffolds; (B) Modification of HYALP1 gene found in the mouse (Mus musculus) results in the venom gland expressed hyaluronidase (HYAL); (C) Duplication of the non-venom gland expressed ADAM gene in the king cobra results in a venom gland expressed snake venom metalloproteinase (SVMP) gene. The ADAM gene in the green anole is flanked on both sides by non-SVMP genes, demonstrating the absence of gene duplication in this species. Note that subsequent downstream duplication of the SVMP gene in the king cobra results in multiple venom gland expressed SVMP isoforms. Based on Figure S5 from [3].
Figure 2Preliminary analysis of three finger toxin isoforms in the king cobra genome. (a) Phylogeny showing isoform numbers; (b) Expression level (transcript abundance) in the venom gland; (c) Apparent copy number in genome. One hypothesis consistent with the figure is that the more recently expanded paralogues tend to be more highly expressed. The figure is an unpublished analysis by one of us (Christiaan Henkel) based on data in Ref. [3]. See Table 4 for corresponding genome sequencing and accession codes of the three finger toxin isoforms.
King cobra three finger toxin genome sequencing and accession codes. Isoforms correspond with the ones referred to in Figure 2.
| 3FTX Isoform | Nucleotide Sequence | Accession Code Genbank |
|---|---|---|
| Iso1 | GATACACCTTGACATGTCTAACACATGAATCATTATTTTTTGAAACCACTGAGACTTGTTCAGATGGGCAGAACCTATGCTATGCAAAATGGTTTGCAGTTTTTCCAGGTG | AZIM01011044.1 |
| Iso2 | GATACACCAGGATATGCCACAAATCTTCTTTTATCTCTGAGACTTGTCCAGATGGGCAGAACCTATGCTATTTAAAATCGTGGTGTGACATTTTTT | AZIM01016929.1 |
| Iso3 | GATACACCTTGACATGCATCACATCTGCTCGTAACTTTGAGACTTGTCCACCTGGGCAGAACCTATGCTTTTTAAAATCATGGTATGAAGCTTCAT | AZIM01214498.1 |
| Iso4 | TACAAAACCGGTGAACGTATTATTTCTGAGACTTGTCCCCCTGGGCAGGACCTATGCTATATGAAGACTTGGTGTGACGTTTTTT | AZIM01146344.1 |
| Iso5 | GATACACCATGACATGTTACACACAGTACTCATTGTCTCCTCCAACCACTAAGACTTGTCCAGATGGGCAGAACCTATGCTATAAAAGGTGATTTGCGTTTATTCCACATG | AZIM01015434.1 |
| Iso6 | GATACACCACGAAATGCTACGTAACACCTGATGCTACCTCTCAGACTTGTCCAGATGGGGAGAACATATGCTATACAAAGTCTTGGTGTGACGGTTTTT | AZIM01133918.1 |
| Iso7 | GATACACCACGAAATGCTATGTAACACCTGATGCTACCTCTCAGACTTGTCCAGATGGGGAGAACATATGCTATACAAAGTCTTGGTGTGACGTTTTTT | AZIM01229389.1 |
| Iso8 | GATACACCACGAAATGCTACATAACACCTGATGTGAAGTCTCAGACTTGTCCAGATGGGGAGAACATATGCTATACAAAGACTTGGTGTGATGTTTGGT | AZIM01229389.1 |
| Iso9 | GATACACCACGAAATGCTACGTAACACCTGATGTTAAGTCTGAGACTTGTCCAGATGGGCAGGACATATGCTATACAGAGACTTGGTGTGACGTTTGGT | AZIM01028336.1 |
| Iso10 | GATACACCACGAAATGCTACGTAACACCTGATGTTAAGTCTGAGACTTGTCCAGCTGGGCAGGACATATGCTATACAGAGACTTGGTGTGATGCTTGGT | AZIM01097792.1 |
| Iso11 | GACACACCAGGATATGTCTCACAGACTACTCAAAAGTTAGTGAAACCATTGAGATTTGTCCAGATGGGCAGAACTTCTGCTTTAAAAAGTTTCCTAAGGGTATTCCATTTT | AZIM01006046.1 |
| Iso12 | GATACACCATGAAATGTCTCACAAAGTACTCCCGGGTTAGTGAAACCTCTCAGACTTGTCACGTTTGGCAGAACCTATGTTTTAAAAAGTGGCAGAAGG | AZIM01011575.1 |
| Iso13 | GACACACCTTGATATGTGTCAAACAGTACACAATTTTTGGTGTAACCCCTGAGATTTGCGCAGATGGGCAGAACCTATGCTATAAAACATGGCATATGGTGTATCCAGGTG | AZIM01011969.1 |
| Iso14 | GATACACCACGAAATGTTACAACCACCAGTCAACGACTCCTGAAACCACTGAAATTTGTCCAGATTCAGGGTACTTTTGCTATAAAAGCTCTTGGATTGATGGACGTG | AZIM01034614.1 |
| Iso15 | GATACACCCTGATATGTCACCGAGTGCATGGACTTCAGACTTGTGAACCAGATGAGAAGTTTTGCTTTAGAAAGACGACAATGTTTTTTCCAAATC | AZIM01009352.1 |
| Iso16 | GATACACCAGGAAATGTCTCAACACACCGCTTCCTTTGATCTATANTTAAAATGACTATTAAGAAGTTGCCATCTA | AZIM01009586.1 |
| Iso17 | NATACACCAGGATATGTTTAAAGCAAGAGCCATTTCAACCTGAAACCAGTACAACTTGTCCAGATGGGGAAGATGCTTGCTATAGTACATTTTGGAGTGATAACC | AZIM01019523.1 |
| Iso18 | NATACACCAGGATATGTTTAAAGCAAGAGCCGTTTCAACCTGAAACCACTACAACTTGTCCAGAAGGGGAGGATGCTTGCTATAATTTGTTTTGGAGTGATCACA | AZIM01052732.1 |
| Iso19 | GATACAGCTTGATATGTTTTAACCAAGAGACGTATCGACCTGAAACCACTACAACTTGTCCAGATGGGGAGGACACTTGCTATAGTACATTTTGGAATGATCACCATG | AZIM01009977.1 |
| Iso20 | CACAAACCAAGACATGTTACTCATGCACTGGAGCATTTTGTTCTAATCGTCAAAAATGTTCGGGTGGGCAGGTCATATGCTTTAAAAGTTGGAAAAATACTCTTCTGATAT | AZIM01013260.1 |
| Iso21 | CACACACCCTGACATGTTACTCATGCAATGGATTATTATGTTCTGACCGTGAACAATGTCCAGATGGGTAGGACATATGCTTTAAGAGATGGAATGATACTGATTGGTCAG | AZIM01013561.1 |
| Iso22 | GATACAGCTTGACATGTCTCAATTGCCCAGAACAGTATTGTAAAAGAATTCACACTTGTCGAGATGGGGAGAACGTATGCTTTAAAAGGTTTTACGAGGGTAAACTATTAT | AZIM01071124.1 |
| Iso23 | GATACACTCTGTTGTGTTGCAAATGCAATCAAACGGTTTGTGATCTCAATTCGTATTGTTCAGCAGGCAAGAACCAATGCTATATATTGCAGAATAATA | AZIM01008565.1 |