| Literature DB >> 31775284 |
Blake T Hovde1, Hajnalka E Daligault1, Erik R Hanschen1, Yuliya A Kunde1, Matthew B Johnson2, Shawn R Starkenburg1, Shannon L Johnson1.
Abstract
The sequenced genome and the leaf transcriptome of a near relative of Abrus pulchellus and Abrus precatorius was analyzed to characterize the genetic basis of toxin gene expression. From the high-quality genome assembly, a total of 26 potential coding regions were identified that contain genes with abrin-like, pulchellin-like, and agglutinin-like homology, with full-length transcripts detected in leaf tissue for 9 of the 26 coding regions. All of the toxin-like genes were identified within only five isolated regions of the genome, with each region containing 1 to 16 gene variants within each genomic region (<1 Mbp). The Abrus precatorius cultivar sequenced here contains genes which encode for proteins that are homologous to certain abrin and prepropulchellin genes previously identified, and we observed substantial diversity of genes and predicted gene products in Abrus precatorius and previously characterized toxins. This suggests diverse toxin repertoires within Abrus, potentially the results of rapid toxin evolution.Entities:
Keywords: Abrus precatorius; Iso-Seq; abrin; full genome sequencing; genome assembly; isoform; pulchellin; toxin evolution
Mesh:
Substances:
Year: 2019 PMID: 31775284 PMCID: PMC6950105 DOI: 10.3390/toxins11120691
Source DB: PubMed Journal: Toxins (Basel) ISSN: 2072-6651 Impact factor: 4.546
Genome assembly and annotation statistics for Abrus precatorius.
| Genome Size | 347,231,436 bp |
|---|---|
| GC% | 31.8% |
| Number of contigs | 344 |
| Contig N50/L50 | 11,837,218/12 |
| Number of scaffolds | 160 |
| Scaffold N50/L50 | 35,860,869/5 |
| Predicted genes | 29,216 |
Figure 1Location of abrin-like genes identified within the Abrus precatorius whole genome assembly. A total 26 Abrin-like genes were identified across four assembled scaffolds of the genome. Regions boxed in red are magnified to show the gene orientation in more detail. Nine of those genes that displayed active transcript evidence are highlighted with an “*”. For each “*” stacked an individual transcript was identified from the pool of full-length transcript sequencing data obtained in this study.
Figure 2Unrooted phylogenetic tree of Abrus abrin and prepropulchellin protein sequences. Independent maximum likelihood (ML) and Bayesian analyses estimated the same tree topology with the exception of the poorly supported placement of ctg3_13282511. Numbers indicate ML bootstrap values and Bayesian posterior probabilities, respectively. Branch lengths and topology shown are from the Bayesian estimation. Novel sequences are denoted with contig number (“ctg”) and start codon position within genome assembly reported in this work (NCBI: GCA_003935025). Each instance of an individually sequenced isoform mapped to each sequence is denoted by a plus sign (18 total abrin-like isoforms were identified from the Iso-Seq sequencing). Accession numbers for reference sequences are included in parentheses. Sequences in gray indicate putative pseudogenes. Genes used in this comparison include Abrin A, Abrin B, Abrin C, Abrin D, prepropuchellin I–IV, and Agglutinin genes published on NCBI GenBank (Tables S1 and S2). The translated alignment file used to generate this tree is available (File S1).
Figure 3Phylogenetic tree of Abrus 18S rRNA sequences. Species sequenced here are emphasized. Numbers indicate independent ML bootstrap values and Bayesian posterior probabilities, respectively. Branch lengths shown are from the ML estimation. Outgroups, Glycyrrhiza (licorice) and Aganope (Ostryocarpus), used in this 18S analysis were two plants within the same family, Fabaceae, as Abrus.