| Literature DB >> 33149734 |
Leandro de Mattos Pereira1,2, Elisa Alves Messias1, Bruna Pereira Sorroche1, Angela das Neves Oliveira1, Lidia Maria Rebolho Batista Arantes1, Ana Carolina de Carvalho1, Anita Mitico Tanaka-Azevedo3, Kathleen Fernandes Grego3, André Lopes Carvalho1, Matias Eliseo Melendez1,4,5.
Abstract
BACKGROUND: Lack of complete genomic data of Bothrops jararaca impedes molecular biology research focusing on biotechnological applications of venom gland components. Identification of full-length coding regions of genes is crucial for the correct molecular cloning design.Entities:
Keywords: Biotechnological application; Bothrops jararaca; Stonustoxin; Transcriptome; Venom gland; Verrucotoxin
Year: 2020 PMID: 33149734 PMCID: PMC7579844 DOI: 10.1590/1678-9199-JVATITD-2019-0058
Source DB: PubMed Journal: J Venom Anim Toxins Incl Trop Dis ISSN: 1678-9180
Statistics of Trinity de novo assembly.
| Global Trinity Stats | |
|---|---|
| Total trinity 'genes' counts | 76 765 |
| Total trinity transcripts counts | 96 044 |
| Percent GC (%) | 43.56 |
|
| |
| Contig N10 | 3 774 |
| Contig N20 | 2 674 |
| Contig N30 | 2 012 |
| Contig N40 | 1 512 |
| Contig N50 | 1 104 |
| Median contig length | 362 |
| Average contig (%) | 675.25 |
| Total assembled bases | 64 853 458 |
|
| |
| Contig N10 | 3 302 |
| Contig N20 | 2 264 |
| Contig N30 | 1 642 |
| Contig N40 | 1 179 |
| Contig N50 | 828 |
| Median contig length | 335 |
| Average contig | 585.97 |
| Total assembled bases | 44 982 090 |
Summary of transcriptome completeness assessment by BUSCO notation.
| Eukaryotic genes | |
|---|---|
| Complete and single-copy BUSCOs | 222 (73.3%) |
| Complete and duplicated BUSCOs | 38 (12.5%) |
| Fragmented BUSCOs | 35 (11.6%) |
| Missing BUSCOs | 8 (2.6%) |
|
|
|
|
| |
| Complete and single-copy BUSCOs | 683 (69.8%) |
| Complete and duplicated BUSCOs | 185 (18.9%) |
| Fragmented BUSCOs | 82 (8.4%) |
| Missing BUSCOs | 28 (2.9%) |
|
|
|
Full-length transcript reconstruction analysis of B. jararaca venom gland transcriptome in relation to Anolis carolinensis, Ophiophagus hannah, Python bivittatus and UniprotKB/Swiss-Prot.
|
|
|
| UniprotKB/Swiss-Prot | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Pct_cov_Hit (%) | PC | PCS | Pct_cov_Hit (%) | PC | PCS | Pct_cov_Hit (%) | PC | PCS | Pct_cov_Hit (%) | PC | PCS |
| 100 | 1438 | 1438 | 100 | 3069 | 3069 | 100 | 4974 | 4974 | 100 | 3739 | 3739 |
| 90 | 367 | 1805 | 90 | 834 | 3903 | 90 | 928 | 5902 | 90 | 1034 | 4773 |
| 80 | 271 | 2076 | 80 | 652 | 4555 | 80 | 805 | 6707 | 80 | 699 | 5472 |
Cumulative number of protein of the A. carolinensis, O. hannah, Python bivittatus and UniprotKB-Swiss-Prot databases recovery by BLASTX that aligned by at least one transcript in the assembly B. jararaca transcriptome across at 80-100 percentage (%) of coverage. The transcripts identified in B. jararaca were annotated as full-length transcripts if they match a protein in the reference proteome database at E-value threshold of 1e-20. Pct_cov_Hit: percentage of coverage of top matching hits of reference proteome that align across more than X% (80-100) with a transcript of B. jararaca. PC: protein counts of target reference proteome that aligned by at least one transcript of B. jararaca. PCS: protein count sum of reference proteins of reference proteome that aligned at X% (80-100) coverage by at least one transcript of B. jararaca.
Figure 1.Top BLAST hit distribution of predicted proteins from of Bothrops jararaca venom gland transcriptome. Recovery by Blast2GO with similarity filter parameter of 55%, E-Value-Hit-Filter: 10-6.
Figure 2.Gene Ontology category classification at level 2 and functional distribution of the transcriptome of B. jararaca performed by Blast2GO. The predicted proteins were functionally mapped according to the three major classifications of Gene Ontology: (A) biological process (BP), (B) molecular function (MF) and (C) cellular component (CC). They wereannotated by setting the following parameters - E-Value-Hit-Filter: 10-6 and others parameters default.
Figure 3.Functional domain annotation of predicted proteins of B. jararaca transcriptome with InterProScan: (A) PFAM, (B) SMART and (C) SUPERFAMILY databases.
Figure 4.Functional class annotation of toxins and accessory family proteins identified in B. jararaca venom using Animal Toxin Annotation Project as reference.
Figure 5.Phylogenetic tree of stonustoxin and verrucotoxin proteins.