| Literature DB >> 28406967 |
Danila Blanco de Carvalho1, Carlos Congrains2, Samira Chahad-Ehlers2, Heloisa Pinotti1, Reinaldo Alves de Brito2, João Aristeu da Rosa1.
Abstract
Chagas disease is one of the main parasitic diseases found in Latin America and it is estimated that between six and seven million people are infected worldwide. Its etiologic agent, the protozoan Trypanosoma cruzi, is transmitted by triatomines, some of which from the genus Rhodnius. Twenty species are currently recognized in this genus, including some closely related species with low levels of morphological differentiation, such as Rhodnius montenegrensis and Rhodnius robustus. In order to investigate genetic differences between these two species, we generated large-scale RNA-sequencing data (consisting of four RNA-seq libraries) from the heads and salivary glands of males of R. montenegrensis and R. robustus. Transcriptome assemblies produced for each species resulted in 64,952 contigs for R. montenegrensis and 70,894 contigs for R. robustus, with N50 of approximately 2,100 for both species. SNP calling based on the more complete R. robustus assembly revealed 3,055 fixed interspecific differences and 216 transcripts with high levels of divergence which contained only fixed differences between the two species. A gene ontology enrichment analysis revealed that these highly differentiated transcripts were enriched for eight GO terms related to AP-2 adaptor complex, as well as other interesting genes that could be involved in their differentiation. The results show that R. montenegrensis and R. robustus have a substantial quantity of fixed interspecific polymorphisms, which suggests a high degree of genetic divergence between the two species and likely corroborates the species status of R. montenegrensis.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28406967 PMCID: PMC5390988 DOI: 10.1371/journal.pone.0174997
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Summary of sequencing and assembly statistics based on cephalic tissue from males of R. montenegrensis (Rm) and R. robustus (Rr).
| Rm | Rr | |
|---|---|---|
| Total reads | 32,502,355 | 33,123,076 |
| Reads filtered by quality | 30,772,029 | 31,528,614 |
| Reads after normalization x 2 reads | 6,543,879 | 7,296,681 |
| Total trinity unigenes | 53,042 | 56,756 |
| Total trinity transcripts | 64,952 | 70,894 |
| Percent GC | 34.66 | 34.68 |
| N50 | 2,083 | 2,134 |
| N50 unigenes | 1,572 | 1,585 |
| Contigs larger than 1000 bp | 19,838 | 21,652 |
| Contigs larger than 2000 bp | 9,903 | 10,918 |
| Mean (bp) | 477 | 480 |
| Median (bp) | 1,046 | 1,057 |
| Total assembled bases | 67,920,294 | 74,955,989 |
| Largest contig (bp) | 32,076 | 30,264 |
Fig 1Gene Ontology level 2 terms with frequencies greater than 1% found for coding regions of unigenes derived from the head transcriptomes of males of R. montenegrensis and R. robustus.
Enrichment analysis of the GO terms of the most divergent unigenes ( = 1).
P-values were corrected using the method proposed by Benjamini and Hochberg [32].
| GO Term | Category Name | Main Category | p-value |
|---|---|---|---|
| GO:0030128 | clathrin coat of endocytic vesicle | Cellular Component | 0.01278887 |
| GO:0030122 | AP2 adaptor complex | Cellular Component | 0.01278887 |
| GO:0045334 | clathrin-coated endocytic vesicle | Cellular Component | 0.02489061 |
| GO:0030669 | clathrin-coated endocytic vesicle membrane | Cellular Component | 0.02489061 |
| GO:0030132 | clathrin coat of coated pit | Cellular Component | 0.02489061 |
| GO:0030666 | endocytic vesicle membrane | Cellular Component | 0.0423901 |
| GO:0030125 | clathrin vesicle coat | Cellular Component | 0.0423901 |
| GO:0005905 | coated pit | Cellular Component | 0.0423901 |
Fig 2Scatter plot of non-synonymous (Ka) and synonymous (Ks) rates among ortholog CDSs expressed in head transcriptomes of males of R. montenegrensis and R. robustus.
Ka/Ks = 1 and and Ka/Ks = 0.5 are shown in red and blue lines, respectively.
Summary of functional annotation and inferred evolutionary rates (Ka, Ks and Ka/Ks) of unigenes in head transcriptomes of R. montenegrensis and R. robustus.
| Annotation | Function | Ka | Ks | Ka/Ks |
|---|---|---|---|---|
| Salivary kazal-type proteinase inhibitor | Anticoagulant, vasodilator and anti-microbial activities [ | 0.00539 | 1.08 x 10−4 | 50 |
| Salivary lipocalin | Transportation of hydrophobic compounds in aqueous biological fluids, and as a binding protein sex pheromone [ | 0.07865 | 6.9827 x 10−2 | 1.12635 |
| Odorant-binding protein rproobp4 precursor | Precursor odor binding protein and / or pheromones [ | 0.002812 | 5.62 x 10−5 | 50 |
| Hemolysin-like secreted salivary protein 2 | Poration toxin and is often described in the microorganism [ | 0.011252 | 2.25 x 10−4 | 50 |
| Heme-binding protein | Prevents heme-induced oxidative damage to lipophorin [ | 0.028048 | 5.61 x 10−4 | 50 |
| Diptericin | Antibacterial peptide [ | 0.046749 | 9.35 x 10−4 | 50 |
| Nitrophorin 3b | Antihaemostatic [ | 0.117836 | 1.12417 | 0.719394 |
Ka: non-synonymous; Ks: synonymous; Ka/Ks. ratio of non-synonymous to synonymous rates.
Fig 3Distribution of absolute allele frequency differences (D) of the 13,696 shared SNPs between R. montenegrensis and R. robustus.
We highlight in red the class of SNPs with D = 1.
Fig 4Absolute allele frequency difference averages of shared SNPs of R. montenegrensis and R. robustus per transcript ().
We highlight in red the class of SNPs with = 1.
Fig 5Distribution of number of SNPs per contig that are 100% fixed for different alleles between R.montenegrensis and R. robustus.
Unigenes highly differentiated and potentially evolved under positive selection (Ka/Ks > 1) in head transcriptomes of R. montenegrensis and R. robustus.
| Contig RR | Annotation | Function | SNP | Ka | Ks | Ka/Ks | |
|---|---|---|---|---|---|---|---|
| c10228_g1_i1 | PREDICTED: zinc finger CCHC-type and RNA-binding motif-containing protein 1-like | Binds to RNA, being involved in many stages of its metabolism [ | 4 | 1 | 3.94 x 10−3 | 7.90 x 10−5 | 50 |
| c10808_g1_i1 | 6 | 1 | 1.86 x 10−2 | 3.7 x 10−4 | 50 | ||
| c11678_g1_fi1 | phosphopentothenoylcysteine decarboxylase | Acts in biosynthesis of coenzyme A [ | 4 | 1 | 6.33 x 10−3 | 1.3 x 10−4 | 50 |
| c13526_g1_i2 | 4 | 1 | 8.62 x 10−3 | 5.28 x 10−3 | 1.63269 | ||
| c13603_g1_i1 | nucleolar phosphoprotein-like | Ribosomal protein encoding and transport, control of centrosome duplication [ | 6 | 0.9993 | 1.59 x 10−2 | 1.09 x 10−2 | 1.45733 |
| c41453_g1_i1 | PREDICTED: uncharacterized protein LOC101741012 isoform X1 | 2 | 0.9969 | 8.44 x 10−3 | 1.7 x 10−4 | 50 | |
| c13063_g1_i1 | hypothetical protein TcasGA2_TC012103 | 7 | 0.9959 | 3.63 x 10−3 | 7.30 x 10−5 | 50 | |
| c10204_g1_i1 | PREDICTED: heme oxygenase 1-like | Catalyzes the degradation of heme. Important for digestion of the blood meal [ | 10 | 0.9949 | 6.67 x 10−3 | 1.3 x 10−4 | 50 |
RR: Rhodnius robustus; SNP: Single Nucleotide Polymorphisms;
: Average of SNP allele frequency differences between species per contig; Ka: rate of non-synonymous substitutions; Ks: rate of synonymous substitutions; Ka/Ks. ratio of non-synonymous and synonymous rates.