| Literature DB >> 31208458 |
Raíssa N Brito1, Juliana A Geraldo2,3, Fernando A Monteiro4, Cristiano Lazoski5, Rita C M Souza1, Fernando Abad-Franch6,7.
Abstract
BACKGROUND: Rhodnius montenegrensis (Triatominae), a potential vector of Chagas disease, was described after R. robustus-like bugs from southwestern Amazonia. Mitochondrial cytb sequence near-identity with sympatric R. robustus (genotype II) raised doubts about the taxonomic status of R. montenegrensis, but comparative studies have reported fairly clear morphological and genetic differences between R. montenegrensis and laboratory stocks identified as R. robustus. Here, we use a transcriptome-based approach to investigate this apparent paradox.Entities:
Keywords: Molecular systematics; Rhodnius; Transcriptomics; Triatominae
Mesh:
Year: 2019 PMID: 31208458 PMCID: PMC6580618 DOI: 10.1186/s13071-019-3558-9
Source DB: PubMed Journal: Parasit Vectors ISSN: 1756-3305 Impact factor: 3.876
Fig. 1Transcriptome-based molecular systematics: a diagrammatic representation of our methodological approach. We determined 10 ITS2 query sequences and deposited them in NCBI’s (National Center for Biotechnology Information) GenBank (broken grey arrow). Other query and outgroup sequences were retrieved from GenBank, and transcriptome read archives (SRAs) from NCBI’s Sequence Read Archive and the European Nucleotide Archive (ENA). The right panel (broken-line box) illustrates some details of the process of generating consensus sequences from raw transcriptome reads plus query sequences and then computing breadth- and depth-coverage (pale-blue histograms behind the consensus sequence). The dotted-line vertical boxes highlight two variable sites: one in which the consensus sequence differs from the query (T/A), even though a minority of reads have T as in the query; and another in which a rare C variant appears in a minority of aligned reads and the consensus sequence, therefore, retains G as in the query
Transcriptome raw data (sequence read archives) used in this study
| Putative species | SRA | ENA run | Platforma | Colony | Material | Notes | Reference |
|---|---|---|---|---|---|---|---|
|
| SRX1996481 | SRR3995415 | HiSeq 2500 | UNESP ‘CTA 88’ | Heads | Pool of 7 males | [ |
| SRX1996482 | SRR3995416 | HiSeq 2500 | UNESP ‘CTA 88’ | Heads | Pool of 6 males | [ | |
|
| SRX1996483 | SRR3995417 | HiSeq 2500 | UNESP ‘CTA 85’ | Heads | Pool of 6 males | [ |
| SRX1996484 | SRR3995418 | HiSeq 2500 | UNESP ‘CTA 85’ | Heads | Pool of 6 males | [ | |
| ERX1387159 | ERR1315266 | HiSeq 2000 | UNESP colony (Peru) | Rostrum + antennae | Pool of 21 females | Unpublishedb | |
| ERX1387160 | ERR1315268 | HiSeq 2000 | UNESP colony (Peru) | Rostrum + antennae | Pool of 19 males | Unpublishedb |
aIllumina, 2 × 100-bp paired-end reads
bCNRS (Centre National de la Recherche Scientifique, France) sequencing project; see [21]
Abbreviations: SRA, transcriptome access code at the Sequence Read Archive, National Center for Biotechnology Information (NCBI); ENA run, transcriptome run access code at the European Nucleotide Archive, European Molecular Biology Laboratory-European Bioinformatics Institute (EMBL-EBI); UNESP, Universidade Estadual Paulista ‘Júlio de Mesquita Filho’, Brazil
Fig. 2Alignment of Rhodnius montenegrensis transcriptome reads to Rhodnius spp. query sequences. The graphs present, for each of three loci (mitochondrial cytb plus nuclear rDNA ITS2, and D2-28S), breath-coverage (% of positions; bold vertical lines; left vertical axis) and mean depth-coverage (reads/position; bars; right vertical axis) for queries against two R. montenegrensis transcriptomes (SRX1996481 and SRX1996482; [7, 12, 13]). Red lines/bars highlight queries yielding full-breadth-coverage; queries yielding only partial-breadth-coverage are black/grey. Abbreviation: UNESP, Universidade Estadual Paulista ‘Júlio de Mesquita Filho’, Brazil
Fig. 3Alignment of Rhodnius robustus transcriptome reads to Rhodnius spp. cytochrome b sequences. Transcriptome reads were retrieved from the benchmark Rhodnius robustus stocks used in R. montenegrensis’ description and in later transcriptome-based comparisons. The graphs present breath-coverage (% of positions; bold vertical lines; left vertical axis) and mean depth-coverage (reads/position; bars; right vertical axis) for queries against four R. robustus transcriptomes (SRX1996483, SRX1996484, ERX1387159 and ERX1387160; [7, 12, 13, 21]). Red lines/bars highlight queries yielding full-breadth-coverage; queries yielding only partial-breadth-coverage are black/grey. Abbreviations: UNESP, Universidade Estadual Paulista ‘Júlio de Mesquita Filho’, Brazil; CNRS, Centre National de la Recherche Scientifique, France
Fig. 4Alignment of Rhodnius robustus transcriptome reads to Rhodnius spp. ribosomal DNA sequences. Transcriptome reads were retrieved from the benchmark Rhodnius robustus stocks used in R. montenegrensis’ description and in later transcriptome-based comparisons; query sequences include two nuclear rDNA loci: ITS2 (upper row) and D2-28S (lower row). The graphs present breath-coverage (% of positions; bold vertical lines; left vertical axis) and mean depth-coverage (reads/position; bars; right vertical axis) for queries against four R. robustus transcriptomes (SRX1996483, SRX1996484, ERX1387159 and ERX1387160; [7, 12, 13, 21]). Red lines/bars highlight queries yielding full-breadth-coverage; queries yielding only partial-breadth-coverage are black/grey. Abbreviations: UNESP, Universidade Estadual Paulista ‘Júlio de Mesquita Filho’, Brazil; CNRS, Centre National de la Recherche Scientifique, France
Full-breadth-coverage consensus sequences generated from transcriptomes determined using UNESP Rhodnius montenegrensis colonies
| Transcriptome | Locus | Query | Depth-coverage | Identity | Divergencea | ||||
|---|---|---|---|---|---|---|---|---|---|
| NCBI SRA | Sequence | Species | Mean | Minimum | < 10 readsb (%) | Percent | Distance | SE | |
| SRX1996481 |
| EF011724.1 | 13,831.35 | 1996 | – | 99.85 | 0.00152 | 0.00144 | |
| AF421341.1 | 5745.50 | 48 | – | 99.55 | 0.00463 | 0.00252 | |||
| EF071583.1 | 4646.05 | 67 | – | 99.70 | 0.00306 | 0.00205 | |||
| EF011720.1 | 4061.95 | 25 | – | 99.85 | 0.00152 | 0.00140 | |||
| KR072682.1 |
| 3298.10 | 12 | – | 100.00 | 0.00000 | 0.00000 | ||
| ITS2c | MK411275 | 24.41 | 3 | 1.6 | 99.59 | 0.00411 | 0.00231 | ||
| D2- | AF435857.1 | 631.85 | 18 | – | 100.00 | 0.00000 | 0.00000 | ||
| AF435858.1 | 334.82 | 2 | 1.3 | 100.00 | 0.00000 | 0.00000 | |||
| AF435859.1 | 199.84 | 1 | 33.0 | 99.68 | 0.00316 | 0.00220 | |||
| SRX1996482 |
| EF011724.1 | 11,834.29 | 1689 | – | 99.85 | 0.00152 | 0.00144 | |
| AF421341.1 | 4834.87 | 32 | – | 99.55 | 0.00463 | 0.00252 | |||
| EF071583.1 | 3450.87 | 47 | – | 99.70 | 0.00306 | 0.00205 | |||
| EF011720.1 | 3203.49 | 13 | – | 100.00 | 0.00000 | 0.00000 | |||
| KR072682.1 |
| 2311.50 | 6 | 0.3 | 100.00 | 0.00000 | 0.00000 | ||
| ITS2c | MK411274 | 20.29 | 1 | 29.6 | 99.32 | 0.00686 | 0.00290 | ||
| D2- | AF435857.1 | 714.15 | 20 | – | 100.00 | 0.00000 | 0.00000 | ||
| AF435859.1 | 211.15 | 1 | 31.3 | 99.68 | 0.00316 | 0.00205 | |||
Notes: Depth-coverage (reads/position) and sequence identity (percent of identical bases) and divergence (model-based genetic distance estimates) of query vs consensus sequences are presented for three loci and two transcriptomes determined by UNESP researchers [12, 13]
aFrom the best-fit models of nucleotide substitution: Tamura three-parameter with γ-distributed rates for cytb (γ = 0.20) and D2-28S (γ = 0.05), and Tamura three-parameter for ITS2; standard errors (SE) computed after 1000 bootstrap pseudo-replicates
bPercent of positions at which depth-coverage was < 10 reads (Additional file 6: Figure S4)
cSee details in Additional file 1: Table S2
Abbreviations: UNESP, Universidade Estadual Paulista ‘Júlio de Mesquita Filho’, Brazil; NCBI, National Center for Biotechnology Information, USA; SRA, transcriptome access code at NCBI’s Sequence Read Archive; cytb, mitochondrial cytochrome b gene; ITS2, second internal transcribed spacer of the nuclear ribosomal DNA; D2-28S, D2 variable region of the nuclear ribosomal DNA
Full-breadth-coverage consensus sequences generated from transcriptomes determined using UNESP ‘Rhodnius robustus’ colonies
| Transcriptome | Colony at UNESP | Locus | Query | Depth-coverage | Identity | Divergencea | |||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Group [refs.] | NCBI SRA | Sequence | Species | Mean | Minimum | < 10 readsb (%) | Percent | Distance | SE | ||
| UNESP [ | SRX1996483 |
| EF011721.1 |
| 13,238.94 | 7978 | – | 99.85 | 0.00152 | 0.00145 | |
| EF011716.1 |
| 13,235.83 | 8029 | – | 99.85 | 0.00152 | 0.00144 | ||||
| AF421339.1 |
| 11,680.80 | 29 | – | 99.70 | 0.00308 | 0.00211 | ||||
| EF011724.1 | 6.18 | 2 | 90.2 | 99.85 | 0.00152 | 0.00144 | |||||
| ITS2d | MK411270 |
| 21.37 | 2 | 42.2 | 100.00 | 0.00000 | 0.00000 | |||
| MK411269 |
| 20.03 | 2 | 36.9 | 100.00 | 0.00000 | 0.00000 | ||||
| MK411271 |
| 19.84 | 1 | 25.1 | 100.00 | 0.00000 | 0.00000 | ||||
| MK411272 |
| 17.30 | 2 | 53.7 | 100.00 | 0.00000 | 0.00000 | ||||
| D2- | JQ897670.1e |
| 2131.15 | 8 | 1.0 | 99.34 | 0.00702 | 0.00407 | |||
| AF435862.1 |
| 2002.57 | 315 | – | 100.00 | 0.00000 | 0.00000 | ||||
| AF435860.1 |
| 1996.21 | 313 | – | 100.00 | 0.00000 | 0.00000 | ||||
| AF435861.1 | 1978.62 | 316 | – | 100.00 | 0.00000 | 0.00000 | |||||
| SRX1996484 |
| EF011721.1 |
| 15,319.04 | 9426 | – | 99.85 | 0.00152 | 0.00145 | ||
| EF011716.1 |
| 15,185.29 | 9224 | – | 99.85 | 0.00152 | 0.00144 | ||||
| AF421339.1 |
| 13,564.95 | 29 | – | 99.70 | 0.00306 | 0.00207 | ||||
| JX273156.1 |
| 8.63 | 1 | 53.5 | 99.40 | 0.00621 | 0.00314 | ||||
| D2- | JQ897670.1e |
| 1319.28 | 6 | 5.7 | 99.68 | 0.00334 | 0.00257 | |||
| AF435861.1 | 1266.95 | 169 | – | 100.00 | 0.00000 | 0.00000 | |||||
| AF435862.1 |
| 1259.78 | 173 | – | 100.00 | 0.00000 | 0.00000 | ||||
| AF435860.1 |
| 1238.28 | 188 | – | 100.00 | 0.00000 | 0.00000 | ||||
| CNRS [ | ERX1387159 |
| EF011721.1 |
| 245.46 | 102 | – | 100.00 | 0.00000 | 0.00000 | |
| AF421339.1 |
| 233.66 | 1 | 2.7 | 100.00 | 0.00000 | 0.00000 | ||||
| EF011716.1 |
| 219.03 | 60 | – | 99.85 | 0.00152 | 0.00144 | ||||
| AF421341.1 | 43.65 | 6 | 10.9 | 100.00 | 0.00000 | 0.00000 | |||||
| ITS2d | MK411271 |
| 78.31 | 21 | – | 100.00 | 0.00000 | 0.00000 | |||
| MK411272 |
| 66.01 | 20 | – | 100.00 | 0.00000 | 0.00000 | ||||
| MK411270 |
| 64.70 | 18 | – | 100.00 | 0.00000 | 0.00000 | ||||
| MK411269 |
| 60.10 | 15 | – | 99.87 | 0.00139 | 0.00130 | ||||
| D2- | AF435861.1 | 896.28 | 28 | – | 100.00 | 0.00000 | 0.00000 | ||||
| AF435860.1 |
| 891.44 | 34 | – | 100.00 | 0.00000 | 0.00000 | ||||
| AF435862.1 |
| 879.61 | 37 | – | 100.00 | 0.00000 | 0.00000 | ||||
| AF435857.1 | 735.23 | 2 | 2.1 | 99.84 | 0.00159 | 0.00154 | |||||
| JQ897670.1e |
| 710.08 | 3 | 6.9 | 99.84 | 0.00165 | 0.00187 | ||||
| ERX1387160 |
| EF011721.1 |
| 222.43 | 136 | – | 100.00 | 0.00000 | 0.00000 | ||
| AF421339.1 |
| 205.40 | 1 | 3.2 | 100.00 | 0.00000 | 0.00000 | ||||
| EF011716.1 |
| 197.09 | 60 | – | 99.85 | 0.00152 | 0.00144 | ||||
| AF421341.1 | 186.42 | 19 | – | 100.00 | 0.00000 | 0.00000 | |||||
| ITS2d | MK411271 |
| 28.28 | 9 | 2.1 | 100.00 | 0.00000 | 0.00000 | |||
| MK411269 |
| 24.03 | 7 | 6.9 | 100.00 | 0.00000 | 0.00000 | ||||
| MK411272 |
| 23.66 | 5 | 14.4 | 100.00 | 0.00000 | 0.00000 | ||||
| MK411270 |
| 22.48 | 5 | 8.6 | 100.00 | 0.00000 | 0.00000 | ||||
| D2- | AF435860.1 |
| 57.74 | 3 | 1.6 | 100.00 | 0.00000 | 0.00000 | |||
| AF435861.1 | 54.91 | 3 | 0.5 | 100.00 | 0.00000 | 0.00000 | |||||
| AF435859.1 | 33.38 | 1 | 34.3 | 100.00 | 0.00000 | 0.00000 | |||||
Notes: Depth-coverage (reads/position) and sequence identity (percent of identical bases) and divergence (model-based genetic distance estimates) of query vs consensus sequences are presented for three loci and four transcriptomes determined by two research groups
aFrom the best-fit models of nucleotide substitution: Tamura three-parameter with γ-distributed rates for cytb (γ = 0.20) and D2-28S (γ = 0.05), and Tamura three-parameter for ITS2; standard errors (SE) computed after 1000 bootstrap pseudo-replicates
bPercent of positions at which depth-coverage was < 10 reads (Additional file 6: Figure S4)
cConsensus sequences with mean depth-coverage < 10 reads/position (Additional file 1: Table S1, Additional file 6: Figure S4) that were excluded from phylogenetic analyses
dSee details in Additional file 1: Table S2
eJQ897670.1 is from a bug identified as Rhodnius neglectus from ‘Orellana, Ecuador’ (see [31]), where neither R. neglectus nor R. prolixus occur (R. robustus II and R. barretti do); this is most likely a case of misidentification or mislabeling of the specimen
Abbreviations: UNESP, Universidade Estadual Paulista ‘Júlio de Mesquita Filho’, Brazil; CNRS, Centre National de la Recherche Scientifique, France; NCBI, National Center for Biotechnology Information, USA; SRA, transcriptome access code at NCBI’s Sequence Read Archive; cytb, mitochondrial cytochrome b gene; ITS2, second internal transcribed spacer of the nuclear ribosomal DNA; D2-28S, D2 variable region of the nuclear ribosomal DNA
Negligible mitochondrial cytochrome b sequence divergence between Rhodnius montenegrensis and R. robustus genotype II
| Sequence | Geography | Divergence from | |||||||
|---|---|---|---|---|---|---|---|---|---|
| GenBank ID | Reference | Locality | State/Province | Country | No. of bases | p-distance | SE | T3p+γ distance | SE |
| EF071583.1 | [ | Not reported | Rondônia | Brazil | 1a | 0.00271 | 0.00258 | 0.00276 | 0.00263 |
| EF011720.1 | [ | Porto Velho | Rondônia | Brazil | 2 | 0.00542 | 0.00383 | 0.00563 | 0.00429 |
| EF011724.1 | [ | Monte Negro | Rondônia | Brazil | 3b | 0.00813 | 0.00427 | 0.00862 | 0.00528 |
| AF421341.1 | [ | Not reported | Napo | Ecuador | 5 | 0.01355 | 0.00524 | 0.01494 | 0.00732 |
Notes: The comparisons involve sequence KR072682.1 (from Rhodnius montenegrensis’ original description [7]) vs R. robustus II sequences from the type locality (Monte Negro), from the same sub-region (state of Rondônia, Brazil), and from a ~2000-km distant area (province of Napo, Ecuador) within western Amazonia
aLikely a base-call error in EF071583.1
bIncluding one probable base-call error in EF011724.1
Abbreviations: p-distance, observed proportion of segregating sites; SE, standard error (from 1000 bootstrap pseudo-replicates); T3p+γ distance, corrected proportion of segregating sites estimated using the best-fit model of nucleotide substitution (Tamura three-parameter model with γ-distributed rates – five categories, γ = 0.20)
Fig. 5Mitochondrial cytb Bayesian phylogenetic tree of members of the Rhodnius prolixus–R. robustus cryptic–species complex. The tree was inferred using query sequences (black font) and the consensus sequences generated from R. montenegrensis (black/blue font) and R. robustus (black/green font) transcriptome-read archives (for which taxon labels include NCBI Sequence Read Archive codes). Rhodnius montenegrensis’ original sequence (KR072682.1; [7]) is highlighted in red font. Two full-breadth-coverage consensus sequences had mean depth-coverage < 10 reads/position and were therefore excluded from these analyses (see Table 4 and Additional file 6: Figure S4). Numbers at nodes are posterior probabilities from Bayesian analysis (BA) and bootstrap support values (1000 pseudo-replicates) for maximum-likelihood (ML) and maximum-parsimony (MP) topologies (see Additional file 3: Figure S1). The scale-bar indicates substitutions per site (from BA)
Fig. 6Nuclear ribosomal ITS2 Bayesian phylogenetic tree of members of the Rhodnius prolixus–R. robustus cryptic–species complex. The tree was inferred using query sequences [black font; new GenBank ID codes for ITS2 sequences generated for this study (MK411269–MK411278) are presented in parentheses] and the consensus sequences generated from R. montenegrensis (black/blue font) and R. robustus (black/green font) transcriptome-read archives (for which taxon labels include the query names and the NCBI Sequence Read Archive codes). Numbers at nodes posterior probabilities from Bayesian analysis (BA) and bootstrap support values (1000 pseudo-replicates) for maximum-likelihood (ML) and maximum-parsimony (MP) topologies (see Additional file 4: Figure S2). The scale-bar indicates substitutions per site (from BA)