Literature DB >> 24625572

Molecular evidence of demographic expansion of the chagas disease vector Triatoma dimidiata (Hemiptera, Reduviidae, Triatominae) in Colombia.

Andrés Gómez-Palacio1, Omar Triana1.   

Abstract

BACKGROUND: Triatoma dimidiata is one of the most significant vectors of Chagas disease in Central America and Colombia, and, as in most species, its pattern of genetic variation within and among populations is strongly affected by its phylogeographic history. A putative origin from Central America has been proposed for Colombian populations, and high genetic differentiation among three biographically different population groups has recently been evidenced. Analyses based on putatively neutral markers provide data from which past events, such as population expansions and colonization, can be inferred. We analyzed the genealogies of the nicotinamide adenine dinucleotide dehydrogenase 4 (ND4) and the cytochrome oxidase subunit 1-mitochondrial genes, as well as partial nuclear ITS-2 DNA sequences obtained across most of the eco-geographical range in Colombia, to assess the population structure and demographic factors that may explain the geographical distribution of T. dimidiata in this country.
RESULTS: The population structure results support a significant association between genetic divergence and the eco-geographical location of population groups, suggesting that clear signals of demographic expansion can explain the geographical distribution of haplotypes of population groups. Additionally, empirical date estimation of the event suggests that the population's expansion can be placed after the emergence of the Panama Isthmus, and that it was possibly followed by a population fragmentation process, perhaps resulting from local adaptation accomplished by orographic factors such as geographical isolation.
CONCLUSION: Inferences about the historical population processes in Colombian T. dimidiata populations are generally in accordance with population expansions that may have been accomplished by two important biotic and orographic events such as the Great American Interchange and the uplift of the eastern range of the Andes mountains in central Colombia.

Entities:  

Mesh:

Substances:

Year:  2014        PMID: 24625572      PMCID: PMC3953067          DOI: 10.1371/journal.pntd.0002734

Source DB:  PubMed          Journal:  PLoS Negl Trop Dis        ISSN: 1935-2727


Introduction

A population's demographic history as well as phylogeographic inferences are usually accessed by studying the reconstructed genealogical histories of individual genes (gene trees) sampled from different populations [1]–[3]: Studying patterns of genetic variation in a geographical context via gene trees can contribute considerably to our understanding of what factors have influenced geographical population structure and species divergence [4], [5]. Coalescent theory [6] is applied to studies relating to the haplotype frequency, genealogy, and geographical distribution of populations, and has been applied as a useful focus for understanding many events that may have occurred in the past across the demographic history of populations (e.g., population expansion, bottlenecks, vicariance, and migration). Triatoma dimidiata is considered the major vector of Chagas disease in several Central American countries as well as in various regions of Ecuador and Colombia [7]. Across its distribution in Colombia, T. dimidiata occupies a great diversity of habitats, including sylvatic habitats such as palm trees and hollow trees in northern regions, or rock piles, as well as intradomiciliary synanthropic habitats mostly in the country's central Andean departments [8]. Previous studies have suggested that T. dimidiata shows a strong and significant genetic structure related to its original eco-geographical regions in Colombia [9], which, albeit weakly, correlates with an isolation-by-distance model [10]. A preliminary paper on the genetic diversity and population differentiation of T. dimidiata in Colombia was assessed using DNA sequence analysis of the nicotinamide adenine dinucleotide dehydrogenase 4 (ND4) mitochondrial gene, which interestingly suggested a high genetic interpopulation differentiation within Colombia [9]. However, because the sample evaluated was rather small (n = 40), representing only a minimal area of the species distribution, a more exhaustive genetic analysis of several communities of Colombian T. dimidiata was performed by using a microsatellite as well as cytochrome c oxidase subunit 1 (CO1) gene [10]. Here, three major clusters with distinct ecological attributes were distinguished. These three clusters were termed: (i) Inter-Andean Valleys (IAV), harboring a population group located in central Colombia, where T. dimidiata shows more epidemiological relevance and apparently high flow between synanthropic and sylvatic habitats; (ii) the Caribbean Plains (CP) population group, the most widely distributed group from the Caribbean coast to the lowlands of the Central Andean Cordillera, occupying mainly sylvatic habitats; and (iii) the Sierra Nevada de Santa Marta (SNSM) mountain population group, located in the northwestern zone of Colombia, occupying exclusively sylvatic habitats such as palm trees, although a few individuals have also been found sporadically visiting indigenous dwellings and have also been implicated in human Trypanosoma cruzi infections [11], [12]. In a phylogeographic context, according to the evidence addressed by molecular analyses of ITS-2 [13], cytochrome b (cyt b), and ND4 genes [14], Colombian T. dimidiata populations are considered a differentiated form derived from Central American conspecific populations (in fact, it might be considered an additional subspecies or species within T. dimidiata sensu lato, according to the authors) [7], [13]–[16]. Under this hypothesis, Colombian populations are thought to have originated from an ancient population introduced through the Isthmus of Panama [13] after its emergence between 1.9 and 3.8 mya [14], therefore undergoing a wide geographical expansion at a later time that gave birth to the current population structure. Consequently, the aim of the present study was to assess the population structure and history as factors explaining the geographical distribution of population groups of T. dimidiata in Colombia as well as their position in the phylogeographic picture proposed for the species so far. Analysis of the population genetics in the context of the geographic structure suggests demographic processes that occurred in the past. Thus, while the pattern of variation in mtDNA haplotypes allows one to identify geographical distribution differentiation among groups of haplotypes in several populations, it also supports inferences on demographic events that occurred in the past, such as geographical range expansion and population size, according to coalescence theory. In this way, the change in population size through genealogy is reflected by a haplotype network with a star shape [17], an excess of rare mutations resulting in an excess of low-frequency haplotype presence [18], and a unimodal mismatch nucleotide distribution [19], [20]. In this study, we broaden the knowledge of the spatial structure of the three population groups of T. dimidiata in Colombia by analyzing ND4 gene nucleotide sequences obtained from 228 specimens in 22 localities; subsequently, several historical demographic tests were conducted using ND4 combined with previously reported CO1 nucleotide sequences [10] and ITS-2 rDNA. Finally, we also explored the phylogeographic pattern of Colombian populations by including the available ITS-2 and ND4 sequences of Central American and Mexican conspecific populations. Knowledge of population dynamics issues such as geographical dispersion and individual migration between extradomiciliary and domiciliary ecotopes is essential for predicting the success of vector control and surveillance strategies against Chagas disease. In this sense, study of the population structure and demographic history in the most relevant vectors is required for the design of more effective intervention strategies.

Materials and Methods

Insect samples

A total of 228 sequences for the ND4 (624-bp) gene as well as 42 partial sequences (252 bp) for ITS-2 were obtained (Table 1). Individuals were collected in intradomiciliary, peridomiciliary, and sylvatic ecotopes of 22 municipalities from ten departments in Colombia (Table 1 and Figure 1). Bug captures were carried out in 2003–2009 in collaboration with local personnel from the Ministry of Health. Sylvatic samples were collected with live-baited traps [21]. Domiciliary and peridomiciliary collections were made using the traditional manual collection method using a dislodging spray [22] and capture by homeowners. Captures from palm trees were obtained through palm dissection as described elsewhere [23], with consent previously obtained from the landowners. All specimens were identified to the species level using Lent and Wygodzinsky's typological key [24] and kept in 70% ethanol until being processed for DNA extraction.
Table 1

Origins and number of Colombian T. dimidiata samples used.

RegionMap No.DepartmentSampling sites (locality)LongitudeLatitudeAltitude (masl)EcotopeITS-2ND4CO1
Inter-Andean Valleys 1BoyacáSoata*, Urban zone, El Espino−73.13005.08002501I0132
2Soata, La Costa−72.71006.33331710P, S0199
3San Mateo*, Huerta Vieja, Guayabal−72.56436.39272177I164
4Boavita*, Espigón, Lagunillas−72.64126.28381479I122
5Susacon, Urban zone−72.69006.23002488I011
6Tipacoque, Urban zone−72.69536.42332020I2123
7Sativanorte*, Baracuta, La estancia, Datal−72.71146.13442813I361
8El Espino, Urban zone−72.49596.48332148I240
9HuilaPital, San Joaquin−75.81862.2306976I010
10Tarqui, Urban zone−75.82032.1114828I021
11Gigante, Veracruz−75.61012.3233754I011
12SantanderCapitanejo, Chorreras−72.70036.53141086I, P3236
Caribbean Plains 13AntioquiaTurbo, Blanquiset, Villa Eugenia−76.93387.87456S352
14BolívarMargarita, Urban zone−74.29289.145720S324
15San Fernando, Urban zone−74.43859.165318S092
16CesarValledupar*, Armawake, Tamacal−73.239210.4275132P2115
17Valledupar*, Seyminin, Bechungaka, Sabana de Crespo, Donachui, Chemesquemena−73.250610.4769176I54711
18CórdobaSan Bernardo del Viento, Playas del viento−75.95589.35589S210
19Norte de SantanderEl Carmen*, Tierra Azul, Zaragosa, El Chamizón, Santa Rita, Maracaibo−72.87088.0308714I7193
Sierra Nevada de Santa Marta 20La GuajiraDibulla*, Gumake, Umandita, Taminaca−73.294011.1781365S2138
21San Juan del Cesar*, Marocazo, Ulago, Cherua−73.013310.8112423I4119
22MagdalenaSanta Marta*, Guachaca-Cacahualito, Tarapaca, Mendihuaca−73.821911.2117283S22012
Total 42 228 86

Number and geographical origin of Colombian T. dimidiata samples included in the study. Ecotopes: I, intradomiciliary; S, sylvatic; P, peridomiciliary.

* Insects from nearby localities were grouped as one sampling site.

Sequences reported in [10].

Figure 1

Geographical distribution of the sampling sites of the Colombian T. dimidiata populations studied.

Map of Colombia showing the 22 T. dimidiata sampling sites used in differentiation and demographic analyses. Site numbers are detailed in Table 1. Eco-geographical regions correspond to the Inter-Andean Valleys region, the Caribbean Plains region, and the Sierra Nevada de Santa Marta Mountain.

Geographical distribution of the sampling sites of the Colombian T. dimidiata populations studied.

Map of Colombia showing the 22 T. dimidiata sampling sites used in differentiation and demographic analyses. Site numbers are detailed in Table 1. Eco-geographical regions correspond to the Inter-Andean Valleys region, the Caribbean Plains region, and the Sierra Nevada de Santa Marta Mountain. Number and geographical origin of Colombian T. dimidiata samples included in the study. Ecotopes: I, intradomiciliary; S, sylvatic; P, peridomiciliary. * Insects from nearby localities were grouped as one sampling site. Sequences reported in [10].

DNA extraction, PCR amplification, and sequencing

Genomic DNA was obtained from four legs of each insect or from the thorax muscle when necessary (i.e., in old, dead, dry bugs, or those that had lost their legs). DNA extraction was performed according to a previously reported mosquito DNA-extraction protocol [25]. For each specimen, a 614-bp fragment of the ND4 gene was PCR-amplified using ND4-F (5′-TCAACATGAGCCCTTGGAAG-3′) and ND4-R (5′-TAATTCGTTGTCATGGTAATG-3′) primers [9]. PCR reactions for the mitochondrial gene were conducted in a final volume of 35 µl using a 30-ng DNA template, 1× PCR buffer (0.1 M Tris-HCl, 0.5 M KCl, and 0.015 M MgCl2, pH 8.3), 250 µM dNTP, 0.016 µM of each primer, 5 mM MgCl2, and 2 U of Taq DNA polymerase (Promega®). The fragments were amplified with the following thermal cycling conditions: 95°C for 5 min; 35 cycles of 94°C for 30 s, 50°C for 30 s, and 72°C for 60 s; 72°C for 10 min. Because no reproducible and unspecific amplifications in Colombian T. dimidiata specimens were obtained when universal primers 5.8S and 28T [16] were used, we employed the T. dimidiata-specific primers TdITSF (5′-TGGAAATTTTCTGTTGTCCACA-3′) and TdITS-2R (5′-CTTGCTTTATACAACAAGAAGTA-3′) [26] for PCR amplification of a 252-bp fragment of ITS-2 rDNA. PCR reactions were conducted in a final volume of 35 µl using 30-ng DNA templates, 1× PCR buffer (0.1 M Tris-HCl, 0.5 M KCl, and 0.015 M MgCl2, pH 8.3), 250 µM dNTP, 0.025 µM of each primer, 3 mM MgCl2, and 2 U of Taq DNA polymerase (Promega®). After an initial denaturation of 95°C for 5 min, PCR reactions comprised 35 cycles at 95°C for 30 s, 60°C for 30 s, and 72°C for 30 s, followed by a final extension of 72°C for 7 min [26]. All PCR products were sent to Macrogen Inc., Seoul, Korea, for DNA purification and sequencing service. For all samples, sequencing was conducted in both forward and reverse directions.

Sequence analyses and inter-population differentiation

Forward and reverse sequences from specimens were used to generate a consensus sequence with a previous pairwise alignment using the CLUSTALW algorithm [27] implemented in Bioedit v. 7.0.5 [28]. Posterior multiple sequence alignment for each DNA marker was performed using the CLUSTALW algorithm [27]. In the complete data set for ND4 and ITS-2, we evaluated the nucleotide diversity (π), number of haplotypes (h), and haplotype diversity (Hd) using DnaSP v.5.10 [29]. The genetic differentiation among Colombian geographical samples was assessed by Fst comparison, and both nucleotide and haplotype diversity levels were estimated using Hudson's statistics Kst and Hst [30], defining the statistical significance (p<0.001) with a permutation test of 1,000 replicates. A median joining (MJ) haplotype network was used to examine inter-haplotype relationships among the 155 haplotypes of the 228 ND4 sequences as well as for 17 partial ITS-2 haplotypes using default parameters in Network 4.6.0.0 software (http://www.fluxus-engineering.com).

Spatial inter-population structure

Spatial analysis of molecular variance (SAMOVA) was performed to estimate the structure among population groups according to pairwise geographical distances between geographical locations by Fct statistical calculations using SAMOVA v.1.0 [31]. Fct values were estimated for simulated population groups from k = 2 to k = 4 in 1,000 iterations of the data set, which corresponds to the number of eco-geographical regions of T. dimidiata populations suggested in Colombia plus or minus one. The maximized Fct was selected according to the highest significant (p<0.001) value. An interpolation-based graphical method was employed to generate a three-dimensional genetic landscape shape (GLS) within the Alleles in Space (AIS) program [32]. This analysis provides a visual perspective of the spatial distribution of the genetic structure over landscapes, with peaks in areas where pairwise genetic distances between haplotypes from each geographical location are high, and valleys where genetic distances between individuals are low (the x- and y-axes represent latitude and longitude, whereas the z-axis represents genetic distances) [32]. Georeferenced coordinates (Universal Transverse Mercator system) were provided for each individual and analyzed for the ND4 sequences. Additionally, we performed a spatial autocorrelation analysis to test whether there were significant correlations (based on Vendramin et al. [33] correlation index V) between average pairwise genetic distances of haplotypes (Ay) in each spatial class defined according to geographical distances among geographical locations (y). This analysis is illustrated by a distogram where Ay takes on a value of 0 when all individuals within distance class y are genetically identical, and takes on a value of 1 when all individuals in distance class y are completely dissimilar. Spatial autocorrelation analysis was performed on 10 spatial classes with unequal distance and equal sample size (approximately 20 observations per class) in the AIS software [32]. Likewise, to test whether the inter-geographical location structure fits an isolation by distance model, we performed a Mantel test [34] on the pairwise genetic and geographical distance matrices, and the statistical significance (p<0.001) was assessed by a permutation test of 1,000 replicates.

Molecular analysis of population history and timing

The demographic history of the Colombian T. dimidiata was investigated by comparison of mismatch distributions of pairwise nucleotide differences under an expected constant and fluctuating population size in 10,000 generations of coalescent simulations using DNaSP v. 5 [29]. The distribution of mismatch pairwise nucleotide differences was obtained for ND4 haplotypes, as well as combined with CO1 haplotypes (n = 86; sequence size, 1,016 bp), and for partial ITS-2 sequences assuming free recombination. Parameters for a sudden demographic expansion were estimated using the sum of squares deviation (SSD) [35] and Harpending's raggedness index (Rag) [36] implemented in Arlequin v 3.1 [37]. Tests for neutrality were also assessed for access to the demographic history. Fu's Fs [38] and the Ramos-Onsins and Rozas R2 [39] statistics for detecting population growth were estimated under coalescent simulations with 10,000 generations using DNAsp v.5 software [29]. To visualize the effective breeding population size (Ne) fluctuation over time, a Bayesian skyline plot (BSP) analysis [40] was performed as implemented in the BEAST 1.6 package and Tracer v1.5.1 [41]. The starting trees were tested initially for Colombian T. dimidiata using ND4 haplotypes, and combined ND4 and CO1 genes, and then tested for additional haplotype sequences of Central American and Mexican conspecifics for the ND4 (n = 39) gene and ITS-2 (n = 66; Table S1). Trees were obtained using the maximum likelihood (under GTR+G) substitution model after checking according to the Akaike criterion [42] implemented in the jModelTest software [43] with an uncorrelated lognormal relaxed molecular clock assuming one generation per year, as reported for Colombian T. dimidiata individuals [44]. Two separate runs of BMCMC were performed, and a simulated population size (ESS) greater than 200 was obtained using a chain length of 10×106, assuming 10 stepwise control points as the number of coalescent groups.

Phylogenetic relationship

The complete data set of ITS-2 (n = 83) and ND4 (n = 194) haplotypes including Central American and Mexican conspecifics was used to perform a Bayesian inference (BI) Markov Chain Monte Carlo (BMCMC) approach as implemented in the BEAST v.1.6.1 package [41]. The topologies were inferred from the GTR+G substitution model and the model parameters (base frequencies, transition/transversion ratio, rate variation shape parameter) were derived empirically. Metropolis coupling was used with two chains per analysis. BMCMC was run for 10×106 generations, with a sampling frequency of 1,000. Two independent trees were combined using LogCombiner v.1.6.1, and convergence of parameters in the Bayesian analyses was assessed with Tracer v.1.5, after discarding a 10% burn-in. Finally, a majority rule consensus tree (>0.75 posterior probability node support) was calculated from all trees sampled using TreeAnnotator v.1.6.1 in the BEAST v.1.6.1 package [41]. Additionally, a maximum likelihood (ML) tree was estimated using the GTRCAT approximation of substitution model, and the best knowledge likelihood tree (BKLT) was selected via bootstrapping (10,000 replicates) in RAxML-VI-HPC v.2.2.3 [45]. ML tree nodes showing bootstrap support of more than 75% were considered as well supported. Topologies were edited with the FigTree v.1.3.1 software (http://tree.bio.ed.ac.uk). The overall topological match score and a well-supported node match score between IB and ML topologies for both ND4 and ITS-2 markers were calculated using Compare2Trees software [46].

Accession Numbers

All nucleotide sequences are available with GenBank accession codes for ND4: KC489309–KC489463 and ITS-2: KC489292–KC489308.

Results

Genetic differentiation and geographical structure

The ND4 gene analysis at both haplotype and nucleotide diversity levels showed a statistically significant differentiation index among the three eco-geographical groups (Kst = 0.235; p<0.001 and Hst = 0.0166; p<0.001). Moreover, the ITS-2 marker indicated low variability, mostly in the SNSM region (Table 2). From this result, we consider that ND4 offers a better resolution for exploring the spatial structure of Colombian T. dimidiata populations, and therefore ITS-2 was excluded from these analyses. Thus the overall Fst value (0.482; p<0.05) using ND4 indicates high genetic differentiation among eco-geographical groups. Pairwise Fst was 0.592 (p<0.05) between CP and SNSM, 0.588 (p<0.05) between IAV and CP, and 0.512 (p<0.05) between IAV and SNSM.
Table 2

Gene diversity and differentiation between eco-geographical regions of Colombian T. dimidiata.

Gene diversity
ND4ITS-2
RegionNnπ (± SD)hHd (± SD)Nnπ (± SD)hHd (± SD)
Inter-Andean Valleys12900.025 (0.002)790.996 (0.003)6120.011 (0.002)90.939 (0.003)
Caribbean Plains7940.016 (0.002)480.959 (0.011)6220.006 (0.002)80.649 (0.111)
SNSM3440.023 (0.006)300.962 (0.019)380.001 (0.000)20.250 (0.180)
Total 22 228 0.041 (0.001) 155 0.991 (0.002) 15 42 0.007 (0.001) 17 0.702 (0.080)

Summary of genetic diversity indices and pairwise comparison based on haplotype (Hst above diagonal) and nucleotide diversities (Kst below the diagonal) of the ND4 (614-bp) gene and partial sequences of ITS-2 (252-bp) in T. dimidiata from several regions in Colombia. Notations: N: number of localities; n: number of ND4 sequences; π (±SD): nucleotide diversity (SD); h: number of haplotypes and Hd (±SD): haplotype diversity (SD); ns: not significant and *p<0.001.

Summary of genetic diversity indices and pairwise comparison based on haplotype (Hst above diagonal) and nucleotide diversities (Kst below the diagonal) of the ND4 (614-bp) gene and partial sequences of ITS-2 (252-bp) in T. dimidiata from several regions in Colombia. Notations: N: number of localities; n: number of ND4 sequences; π (±SD): nucleotide diversity (SD); h: number of haplotypes and Hd (±SD): haplotype diversity (SD); ns: not significant and *p<0.001. A large number of unique ND4 haplotypes (h = 155), which are likely to be rare or recent in a population, were widely distributed in the haplotype network according to the three eco-geographical groups suggested (Figure 2A; [10]), whereas for partial ITS-2 haplotypes (h = 17; Table S2) a clear star-shaped network with no differentiation among haplotypes from a particular eco-geographical region was observed (Figure 2B). Despite large numbers of hypothetical haplotypes (median vectors that suggest both unsampled or extinct haplotypes) observed in the ND4 network, those haplotypes from sampled sites of each eco-geographical region were closely related in almost all cases (Figure 2A and Table 1). Out of 155 ND4 haplotypes found (Table S3), only 10 were apparently more closely grouped with an unassigned eco-geographical region than indicated, suggesting possible gene flow among regions or retained haplotypes of an ancient origin (Table 1 and Figure 1).
Figure 2

Median-joining haplotype network of the ND4 gene and partial ITS-2 rDNA of T. dimidiata in Colombia.

A) ND4 and B) ITS-2 haplotypes found in the study (Tables S3 and S2, respectively) were included in a network analysis according to the eco-geographical region of origin (Table 1). Haplotype frequency is represented by the size of each node and white nodes represent hypothetical haplotypes (or median vectors). Sampled sites were assigned different colors as coded in the legend.

Median-joining haplotype network of the ND4 gene and partial ITS-2 rDNA of T. dimidiata in Colombia.

A) ND4 and B) ITS-2 haplotypes found in the study (Tables S3 and S2, respectively) were included in a network analysis according to the eco-geographical region of origin (Table 1). Haplotype frequency is represented by the size of each node and white nodes represent hypothetical haplotypes (or median vectors). Sampled sites were assigned different colors as coded in the legend. Spatial analysis of molecular variance performed to assess the substructure within T. dimidiata indicated a significant maximized Fct at k = 3 (0.485; p<0.001), supporting the eco-geographical structure previously evidenced. Collection sites from the Boyacá, Santander, and Huila departments comprised the first group (congruent with the IAV region), sites from Bolívar, Antioquia, Norte de Santander, Córdoba, and Cesar formed the second group (congruent with the CP region), and Magdalena and La Guajira sites formed the third (the SNSM region). Genetic landscape shape interpolation analysis showed that the spatial distribution of haplotype diversity across Colombia was not uniform, as indicated by the presence of peaks and valleys (Figure 3). The lowest pairwise genetic distances in T. dimidiata geographical locations were detected across the CP region, and the highest in both IAV and SNSM (Figure 3).
Figure 3

Landscape shape interpolation of genetic distances within T. dimidiata in geographical space in Colombia.

A graphical interpolation-based representation of the ND4 gene's genetic diversity was made using a 30×50 grid and a distance-weighting parameter of 1 for T. dimidiata in Colombia. The x- and y-axes represent geographical coordinates, while surface heights along the z-axis indicate genetic distance. Peaks and valleys are indicative of areas with high or low pairwise genetic distance between ND4 haplotypes, respectively. The position of sampling sites in the “base” of the graph is the approximate distribution of localities described in Table 1 and Figure 1.

Landscape shape interpolation of genetic distances within T. dimidiata in geographical space in Colombia.

A graphical interpolation-based representation of the ND4 gene's genetic diversity was made using a 30×50 grid and a distance-weighting parameter of 1 for T. dimidiata in Colombia. The x- and y-axes represent geographical coordinates, while surface heights along the z-axis indicate genetic distance. Peaks and valleys are indicative of areas with high or low pairwise genetic distance between ND4 haplotypes, respectively. The position of sampling sites in the “base” of the graph is the approximate distribution of localities described in Table 1 and Figure 1. Spatial autocorrelation analysis showed a low and nonsignificant autocorrelation value for the full data set (V = 0.01; p>0.05), suggesting that nonsignificant clustering of the haplotypes (based on pairwise genetic distance) within each of the 10 spatial classes tested could be inferred (for instance, genetic distances between haplotypes from a spatial class 100 m apart is not significantly lower than estimated between haplotypes ∼425 m apart, as expected under an isolation-by-distance model, see Figure 4A).
Figure 4

Distance-based analyses of population structure of T. dimidiata in Colombia.

A) Distogram of spatial autocorrelation analysis of T. dimidiata from Colombia showing the relationship between pairwise genetic ND4 gene distances (y-axis) and geographical distances (m) in 10 spatial classes (x-axis). Horizontal line indicates the average value of Ay for the data set. Asterisk represents a significant (p<0.05) Miller autocorrelation index (Ay). B) Mantel test of genetic (ND4 gene) and geographical distances (m) between the geographical locations of T. dimidiata analyzed in this study.

Distance-based analyses of population structure of T. dimidiata in Colombia.

A) Distogram of spatial autocorrelation analysis of T. dimidiata from Colombia showing the relationship between pairwise genetic ND4 gene distances (y-axis) and geographical distances (m) in 10 spatial classes (x-axis). Horizontal line indicates the average value of Ay for the data set. Asterisk represents a significant (p<0.05) Miller autocorrelation index (Ay). B) Mantel test of genetic (ND4 gene) and geographical distances (m) between the geographical locations of T. dimidiata analyzed in this study. The bimodal shape observed in the distogram intersect the mean genetic distance value (0.032) around the central geographical distances of the classes, and showed a significant autocorrelation when spatial classes ranking between ∼525 m and ∼625 m (Figure 4A). This results indicates that pairwise genetic differences can be higher than average at both shorter and longer geographical distances, and lower at moderate geographical distances (Figure 4A). Furthermore, genetic distances between haplotypes separated about ∼525 m are significantly lower than those haplotypes separated by ∼625 m (Figure 4A). These results could reflect the geographical range of gene flow among the three eco-geographical population groups described, although further specific analyses of intra- and inter-regional gene flow must be performed. Spatial autocorrelation analysis results were congruent with those of Mantel's test, in which the correlation index between genetic and geographical distances among individuals of the geographical locations analyzed was not significant (r = 0.098; p>0.05), indicating no isolation-by-distance model fit that could explain the geographical structure of T. dimidiata in Colombia (Figure 4B).

Inferences drawn from the population history

The mismatch distribution of ND4, ND4+CO1, and of partial ITS-2 rDNA were fit to expect a mismatch distribution with a fluctuating population size (Figure 5). The goodness of fit of the mismatch distribution between the observed and expected results – under population expansion – was identified (ND4: Rag = 0.003, p>0.05; SSD = 0.003, p>0.05; ND4+CO1: Rag = 0.001, p>0.05; SSD = 0.003, p>0.05; and ITS-2: Rag = 0.019, p>0.05; SSD = 0.002, p>0.05). Thus, the possibility that the expansion model fits cannot be rejected. In addition, significant values for Fs and R2 in the neutrality tests used for detecting population expansion were found in the three sequence data sets, indicating clear signals of population growth in T. dimidiata in Colombia (Table 3).
Figure 5

Mismatch distribution among T. dimidiata haplotypes of Colombia based on A) ND4; B) ND4+CO1 genes, and C) partial ITS-2 rDNA.

Expectations from the stepwise growth model are superimposed (black lines) on the observed values (red line).

Table 3

Diversity and neutrality estimates for ND4, combined ND4 and CO1 genes, and partial ITS-2 of Colombian T. dimidiata.

ND4ND4+CO1ITS-2
h 1558617
S 23428516
k 25.441.671.63
Hd (± SD) 0.991 (0.002)0.998 (0.002)0.702 (0.081)
θ (± SD) 0.066 (0.015)0.043 (0.014)0.015 (0.003)
π (± SD) 0.041 (0.001)0.041 (0.002)0.066 (0.001)
Fs −197.6* −41.76* −13.29*
R2 0.07* 0.09* 0.12*

Notations: h: number of haplotypes; S: number of segregating sites; k: average of pairwise nucleotide differences; Hd (±SD): haplotype diversity (SD); θ (± SD): nucleotide polymorphism (per site); π (±SD): nucleotide diversity (SD); Fs: Fu's Fs [38] neutrality test; R2: Ramos-Onsins and Rozas [39].

*Significant at the p<0.05 level after coalescent simulations (10,000 replicates).

Mismatch distribution among T. dimidiata haplotypes of Colombia based on A) ND4; B) ND4+CO1 genes, and C) partial ITS-2 rDNA.

Expectations from the stepwise growth model are superimposed (black lines) on the observed values (red line). Notations: h: number of haplotypes; S: number of segregating sites; k: average of pairwise nucleotide differences; Hd (±SD): haplotype diversity (SD); θ (± SD): nucleotide polymorphism (per site); π (±SD): nucleotide diversity (SD); Fs: Fu's Fs [38] neutrality test; R2: Ramos-Onsins and Rozas [39]. *Significant at the p<0.05 level after coalescent simulations (10,000 replicates). The Bayesian skyline plot (BSP) of ND4, and combined ND4 and CO1 genes, indicated that T. dimidiata in Colombia seems to have gone through an effective population increase ranging from 1 to ∼4 mya before the present (Figure 6A and 6B). Moreover, the BSP for the ND4 gene including Central American and Mexican isolates supported the suggested population increase approximately 2 to 3 mya before the present (Figures 6C), and the BSP for partial ITS-2 including Central American and Mexican isolates indicated that the population increase was approximately 1.5 mya (Figure 6D).
Figure 6

Bayesian skyline plots showing T. dimidiata population size changes over time.

The relative population size measured as a product-effective population size (y-axis) is shown over time in millions of years (x-axis) in simulated coalescent-based genealogies using A) ND4 Colombian haplotypes, B) combined ND4 and CO1 Colombian haplotypes, C) ND4, and D) partial ITS-2 haplotypes including Central American and Mexican conspecific sequences (Table S1). The thick black line is the median estimate and the solid (blue) interval shows the 95% highest posterior density limits.

Bayesian skyline plots showing T. dimidiata population size changes over time.

The relative population size measured as a product-effective population size (y-axis) is shown over time in millions of years (x-axis) in simulated coalescent-based genealogies using A) ND4 Colombian haplotypes, B) combined ND4 and CO1 Colombian haplotypes, C) ND4, and D) partial ITS-2 haplotypes including Central American and Mexican conspecific sequences (Table S1). The thick black line is the median estimate and the solid (blue) interval shows the 95% highest posterior density limits.

Phylogenetic analysis

The overall topological match score between IB and ML approaches was moderate for both ND4 (65%) and ITS-2 (49.6%), but high match node scores (>75%) were obtained between them for the well-supported clades observed using both markers (Figure 7). This result indicates incomplete congruence of phylogenies was obtained by the IB and ML approaches using both ND4 and ITS-2 markers, but high congruence of the monophyletic clades comprising the T. dimidiata genetic groups [13], [14].
Figure 7

Phylogenetic analyses of T. dimidiata based on A) partial ITS-2 fragment and B) the ND4 gene.

Numbers on nodes indicate posterior probability and bootstrap values (only posterior probabilities >0.75 and Bootstrap values >75% are shown); circled numbers indicate match node scores between BI and ML-based topologies. In B) the names of the three eco-geographical regions described for Colombian T. dimidiata are indicated. Branch color codes and dotted lines indicate each of the main different genetic groups that comprise the T. dimidiata species complex according to Bargues et al., (2008) and Monteiro et al., (2013).

Phylogenetic analyses of T. dimidiata based on A) partial ITS-2 fragment and B) the ND4 gene.

Numbers on nodes indicate posterior probability and bootstrap values (only posterior probabilities >0.75 and Bootstrap values >75% are shown); circled numbers indicate match node scores between BI and ML-based topologies. In B) the names of the three eco-geographical regions described for Colombian T. dimidiata are indicated. Branch color codes and dotted lines indicate each of the main different genetic groups that comprise the T. dimidiata species complex according to Bargues et al., (2008) and Monteiro et al., (2013). The topologies built using the partial ITS-2 showed three main clades (Figure 7A), congruent with the previously reported monophyletic clades using a complete ITS-2 sequence, termed groups 1, 2, and 3 [13]. Within group 1, a low node match score between the IB and ML approaches as well as low node support (posterior probability <0.7 and bootstrap values <75%, data not shown) was observed between Colombian and most Central American haplotypes, which were previously suggested as subgroups 1A and 1B, respectively [13]. This indicates that an inconclusive monophyletic status can be assigned to Colombian and Central American isolates. Moreover, in the ND4 phylogeny four well-supported monophyletic clades showing a high match node score (>95%) were found (Figure 7B). The four clades included haplotypes belonging to the suggested groups III, II, and I [14] plus a secondary clade within group I harboring haplotypes of the Colombian IAV region (Figure 7B).

Discussion

In this study, we used molecular data of mitochondrial and nuclear genealogies to access the Colombian T. dimidiata population history. The geographical structure patterns obtained using mtDNA sequences were congruent with previous results [10]. These results support the suggested eco-geographical structure of T. dimidiata expanding into Colombia, but this structure was not related to the isolation-by-distance model explaining genetic differences in the three main population groups. Instead, a strong signal arguing for a sudden change in population size and range expansion undergone in the recent past after the emergence of the Isthmus of Panama (dating between ∼3.5 mya [47] and ∼10 mya [48]) seems to be a better explanation for assessing the high diversity among T. dimidiata population groups in Colombia. The phylogeographic pattern found here is congruent with that reported for the T. dimidiata species complex, where Central American populations are grouped in a monophyletic clade with Colombian specimens (termed group 1/I according to Bargues et al., 2008; Monteiro et al., 2013), thus suggesting a possibly Mesoamerican origin of Colombian populations. However, the phylogenetic result obtained with the ND4 gene (Figure 7B), where a secondary clade was found within group I, suggests populations from the IAV region could be understood as an explicit case of local adaptation accomplished with geographical dispersion across this area of Colombia. A remarkable issue that needs to be studied further is the domiciliation process shown for the IAV populations, differing from the process undergone by the SNSM and CP populations. This issue requires rigorous ecological approaches to help understand the local forces that shaped the biological diversity of the Colombian T. dimidiata natural populations.

Spatial structure and genetic signals of population expansion in Colombian T. dimidiata

The results reported herein on the genetic structure among the three eco-geographical regions IAV, CP, and SNSM are in agreement with previous studies discussing several epidemiological considerations [9], but they also provide additional information on the spatial picture of this genetic diversity and its possible origin. Pairwise comparison between population groups at both the haplotype (Hst) and nucleotide (Kst) diversity levels indicates that the differential distribution of genetic variability is placed among the IAV, CP, and SNSM regions. Additionally, the results of spatial genetic structure analyses (Fct, genetic landscape shape, spatial autocorrelation, and the Mantel test) make at least two main conclusions possible: (i) there is a heterogenic geographical distribution of genetic diversity and (ii) none of the correlations among genetic divergence and geographical distance explains the geographical group structure of a population. Spatial interpolation of genetic diversity in Colombia shows a bimodal curve-shaped distribution with SNSM and IAV having higher diversities than the CP region, indicating that several microevolutionary processes have been involved in the genetic diversity divergence among population regions, where possible disruptive segregation of ancestral populations of T. dimidiata could be one of many probable causes. An additional observation can be made about this distribution. While T. dimidiata populations in SNSM and IAV occupy mostly diverse mountain and premountain ecosystems located around 1,000 masl, CP is for the most part composed of extended lowlands with dry and warm zones at lower altitudes, given that the departments of the CP region extend from the Colombia–Panama border to the foothills of the west mountain range of the northern Andes. The current eco-geographical structure of Colombian T. dimidiata cannot be understood separately from its history. Unlike the previous hypothesis on the possible isolation of population groups due to geographical distance explaining the genetic differences identified [10], the present results further suggest that the spatial genetic structure in Colombian populations could be the consequence of a recent (after ∼4 mya) sudden increase in population size and range occurring after the Great American Interchange (>4.5 mya), contemporary with the rapid uplift of the Eastern Cordillera of the Andes mountain range in Colombia (dating between 2 and 5 mya) [49], where local adaptations of populations shaped T. dimidiata diversity. With the emergence of the Isthmus of Panama, an important paleozoogeographical event occurred as a product of the Great American Interchange between North America via Central America to South America and vice versa for fauna during the Piacenzian (or after early Miocene [48]) age. We suggest that as consequence of this process, Central American populations colonized the Colombia–Panama border regions, inhabiting mostly hollow trees and palm trees, spreading across the Colombian Caribbean Coast Plains, SNSM, and IAV. Afterward, the concordance of (1) the allopatric separation between the two main population groups (SNSM-CP and IAV) and (2) the uplift of the Eastern Cordillera of the Andes mountains in Colombia as well as the SNSM mountain areas (dating between 2 and 11 mya), dates the beginning of the eco-geographical structure detected in T. dimidiata. We also note that an alternative phylogeographical hypothesis indicating a northern South American origin could be postulated for the Colombian IAV region T. dimidiata populations based on the high genetic diversity. Although Ecuadorian populations are considered to have been passively introduced from Nicaragua [13], [14], southern and central IAV Colombian sylvatic specimens inhabit diverse ecotopes (hollow trees and rock piles) and their high genetic diversity could indicate originating foci for this Colombian population group (or a sibling species according to Monteiro et al., 2013) in the Central Andean Valleys around Colombia. Although an unquestionably close phylogenetic relationship between T. dimidiata sensu lato and the phyllosoma species complex from Mexico [16] supports the Central America origin of this species, we consider further hypotheses should not be completely disregarded. Moreover, a more extensive phylogeographic picture including Andean Triatoma species such as T. maculata and T. dispar should be explored, and additional studies on other factors such as niche differentiation must be considered. Additional nucleotide sequences of T. dimidiata included in this study. (DOCX) Click here for additional data file. ITS-2 Haplotype distribution of Colombian T. dimidiata used in this study. Map numbers and geographic origin are detailed in Table 1. (DOCX) Click here for additional data file. ND4 haplotype distribution of T. dimidiata used in this study. Map numbers and geographic origin are detailed in Table 1. (DOCX) Click here for additional data file.
  38 in total

1.  Estimation of past demographic parameters from the distribution of pairwise differences when the mutation rates vary among sites: application to human mitochondrial DNA.

Authors:  S Schneider; L Excoffier
Journal:  Genetics       Date:  1999-07       Impact factor: 4.562

2.  Intraspecific gene genealogies: trees grafting into networks.

Authors: 
Journal:  Trends Ecol Evol       Date:  2001-01-01       Impact factor: 17.712

3.  Statistical tests of neutrality of mutations against population growth, hitchhiking and background selection.

Authors:  Y X Fu
Journal:  Genetics       Date:  1997-10       Impact factor: 4.562

4.  Infestation by triatomine bugs in indigenous communities of Valledupar, Colombia.

Authors:  Marleny Montilla; Hugo Soto; Edgar Parra; Mariela Torres; Pilar Carrillo; Ligia Lugo; Johana Colorado; Maria Teresa Arias
Journal:  Rev Saude Publica       Date:  2011-06-10       Impact factor: 2.106

5.  The detection of disease clustering and a generalized regression approach.

Authors:  N Mantel
Journal:  Cancer Res       Date:  1967-02       Impact factor: 12.701

Review 6.  Triatoma dimidiata (Latreille, 1811): a review of its diversity across its geographic range and the relationship among populations.

Authors:  Patricia L Dorn; Carlota Monroy; Andrew Curtis
Journal:  Infect Genet Evol       Date:  2006-11-13       Impact factor: 3.342

7.  Identification of a large hybrid zone between sympatric sibling species of Triatoma dimidiata in the Yucatan peninsula, Mexico, and its epidemiological importance.

Authors:  Melba Herrera-Aguilar; Leobardo A Be-Barragán; Maria Jesus Ramirez-Sierra; Frédéric Tripet; Patricia Dorn; Eric Dumonteil
Journal:  Infect Genet Evol       Date:  2009-09-26       Impact factor: 3.342

8.  [Life cycle of Triatoma dimidiata latreille, 1811 (Hemiptera, Reduviidae) under laboratory conditions: production of nymphs for biological tests].

Authors:  Marlene Reyes; Víctor Manuel Angulo
Journal:  Biomedica       Date:  2009-03       Impact factor: 0.935

9.  Phylogeographic pattern and extensive mitochondrial DNA divergence disclose a species complex within the Chagas disease vector Triatoma dimidiata.

Authors:  Fernando A Monteiro; Tatiana Peretolchina; Cristiano Lazoski; Kecia Harris; Ellen M Dotson; Fernando Abad-Franch; Elsa Tamayo; Pamela M Pennington; Carlota Monroy; Celia Cordon-Rosales; Paz Maria Salazar-Schettino; Andrés Gómez-Palacio; Mario J Grijalva; Charles B Beard; Paula L Marcet
Journal:  PLoS One       Date:  2013-08-05       Impact factor: 3.240

10.  Two distinct Triatoma dimidiata (Latreille, 1811) taxa are found in sympatry in Guatemala and Mexico.

Authors:  Patricia L Dorn; Claudia Calderon; Sergio Melgar; Barbara Moguel; Elizabeth Solorzano; Eric Dumonteil; Antonieta Rodas; Nick de la Rua; Roberto Garnica; Carlota Monroy
Journal:  PLoS Negl Trop Dis       Date:  2009-03-10
View more
  8 in total

Review 1.  The Evolutionary Origin of Diversity in Chagas Disease Vectors.

Authors:  Silvia A Justi; Cleber Galvão
Journal:  Trends Parasitol       Date:  2016-12-13

2.  Intraspecific Genomic Divergence and Minor Structural Variations in Leishmania (Viannia) panamensis.

Authors:  Luz H Patino; Marina Muñoz; Carlos Muskus; Claudia Méndez; Juan David Ramírez
Journal:  Genes (Basel)       Date:  2020-02-27       Impact factor: 4.096

3.  Estimating the genetic structure of Triatoma dimidiata (Hemiptera: Reduviidae) and the transmission dynamics of Trypanosoma cruzi in Boyacá, eastern Colombia.

Authors:  Natalia Velásquez-Ortiz; Carolina Hernández; Omar Cantillo-Barraza; Manuel Medina; Mabel Medina-Alfonso; Sandra Suescún-Carrero; Marina Muñoz; Laura Vega; Sergio Castañeda; Lissa Cruz-Saavedra; Nathalia Ballesteros; Juan David Ramírez
Journal:  PLoS Negl Trop Dis       Date:  2022-07-11

Review 4.  American trypanosomiasis, or Chagas disease, in Panama: a chronological synopsis of ecological and epidemiological research.

Authors:  Indra G Rodriguez; Jose R Loaiza
Journal:  Parasit Vectors       Date:  2017-10-10       Impact factor: 3.876

5.  Genetic variation and phylogeography of the Triatoma dimidiata complex evidence a potential center of origin and recent divergence of haplogroups having differential Trypanosoma cruzi and DTU infections.

Authors:  Angélica Pech-May; Carlos Jesús Mazariegos-Hidalgo; Amaia Izeta-Alberdi; Sury Antonio López-Cancino; Ezequiel Tun-Ku; Keynes De la Cruz-Félix; Carlos N Ibarra-Cerdeña; Raúl E González Ittig; Janine M Ramsey
Journal:  PLoS Negl Trop Dis       Date:  2019-01-28

6.  Genetic diversification of Panstrongylus geniculatus (Reduviidae: Triatominae) in northern South America.

Authors:  Valentina Caicedo-Garzón; Fabian C Salgado-Roa; Melissa Sánchez-Herrera; Carolina Hernández; Luisa María Arias-Giraldo; Lineth García; Gustavo Vallejo; Omar Cantillo; Catalina Tovar; Joao Aristeu da Rosa; Hernán J Carrasco; Maikell Segovia; Camilo Salazar; Juan David Ramírez
Journal:  PLoS One       Date:  2019-10-17       Impact factor: 3.240

7.  Genomic Diversification, Structural Plasticity, and Hybridization in Leishmania (Viannia) braziliensis.

Authors:  Luz H Patino; Marina Muñoz; Lissa Cruz-Saavedra; Carlos Muskus; Juan David Ramírez
Journal:  Front Cell Infect Microbiol       Date:  2020-10-16       Impact factor: 5.293

8.  Distribution and natural infection status of synantrophic triatomines (Hemiptera: Reduviidae), vectors of Trypanosoma cruzi, reveals new epidemiological scenarios for chagas disease in the Highlands of Colombia.

Authors:  Omar Cantillo-Barraza; Manuel Medina; Sara Zuluaga; María Isabel Blanco; Rodrigo Caro; Jeiczon Jaimes-Dueñez; Virgilio Beltrán; Samanta Cc Xavier; Omar Triana-Chavez
Journal:  PLoS Negl Trop Dis       Date:  2021-07-19
  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.