Literature DB >> 31308378

Analysis-ready datasets for insecticide resistance phenotype and genotype frequency in African malaria vectors.

Catherine L Moyes1, Antoinette Wiebe2, Katherine Gleave3, Anna Trett3, Penelope A Hancock2, Germain Gil Padonou4, Mouhamadou S Chouaïbou5, Arthur Sovi4,6, Sara A Abuelmaali7, Eric Ochomo8, Christophe Antonio-Nkondjio9, Dereje Dengela10, Hitoshi Kawada11, Roch K Dabire12, Martin J Donnelly3, Charles Mbogo13,14, Christen Fornadel15, Michael Coleman3.   

Abstract

The impact of insecticide resistance in malaria vectors is poorly understood and quantified. Here a series of geospatial datasets for insecticide resistance in malaria vectors are provided, so that trends in resistance in time and space can be quantified, and the impact of resistance found in wild populations on malaria transmission in Africa can be assessed. Specifically, data have been collated and geopositioned for the prevalence of insecticide resistance, as measured by standard bioassays, in representative samples of individual species or species complexes. Data are provided for the Anopheles gambiae species complex, the Anopheles funestus subgroup, and for nine individual vector species. Data are also given for common genetic markers of resistance to support analyses of whether these markers can improve the ability to monitor resistance in low resource settings. Allele frequencies for known resistance-associated markers in the Voltage-gated sodium channel (Vgsc) are provided. In total, eight analysis-ready, standardised, geopositioned datasets encompassing over 20,000 African mosquito collections between 1957 and 2017 are released.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 31308378      PMCID: PMC6629700          DOI: 10.1038/s41597-019-0134-2

Source DB:  PubMed          Journal:  Sci Data        ISSN: 2052-4463            Impact factor:   6.444


Background & Summary

Current malaria control activities are heavily reliant on vector control using insecticides, which means resistance to these compounds has the potential to derail control efforts[1,2]. Studies have started to investigate the impact of resistance in certain situations[3,4] but a full understanding of impact requires comprehensive quantification of resistance. To quantify the factors that influence vector control generally, data from vector populations are required and a number of vector databases are already available for species distributions, infection prevalence, and bionomic parameters[5-12]. A database for insecticide resistance in malaria vectors, that allows users to download analysis-ready datasets, is vital so that the impact of levels of resistance found in wild populations, on malaria transmission, can be assessed. These datasets are also essential to quantify trends in resistance in space and time, filling the gaps in the available data with robust predictions, to aid resistance management and the deployment of interventions designed to counter resistance[13]. Studies of phenotypes in natural populations may be confounded by variation in the environments sampled, including factors linked to climate, land use and malaria control interventions. It is not possible to control for all variables in the natural environment but this issue can, in part, be mitigated by sampling a large number of locations encompassing different combinations of environmental variables. Large, collated datasets do, however, have potential disadvantages. Collated datasets that are a combination of data points representing different types of sample, different measurement methods, different location types and so on, risk undermining any analysis that is performed[6,14]. Each dataset should be constructed to address a specific question or set of questions, and the data within each set needs to be standardised to allow robust analyses. The goal of the current work was to collate data from multiple studies characterising the insecticide resistance phenotype and genotype in communities of malaria vectors at as many locations and times as possible. The first aim was then to generate standardised datasets designed to address specific questions using geospatial analyses. Namely, what are the trends in resistance in time and space in specific vector assemblages, to then assess whether these trends are associated with trends in malaria transmission. The second aim was to provide data that can be used to investigate associations between genetic markers for individual mechanisms of resistance and the insecticide resistance phenotype, to assess whether genetic markers can improve the ability to monitor resistance in low resource settings[15].

Methods

Data sources

Data were obtained from published journal articles, published reports, and unpublished datasets. Published journal articles were identified in the Web of Science bibliographic database by using the search terms “insecticide resistance” and “anopheles” together with the name of each malaria endemic country in turn. The Web of Science was chosen because it incorporates many relevant databases including the SciELO Citation Index from 1997 onwards, MEDLINE from 1950 onwards (from the U.S. Library of Medicine), the Data Citation Index from 1993 onwards (provides details of datasets in international data depositories), the BIOSIS Citation Index from 1969 onwards (covers pre-clinical, experimental, and animal research) and the Web of Science’s own Core Collection from 1945 to date. The earliest date was unrestricted, and the search was completed on 31 December 2017. The initial search yielded 3,685 articles published from 1956 to 2017, with the first African paper published in 1957. Data were extracted from each article as outlined below and 342 articles provided data from field samples of mosquitoes collected in malaria endemic African countries for either the insecticide resistance phenotype and/or genotype. If values for some data fields were missing, the authors were contacted. In these instances, either (i) the phenotype/genotype data was given in the article but supplementary information such as the date of sampling or mosquito identification method was missing, or (ii) the genotype/phenotype data were missing, or had been aggregated across sites or years, so the disaggregated data for each site-year were requested. In the latter instance, any genotype/phenotype data received from the authors were treated as unpublished. In total, 81 sets of authors were contacted about 114 journal articles. Of these, 56 sets of authors provided further information; 30 sets of authors confirmed details such as the collection dates and 26 sets of authors provided test results that were not published with the original articles, as well as confirming any missing details of their study. In addition, agencies reporting on vector surveillance and groups involved in large studies that had not yet published their results, were asked to provide these reports and unpublished datasets. In total, 48 reports and unpublished datasets from African countries were provided. For all unpublished data, permission to include these data in this release was requested. Of a total of 11,057 unpublished data points, permission was received to release 10,834.

Data aggregation/disaggregation

The aim of this work was to provide measures of insecticide resistance for representative samples of a species population (or a species complex or a subgroup) found at a particular time and place, rather than data at the level of an individual mosquito. Replicates from the same mosquito collection sampled at a single “site” and “collection period” were aggregated. The spatial resolution of a “site” was defined by the original field studies and classified by the current study, as described in the data geo-referencing section below. The temporal resolution of a “collection period” was also defined by the original data generators and the duration of each collection period was recorded in the current dataset, as described below. If the reported data had been pooled across multiple sites or collection periods, but was originally obtained at a finer resolution, the disaggregated data for each site-period were requested. For example, if mosquitoes were collected from five sites and bioassayed separately, giving a bioassay result for each site, but only a single average result for the region was published, then the five separate results were requested. The purpose of this disaggregation was to avoid imprecise estimates associated with large areas or long time periods wherever possible. Further details on appropriate methods to analyse data at different resolutions is given in the Usage Notes. Datasets were constructed based on mosquito samples that represented either a single species or a species complex or subgroup. Species-level data were entered wherever it was available and aggregated to provide data for the species complex later, using the original species composition. If insecticide resistance data were provided for each species but the original species composition was not available for that study, the data points for each individual species were included in the species-level datasets (provided they met the inclusion criteria below) but they were not aggregated to provide data for the species complex. Data are provided for individual species within the An. gambiae complex (An. arabiensis, An. coluzzii, An. gambiae, An. melas and An. quadriannulatus) and for species within the An. funestus subgroup (An. funestus and An. parenesis) within datasets 2, 7 and 8. Separately, values for aggregate An. gambiae complex samples and aggregate An. funestus subgroup samples are provided in datasets 1, 3, 4, 5 and 6. The majority of studies provided results for samples of a complex or subgroup, rather than single species (Table 1).
Table 1

Summary of each of the eight data files released.

NumberTitleNo. data points
1Standard WHO susceptibility test results for the Anopheles gambiae complex and Anopheles funestus subgroup.13,618
2Standard WHO susceptibility test results for individual species.3,525
3Standard CDC bottle bioassay results for the An. gambiae complex and An. funestus subgroup.1,061
4Paired WHO susceptibility test or CDC bottle bioassay results with and without a synergist (An. gambiae complex and An. funestus subgroup).1,013
5WHO and CDC intensity bioassay results (An. gambiae complex and An. funestus subgroup).1,816
6Vgsc allele frequencies for the An. gambiae complex and An. funestus subgroup.1,068
7Vgsc allele frequencies for individual species.1,890
8Paired Vgsc allele frequencies from dead and alive subsamples after an insecticide susceptibility test.296
Summary of each of the eight data files released.

Inclusion criteria for data extraction

Subgroup-, complex- or species-level insecticide resistance phenotype data generated from either a WHO susceptibility test[16-26] or a CDC bottle bioassay[27] using either the F0 or F1 generation from a field collection of Anopheles mosquitoes were included (Supplementary Information). Susceptibility tests use two types of control; for the first control, a known susceptible mosquito strain is exposed to the insecticide-treated paper to test that the paper is working, and for the second control, a subsample of the mosquito population being tested are not exposed to the insecticide to check the baseline mortality rate. Data were excluded if the susceptible strain control failed, i.e. mortality in the susceptible strain was <100% indicating that the insecticide-treated paper was not effective. If the first control was successful (all of the mosquitoes died) and a baseline mortality rate was obtained from the second control and used in Abbot’s formula to correct the mortality for the field sample, this corrected mortality value was entered in the database. Subgroup-, complex- or species-level data on the resistance variants in the voltage-gated sodium channel (Vgsc) gene that were derived from F0 or F1 generations from field collections of Anopheles mosquitoes, provided as either genotype or allele frequencies, were included. Only mosquito samples that were representative of a species complex or subgroup and/or a species were included and any samples that were subject to sub-setting that biased the original sample were excluded. For example, if a mixed species sample was collected but a bioassay result was only reported for the most common species, that bioassay result cannot be considered as representative of the species complex at that time and place. In this example, the data were included in the species-level datasets released here, but these values were not included in the datasets for species complexes. Similarly, if a mixed species sample was collected and then the F1 generation was sorted into single species by identifying the mother of each egg batch, those results cannot be considered representative of the species complex at that time and place. Furthermore, if the allele frequency was calculated for mosquitoes that survived a bioassay, and dead mosquitoes were not tested, this result cannot be considered as representative for either the species complex, or the individual species, and was not included in any dataset. If a mixture of dead and alive mosquitoes from a bioassay were tested to obtain an allele frequency, but the ratio of dead:alive was not representative of the original sample, for example 80% died in the bioassay but the sample tested was 50:50 then these data were also excluded.

Individual data files

The full database was used to generate eight individual data files (Table 1) that address specific questions for defined sets of mosquitoes. The aim in creating Data Files 1 and 2 was to provide a set of comparable results for each insecticide from bioassays that had used the same insecticide concentration and exposure duration, however, the recommended concentrations and durations vary with WHO protocol versions[18-26]. The protocol version used to define the standard insecticide concentrations and exposure durations in Data Files 1 and 2 was the 1998 WHO test procedures, because the highest volumes of data across all years were available for the concentrations and durations specified by this protocol version[25]. Insecticides that were not covered by the 1998 protocol version were specified in the 2013 version so this later protocol was also used to set the standard values for Data Files 1 and 2 [26].

Data fields

The data fields included in this release are described in Tables 2–7. The source data fields (Table 2), the sample collection data fields (Table 3), and the geo-location data fields (Table 4) are provided in all data files. The species identification data fields (Table 5) are provided in Data Files 2, 4, 7 and 8. The bioassay data fields (Table 6) are provided in Data Files 1–5. The Vgsc data fields (Table 7) are provided in Data Files 6–8.
Table 2

Data source data fields.

TitleData typeDescription
Source citationTextCitation for the data source. This field is duplicated up to four times to record instances where the full information linked to that data point came from more than one source.
Source typeCategoryEach source is categorised as ‘journal article’, ‘personal communication’ and so on.

These data fields are included in all 8 data files.

Table 7

Vgsc gene data fields.

TitleData typeDescription
Anophelines testedCategoryThe species, or species complex or subgroup, that the genetic result represents.
MethodCategoryThe molecular method used to identify alleles. This field is duplicated three times to record instances where up to three different methods were used on the same sample.
GenerationCategoryThe mosquito generation tested: F0, F1 or a mix of both.
No. mosquitoes testedIntegerThe total number of mosquitoes tested.
Genotype frequencies
L/L (no.)IntegerThe number of mosquitoes homozygous for the wildtype, susceptible allele (1014L).
L/L (%)Decimal numberThe percent of mosquitoes homozygous for the wildtype, susceptible allele (1014L).
L/F (no.)IntegerThe number of mosquitoes heterozygous for the wildtype, susceptible allele (1014L) and the 1014F resistance allele.
L/F (%)Decimal numberThe percent of mosquitoes heterozygous for the wildtype, susceptible allele (1014L) and the 1014F resistance allele.
L/S (no.)IntegerThe number of mosquitoes heterozygous for the wildtype, susceptible allele (1014L) and the 1014S resistance allele.
L/S (%)Decimal numberThe percent of mosquitoes heterozygous for the wildtype, susceptible allele (1014L) and the 1014S resistance allele.
F/F (no.)IntegerThe number of mosquitoes homozygous for the 1014F resistance allele.
F/F (%)Decimal numberThe percent of mosquitoes homozygous for the 1014F resistance allele.
F/S (no.)IntegerThe number of mosquitoes heterozygous for the 1014F and 1014S resistance alleles.
F/S (%)Decimal numberThe percent of mosquitoes heterozygous for the 1014F and 1014S resistance alleles.
S/S (no.)IntegerThe number of mosquitoes homozygous for the 1014S resistance allele.
S/S (%)Decimal numberThe percent of mosquitoes homozygous for the 1014S resistance allele.
Allele frequencies
L1014LDecimal numberThe allele frequency (%) for the 1014L wildtype, susceptible allele.
L1014FDecimal numberThe allele frequency (%) for the 1014F resistance allele.
L1014SDecimal numberThe allele frequency (%) for the 1014s resistance allele.

These data fields are included in Datasets 6–8.

Table 3

Sample collection data fields.

TitleData typeDescription
Capture methodCategoryThe method used to capture the mosquito sample. This field is duplicated four times to record instances where samples from different capture methods were pooled before testing.
Start monthIntegerThe month when the mosquito collection began.
Start yearIntegerThe year when the mosquito collection began.
End monthIntegerThe month when the mosquito collection ended.
End yearIntegerThe year when the mosquito collection ended.

These data fields are included in all 8 data files.

Table 4

Geo-locations data fields.

TitleData typeDescription
CountryCategoryThe country that the site sampled was in.
Site typeCategorySites can be a point, a polygon or multiple-points, as described in the geo-positioning section.
Site nameFree textName of the field site sampled.
LatitudeDecimal numberFor ‘point’ sites only, the geographical coordinates are given in decimal degrees. This field is duplicated multiple times to record instances where samples from more than one ‘point’ site were pooled before testing.
LongitudeDecimal numberFor ‘point’ sites only, the geographical coordinates are given in decimal degrees. This field is duplicated multiple times to record instances where samples from more than one ‘point’ site were pooled before testing.
Admin levelCategoryIf the site is a ‘polygon’ that matches an administrative unit, the administrative level (0, 1 or 2) is recorded.
GAUL codeIntegerIf the site is a ‘polygon’ that matches an administrative unit, the identifier from the Global Administrative Units Layer is recorded.

These data fields are included in all 8 data files and are described further in the data geo-referencing section of the text. GAUL is the Global Administrative Units Layer.

Table 5

Species identification data fields.

TitleData typeDescription
Identification methodCategoryThe molecular method used to identify individual species in the original sample. This field is duplicated to record instances where two methods were used.

These data fields are included in Data Files 2, 4, 7 and 8.

Table 6

WHO and CDC bioassay data fields.

TitleData typeDescription
SpeciesCategoryThe species that the bioassay result represents.
Complex/subgroupCategoryThe species complex or subgroup that the bioassay result represents.
GenerationCategoryThe mosquito generation tested: F0, F1 or a mix of both.
Test protocolCategoryThe WHO or CDC bioassay protocol followed is listed.
InsecticideCategoryThe insecticide tested is named.
Concentration (%)Decimal numberIf a WHO protocol was followed, the insecticide concentration is given as a percent.
Concentration (µg/bottle)Decimal numberIf the CDC protocol was followed, the insecticide concentration is given in µg/bottle.
Exposure period (minutes)IntegerThe period of exposure to the insecticide in minutes.
No. mosquitoes testedIntegerThe total number of mosquitoes tested in all replicates.
No. mosquitoes deadIntegerThe total number of mosquitoes that died in all replicates.
Percent mortalityDecimal numberThe percent of mosquitoes that died across all replicates, adjusted using Abbot’s formula if applicable.

There data fields are included in Data Files 1–5.

Data source data fields. These data fields are included in all 8 data files. Sample collection data fields. These data fields are included in all 8 data files. Geo-locations data fields. These data fields are included in all 8 data files and are described further in the data geo-referencing section of the text. GAUL is the Global Administrative Units Layer. Species identification data fields. These data fields are included in Data Files 2, 4, 7 and 8. WHO and CDC bioassay data fields. There data fields are included in Data Files 1–5. Vgsc gene data fields. These data fields are included in Datasets 6–8. All of the data fields were extracted and entered as they were provided by the data source. If any information was missing, no value was entered (see the missing data section below) and the authors were contacted. The only values that were generated after the data were extracted from the sources, were the geo-location values. Full details on how the geo-location data were generated is given in the next section. Data files are provided for species complexes or subgroups, and for individual species, separately. Bioassay data for individual species were obtained from studies that either sorted egg batches based on mothers’ species prior to a bioassay being performed, or disaggregated the results by species after the bioassay was performed, or instances where all mosquitoes in the original sample were found to be one species after the bioassay was performed. For Data File 4, an additional identifier (the “matched set ID”) is included to allow results from bioassays that used the same mosquito collection and exactly the same bioassay conditions, with or without a synergist, to be identified. The same approach was used for Data File 5 where the “matched set ID” allows results from bioassays that used the same mosquito collection and exactly the same bioassay conditions, but with differing insecticide concentrations and/or exposure durations, to be identified. In total, 453 matched sets are provided in the synergist dataset, 463 in the intensity assay data file, and 148 for Vgsc allele frequencies in paired dead and alive subsamples.

Data geo-referencing

In order to use these data in geospatial models at a resolution of ~5 km, each mosquito collection location was classed as either a point (defined as a site located within a 2.5 arc-minute grid cell, i.e. an area of ~5 × ~5 km) or a polygon (defined as a site with an area greater than that of a point). To determine whether a site should be classified as a point or a polygon, we used all information provided by the data source. This included text describing the site, the site name, and any maps or coordinates provided. If the text described the site as a district, or similar, then we checked that the area of that district matched our definition of a polygon and, if so, used the polygon classification. If coordinates were provided that mapped to the centroid of an administrative unit then we checked whether the site name and text description matched that administrative unit and, if so, classified the site as a polygon (unless the area of the adminstrative unit was less than 5 × 5 km). For all sites defined as ‘points’ the following steps were followed. The site name and all contextual information about the location of the site were noted, for example, the district the site was in, its proximity to a major city or other geographical features, and so on. If the data source provided coordinates, then these were converted to decimal degrees. If no coordinates were provided, the site name was searched in at least two online gazetteers (Google Maps, GeoNames, OpenStreetMap, WikiMapia and so on). All options identified by this search were cross-checked against the contextual information. If only one option matched the contextual information, the coordinates were extracted from the online gazeteer and added to the database. If more than one option matched the contextual information, or no options were found that matched the contextual information, the individuals who published or provided the data were contacted. In these instances, no coordinates were entered without external confirmation. After all possible coordinates were obtained for a study, they were plotted on a map to ensure the data spread for that study matched any information available on the authors’ overarching sampling strategy. For all sites defined as ‘polygons’, any contextual information was noted, such as the province that the district was in. The name of the area in question was searched in the FAO’s Global Administrative Unit Layers (GAUL, http://www.fao.org/geonetwork/srv/en/metadata), using fuzzy matching to allow for different spellings or transliteration, and checked against any available contextual information. If a single administrative unit in GAUL matched the area name and contextual information, the GAUL code (=a unique identifier for that area/polygon), and administrative level for that unit, were extracted and entered in the database. If an administrative unit within GAUL could not be identified, no code was entered. If an individual site could not be located, or could not be precisely located within a 2.5 arc-minute grid cell, then the data point was linked to the second order administrative division that the site falls within. The administrative division was identified using the same method as for polygons above. If multiple point locations were sampled and the mosquitoes were pooled before being tested (or only the pooled results were available), the site type was classified as a ‘multi-point’ and the coordinates for all of the individual point locations were linked to the test result.

Missing data

If data for a particular field was missing from the original data source, the value was recorded as NR, i.e. not reported. For values that were not applicable, rather than missing, NA was used. For example, if only one capture method was used, the value entered for the second capture method was NA. If the geographical coordinates for a site could not be identified (see above), NF was entered, i.e. not found. If a study did not explicitly state the insecticide concentration, exposure period and/or minimum number of mosquitoes used, but did specify the protocol followed, it may be possible to obtain the missing information from the relevant protocol[16-27]. Protocol values for the most commonly used insecticides are provided in Tables 8 and 9, and the values for all insecticides are given in the Supplementary Information, to allow data users to fill these data gaps if they wish.
Table 8

Minimum recommended number of mosquitoes and duration of exposure specified by published protocols for the WHO susceptibility test and CDC bottle bioassay.

ProtocolMin. no. mosquitoesDuration (minutes)aDuration for fenitrothion (minutes)Duration for DDT (minutes)
WHO 1963606060
WHO 197060606060
WHO 197560606060
WHO 197660606060
WHO 1980606012060
WHO 1981606012060
WHO 1986606012060
WHO 1992606012060
WHO 1998806012060
WHO 2013806012060
WHO 2016806012060
CDC bottle bioassay100303045

aThe exposure duration values from the WHO protocols apply to dieldrin, malathion, fenthion, propoxur, chlorphoxim, permethrin, deltamethrin, λ-cyhalothrin, bendiocarb, etofenprox, pirimiphos-methyl, carbosulfan, cyfluthrin, chlorfenapyr, fipronil and α-cypermethrin. The CDC protocol values apply to bendiocarb, cyfluthrin, cypermethrin, deltamethrin, fenitrothion, λ-cyhalothrin, malathion, permethrin and pirimiphos-methyl. Full details can be found in the published protocols.

Table 9

Insecticide concentrations specified by published protocols for the WHO susceptibility test and CDC bottle bioassay.

ProtocolDDTdeltamethrinpermethrinbendiocarbλ-cyhalothrinprimiphos-methyl
WHO 1963M.C.
WHO 1970M.C.
WHO 1975M.C.
WHO 1976M.C.
WHO 19804%0.025%0.25%
WHO 19814%0.025%0.25%0.1%
WHO 19864%0.025%0.25%0.1%
WHO 19924%0.025%0.25%0.1%0.1%
WHO 19984%0.05%0.75%0.1%0.05%
WHO 20134%0.05%0.75%0.1%0.05%0.25%
WHO 20164%0.05%0.75%0.1%0.05%0.25%
CDC bottle bioassay100 µg/bottle12.5 µg/bottle21.5 µg/bottle12.5 µg/bottle12.5 µg/bottle20 µg/bottle

M.C. denotes that multiple concentrations were recommended so the actual concentration used in any particular bioassay cannot be inferred from the protocol version.

Minimum recommended number of mosquitoes and duration of exposure specified by published protocols for the WHO susceptibility test and CDC bottle bioassay. aThe exposure duration values from the WHO protocols apply to dieldrin, malathion, fenthion, propoxur, chlorphoxim, permethrin, deltamethrin, λ-cyhalothrin, bendiocarb, etofenprox, pirimiphos-methyl, carbosulfan, cyfluthrin, chlorfenapyr, fipronil and α-cypermethrin. The CDC protocol values apply to bendiocarb, cyfluthrin, cypermethrin, deltamethrin, fenitrothion, λ-cyhalothrin, malathion, permethrin and pirimiphos-methyl. Full details can be found in the published protocols. Insecticide concentrations specified by published protocols for the WHO susceptibility test and CDC bottle bioassay. M.C. denotes that multiple concentrations were recommended so the actual concentration used in any particular bioassay cannot be inferred from the protocol version.

Data duplication

The data extracted came from several hundred different sources, which introduced the possibility that individual results had been entered into the database more than once. To identify duplicates the following data fields were used: original sample; species tested; date fields; no. mosquitoes tested; no. mosquitoes dead; percent mortality; site name; coordinates. Fuzzy matching was used for all fields to identify duplicates where different levels of aggregation had been used, or different data values were missing, or names were spelled differently. All partial matches were examined to identify genuine duplicates. Duplicate data points were removed, and the source details linked to the single data point that was retained. In total, 3,483 duplicated data points were removed. The final lists of species and insecticides that were included are given in Tables 10 and 11.
Table 10

List of the the species, complexes and subgroups that were tested and are included in the datasets for release.

Species, complexes and subgroups testedComplex/subgroup
An. arabiensis An. gambiae complex
An. coluzzii An. gambiae complex
An. gambiae An. gambiae complex
An. melas An. gambiae complex
An. quadriannulatus An. gambiae complex
An. gambiae complexnot applicable
An. funestus An. funestus subgroup
An. parensis An. funestus subgroup
An. funestus subgroupnot applicable
An. rivulorum An. funestus group
An. mascarensis An. mascarensis group
An. pharoensis not applicable
Table 11

List of the insectides that were tested and are included in the datasets for release.

Insecticides tested
α-cypermethrin
bendiocarb
carbosulfan
chlorpyrifos-methyl
cyfluthrin
DDT
deltamethrin
dieldrin
etofenprox
fenitrothion
fenthion
λ-cyhalothrin
malathion
permethrin
pirimiphos-methyl
propoxur
List of the the species, complexes and subgroups that were tested and are included in the datasets for release. List of the insectides that were tested and are included in the datasets for release.

Data Records

The data are available for download from the Dryad Digital Repository[28]. The spatial and temporal distributions of Data File 1, standard WHO susceptibility test results for the Anopheles gambiae complex and Anopheles funestus subgroup, are shown in Fig. 1. The spatial and temporal distributions of Data File 7, Vgsc allele frequencies for individual species, are shown in Fig. 2.
Fig. 1

Spatial and temporal distributions of Data File 1. (a) The locations of mosquito collections of the An. gambiae complex and the An. funestus subgroup that were used in standard WHO susceptibility tests. (b) The number of data points available for each year for the An. gambiae complex and the An. funestus subgroup.

Fig. 2

Spatial and temporal distributions of Data File 7. (a) The locations of mosquito collections of An. arabiensis, An. coluzzii and An. gambiae that were used to calculate Vgsc allele frequencies. (b) The number of data points available for each year.

Spatial and temporal distributions of Data File 1. (a) The locations of mosquito collections of the An. gambiae complex and the An. funestus subgroup that were used in standard WHO susceptibility tests. (b) The number of data points available for each year for the An. gambiae complex and the An. funestus subgroup. Spatial and temporal distributions of Data File 7. (a) The locations of mosquito collections of An. arabiensis, An. coluzzii and An. gambiae that were used to calculate Vgsc allele frequencies. (b) The number of data points available for each year. Data File 1 is the largest dataset but all eight have similar spatial distributions with clustered sampling in the east and west of Africa and sparse data points in the centre and southwest. They also share similar temporal distributions with phenotypic data volumes increasing throughout the time period particularly from 2008 onwards, and the genotypic data volumes peaking in 2005 and 2010. The genotype data were almost exclusively extracted from published papers and there is typically a lag of around two years between mosquito collection and the publication of a paper containing the test results. In addition to the data extracted for Vgsc allele frequencies, data were also identified for Ace-1 allele frequencies and metabolic mechanisms of resistance including cytochrome P450s, esterases and glutathione-S-transferases. The volumes of genetic and biochemical data currently available for these mechanisms of resistance did not meet our aim of providing standardised data for a large number of locations across Africa, so no collated datasets for these mechanisms were generated. Dataset 4 consists of results from synergist bioassays so it does, therefore, provide data linked to P450-mediated mechanisms of resistance. Many studies performed both bioassays and genetic tests. If links between the different tests performed on the same sample of mosquitoes were provided by the original study, and providing any subsamples tested were not biased, then it was possible to extract pairs of phenotypic and genotypic measures of resistance for samples from a specific time and place. Unfortunately, however, when instances of paired phenotypic and genotypic results for an individual species from a single time and place were extracted, only sixty pairs were identified. This volume of data did not meet our aim of providing standardised data for a large number of locations across Africa. The same was true for paired phenotypic and genotypic results for a species complex or subgroup from a single time and place. In addition, the data volumes available for species-level CDC bottle bioassay results, species-level paired bioassays with and without a synergist, and species-level intensity bioassays, were too low to meet our aim of providing standardised data for a large number of locations across Africa.

Technical Validation

Data were checked for internal consistency to ensure (i) all coordinates for point locations fell on land and in the right country, as defined by GAUL, (ii) mortality and allele frequencies never exceeded 100%, (iii) the collection end date was never earlier than the collection start date, and (iv) the species name tallied with the identification methods listed. A matrix of species identification methods and species identified by each method was prepared in order to complete this check (Supplementary Information). In addition, a second person reviewed the geographical coordinates in accordance with the geo-locations protocol outlined above.

Usage Notes

Each data file released has been designed to provide results for a representative sample of a species complex or subgroup, or an individual species, so users can be confident of what each set of results represents. Older versions of the collated datasets of WHO susceptibility test results and the Vgsc allele frequency have been used in a prior geostatistical analysis that aimed to identify associations among resistance to different insecticides in the An. gambiae species complex[15]. The data files have been designed for use in geospatial analyses and, in such analyses, the precise location for each data point is important for two reasons. First, because this allows accurate calculation of the Euclidian distances between points for analyses that exploit spatial correlations in the data[15,29]. Secondly, precise location information allows accurate matching of the data to a wide range of environmental variables, such as climatic, socio-economic and intervention variables, to exploit relationships between the biological data and these environmental variables[30,31]. The use of data linked to wider areas is a current area of research aimed at improving model predictions in circumstances where data linked to precise locations are particularly sparse[32,33]. For any kind of spatial analysis, it is essential to know whether the geographical coordinates provided represent a precise location or wider area, what the definition of a precise location is, and where the boundaries of the wider areas lie. The data points released here are linked to a mixture of precise locations and wider areas, the precise locations (referred to here as points) are defined as an area within a 2.5 arc-minute grid cell (approx. 5 × 5 km), and links to the boundaries of wider areas (referred to here as polygons) are given. The data files released here are not the result of one single, continent-wide study that used a standard sampling design. It is a compilation of many studies that used many designs and incorporates obvious sampling bias. Sites that are more easily accessed or closer to research centres may be more likely to be sampled. Sites where high levels of resistance are expected may also be more likely to be sampled, as might sites where insecticide-based interventions are planned as a result of a combination of related variables. Geostatistical models can, however, be used to model sampling intensity to check these biases before proceeding further[34]. Other data resources for insecticide resistance in malaria vector are available. Data on insecticide resistance in the Anopheles vectors of malaria are available from VectorBase, however, VectorBase’s aim and scope are much broader than those of the current data release and the data volumes for insecticide resistance in Anopheles vectors are smaller than those provided here[35]. Furthermore, these data have not been configured specifically for use in mathematical analyses including geospatial analyses. Insecticide resistance data can also be viewed on interactive maps using the IR Mapper and Malaria Threats websites but these are data visualisation tools[36]. The data shown on these sites were not collated in support of mathematical analyses and are not available for download. There are overlaps in all of these databases, including overlaps with the data being released here. The data released here includes data that were provided to the World Health Organization to support the establishment of the Malaria Threats website[37]. In addition, the data being released here were shared with the group behind the IR Mapper site so that both groups could cross-check each other’s sources to identify publications that had been missed. A geo-database of insecticide resistance in the Aedes vectors of arboviruses has previously been released but this has much smaller data volumes for Africa, and encompasses a much greater range of insecticides tested at a greater range of concentrations on both adults and larvae[38]. In contrast, the data released here provide sufficient volumes of standardised values to support a range of analyses of insecticide resistance in malaria vectors in Africa and are freely available to all. In addition to the current data release, these data have been shared with the Pan Africa Mosquito Control Association to support the establishment of an Africa -led and –managed data resource. The datasets released here will also be available for download from the IR Mapper website [www.irmapper.com]. In addition, predicted values for the prevalence of resistance (i.e. mortality in a standard WHO susceptibility test) at every location in a ~5 km resolution grid, for each year from 2005 to 2017, will be modelled and released in the coming months.
Design Type(s)data integration objective
Measurement Type(s)insecticide resistance
Technology Type(s)digital curation
Factor Type(s)geographic location • temporal_interval
Sample Characteristic(s)gambiae species complex • Africa • funestus subgroup • Anopheles gambiae • Anopheles arabiensis • Anopheles rivulorum • Anopheles pharoensis • Anopheles mascarensis • Anopheles funestus • Anopheles coluzzii • Anopheles melas • Anopheles quadriannulatus • Anophelinae
  28 in total

1.  Bayesian geostatistical modelling with informative sampling locations.

Authors:  D Pati; B J Reich; D B Dunson
Journal:  Biometrika       Date:  2011-03       Impact factor: 2.445

Review 2.  Implementation of the global plan for insecticide resistance management in malaria vectors: progress, challenges and the way forward.

Authors:  Abraham P Mnzava; Tessa B Knox; Emmanuel A Temu; Anna Trett; Christen Fornadel; Janet Hemingway; Melanie Renshaw
Journal:  Malar J       Date:  2015-04-23       Impact factor: 2.979

3.  Corrigendum: The contemporary distribution of Trypanosoma cruzi infection in humans, alternative hosts and vectors.

Authors:  Annie J Browne; Carlos A Guerra; Renato Vieira Alves; Veruska Maia da Costa; Anne L Wilson; David M Pigott; Simon I Hay; Steve W Lindsay; Nick Golding; Catherine L Moyes
Journal:  Sci Data       Date:  2017-05-30       Impact factor: 6.444

Review 4.  Converging Human and Malaria Vector Diagnostics with Data Management towards an Integrated Holistic One Health Approach.

Authors:  Konstantinos Mitsakakis; Sebastian Hin; Pie Müller; Nadja Wipf; Edward Thomsen; Michael Coleman; Roland Zengerle; John Vontas; Konstantinos Mavridis
Journal:  Int J Environ Res Public Health       Date:  2018-02-03       Impact factor: 3.390

5.  Implications of insecticide resistance for malaria vector control with long-lasting insecticidal nets: a WHO-coordinated, prospective, international, observational cohort study.

Authors:  Immo Kleinschmidt; John Bradley; Tessa Bellamy Knox; Abraham Peter Mnzava; Hmooda Toto Kafy; Charles Mbogo; Bashir Adam Ismail; Jude D Bigoga; Alioun Adechoubou; Kamaraju Raghavendra; Jackie Cook; Elfatih M Malik; Zinga José Nkuni; Michael Macdonald; Nabie Bayoh; Eric Ochomo; Etienne Fondjo; Herman Parfait Awono-Ambene; Josiane Etang; Martin Akogbeto; Rajendra M Bhatt; Mehul Kumar Chourasia; Dipak K Swain; Teresa Kinyari; Krishanthi Subramaniam; Achille Massougbodji; Mariam Okê-Sopoh; Aurore Ogouyemi-Hounto; Celestin Kouambeng; Mujahid Sheikhedin Abdin; Philippa West; Khalid Elmardi; Sylvie Cornelie; Vincent Corbel; Neena Valecha; Evan Mathenge; Luna Kamau; Jonathan Lines; Martin James Donnelly
Journal:  Lancet Infect Dis       Date:  2018-04-09       Impact factor: 25.071

6.  Providing open access data online to advance malaria research and control.

Authors:  Catherine L Moyes; William H Temperley; Andrew J Henry; Clara R Burgert; Simon I Hay
Journal:  Malar J       Date:  2013-05-16       Impact factor: 2.979

7.  An online tool for mapping insecticide resistance in major Anopheles vectors of human malaria parasites and review of resistance status for the Afrotropical region.

Authors:  Tessa B Knox; Elijah O Juma; Eric O Ochomo; Helen Pates Jamet; Laban Ndungo; Patrick Chege; Nabie M Bayoh; Raphael N'Guessan; Riann N Christian; Richard H Hunt; Maureen Coetzee
Journal:  Parasit Vectors       Date:  2014-02-21       Impact factor: 3.876

8.  A generic schema and data collection forms applicable to diverse entomological studies of mosquitoes.

Authors:  Samson S Kiware; Tanya L Russell; Zacharia J Mtema; Alpha D Malishee; Prosper Chaki; Dickson Lwetoijera; Javan Chanda; Dingani Chinula; Silas Majambere; John E Gimnig; Thomas A Smith; Gerry F Killeen
Journal:  Source Code Biol Med       Date:  2016-03-28

Review 9.  Contemporary status of insecticide resistance in the major Aedes vectors of arboviruses infecting humans.

Authors:  Catherine L Moyes; John Vontas; Ademir J Martins; Lee Ching Ng; Sin Ying Koou; Isabelle Dusfour; Kamaraju Raghavendra; João Pinto; Vincent Corbel; Jean-Philippe David; David Weetman
Journal:  PLoS Negl Trop Dis       Date:  2017-07-20

10.  Implications of insecticide resistance for malaria vector control with long-lasting insecticidal nets: trends in pyrethroid resistance during a WHO-coordinated multi-country prospective study.

Authors: 
Journal:  Parasit Vectors       Date:  2018-10-22       Impact factor: 3.876

View more
  14 in total

1.  Future use-cases of vaccines in malaria control and elimination.

Authors:  Melissa A Penny; Flavia Camponovo; Nakul Chitnis; Thomas A Smith; Marcel Tanner
Journal:  Parasite Epidemiol Control       Date:  2020-05-06

2.  Mapping trends in insecticide resistance phenotypes in African malaria vectors.

Authors:  Penelope A Hancock; Chantal J M Hendriks; Julie-Anne Tangena; Harry Gibson; Janet Hemingway; Michael Coleman; Peter W Gething; Ewan Cameron; Samir Bhatt; Catherine L Moyes
Journal:  PLoS Biol       Date:  2020-06-25       Impact factor: 8.029

3.  Indoor residual spraying for malaria control in sub-Saharan Africa 1997 to 2017: an adjusted retrospective analysis.

Authors:  Julie-Anne A Tangena; Chantal M J Hendriks; Maria Devine; Meghan Tammaro; Anna E Trett; Ignatius Williams; Adilson José DePina; Achamylesh Sisay; Ramandimbiarijaona Herizo; Hmooda Toto Kafy; Elizabeth Chizema; Allan Were; Jennifer Rozier; Michael Coleman; Catherine L Moyes
Journal:  Malar J       Date:  2020-04-10       Impact factor: 2.979

4.  Assessing cross-resistance within the pyrethroids in terms of their interactions with key cytochrome P450 enzymes and resistance in vector populations.

Authors:  C L Moyes; R S Lees; C Yunta; K J Walker; K Hemmings; F Oladepo; P A Hancock; D Weetman; M J I Paine; H M Ismail
Journal:  Parasit Vectors       Date:  2021-02-18       Impact factor: 3.876

5.  Near-term climate change impacts on sub-national malaria transmission.

Authors:  Jailos Lubinda; Ubydul Haque; Yaxin Bi; Busiku Hamainza; Adrian J Moore
Journal:  Sci Rep       Date:  2021-01-12       Impact factor: 4.379

6.  An increasing role of pyrethroid-resistant Anopheles funestus in malaria transmission in the Lake Zone, Tanzania.

Authors:  Natacha Protopopoff; Louisa A Messenger; Nancy S Matowo; Jackline Martin; Manisha A Kulkarni; Jacklin F Mosha; Eliud Lukole; Gladness Isaya; Boniface Shirima; Robert Kaaya; Catherine Moyes; Penelope A Hancock; Mark Rowland; Alphaxard Manjurano; Franklin W Mosha
Journal:  Sci Rep       Date:  2021-06-29       Impact factor: 4.379

7.  Emergence of behavioural avoidance strategies of malaria vectors in areas of high LLIN coverage in Tanzania.

Authors:  K S Kreppel; M Viana; B J Main; P C D Johnson; N J Govella; Y Lee; D Maliti; F C Meza; G C Lanzaro; H M Ferguson
Journal:  Sci Rep       Date:  2020-09-03       Impact factor: 4.379

8.  Evaluating insecticide resistance across African districts to aid malaria control decisions.

Authors:  Catherine L Moyes; Duncan K Athinya; Tara Seethaler; Katherine E Battle; Marianne Sinka; Melinda P Hadi; Janet Hemingway; Michael Coleman; Penelope A Hancock
Journal:  Proc Natl Acad Sci U S A       Date:  2020-08-25       Impact factor: 11.205

9.  Automated phenotyping of mosquito larvae enables high-throughput screening for novel larvicides and offers potential for smartphone-based detection of larval insecticide resistance.

Authors:  Steven D Buckingham; Frederick A Partridge; Beth C Poulton; Benjamin S Miller; Rachel A McKendry; Gareth J Lycett; David B Sattelle
Journal:  PLoS Negl Trop Dis       Date:  2021-06-03

Review 10.  Review and Meta-Analysis of the Evidence for Choosing between Specific Pyrethroids for Programmatic Purposes.

Authors:  Natalie Lissenden; Mara D Kont; John Essandoh; Hanafy M Ismail; Thomas S Churcher; Ben Lambert; Audrey Lenhart; Philip J McCall; Catherine L Moyes; Mark J I Paine; Giorgio Praulins; David Weetman; Rosemary S Lees
Journal:  Insects       Date:  2021-09-14       Impact factor: 2.769

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.