| Literature DB >> 31852893 |
Titus Alicai1, Anna M Szyniszewska2, Christopher A Omongo1, Phillip Abidrabo1, Geoffrey Okao-Okuja1, Yona Baguma1, Emmanuel Ogwok1, Robert Kawuki1, Williams Esuma1, Fred Tairo3, Anton Bua1, James P Legg4, Richard O J H Stutt2, David Godding2,5, Peter Sseruwagi3, Joseph Ndunguru3, Christopher A Gilligan6.
Abstract
Cassava brown streak disease (CBSD) is currently the most devastating cassava disease in eastern, central and southern Africa affecting a staple crop for over 700 million people on the continent. A major outbreak of CBSD in 2004 near Kampala rapidly spread across Uganda. In the following years, similar CBSD outbreaks were noted in countries across eastern and central Africa, and now the disease poses a threat to West Africa including Nigeria - the biggest cassava producer in the world. A comprehensive dataset with 7,627 locations, annually and consistently sampled between 2004 and 2017 was collated from historic paper and electronic records stored in Uganda. The survey comprises multiple variables including data for incidence and symptom severity of CBSD and abundance of the whitefly vector (Bemisia tabaci). This dataset provides a unique basis to characterize the epidemiology and dynamics of CBSD spread in order to inform disease surveillance and management. We also describe methods used to integrate and verify extensive field records for surveys typical of emerging epidemics in subsistence crops.Entities:
Mesh:
Year: 2019 PMID: 31852893 PMCID: PMC6920376 DOI: 10.1038/s41597-019-0334-9
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Fig. 1Venn diagram of Dataset C representing the union between Datasets A and B; C = A U B (with A having priority over B where duplicate records were identified).
Per-field summary data available for Uganda.
| Year | 2004 | 2005 | 2006 | 2007 | 2008 | 2009 | 2010 | 2011 | 2012 | 2013 | 2014 | 2015a | 2017b | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Dataset A | 600 | 476 | 300 | 253 | 189 | 105 | 514 | 200 | — | 606 | 200 | 374 | 403 | 4220 |
| Dataset B | — | — | — | — | 474 | 390 | 698 | 752 | 545 | 1066 | 1031 | — | — | 4956 |
| Matched* | — | — | — | — | 176 | 102 | 479 | 197 | — | 422 | 173 | — | — | 1549 |
| Dataset C | 600 | 476 | 300 | 253 | 487 | 393 | 733 | 755 | 545 | 1250 | 1058 | 374 | 403 | 7627 |
Dataset A represents quality-assured and highly accurate records digitized from retrievable paper-based forms by a specialist company, data digitized during a data entry workshopa, or collected with the iForm survey app in the field and downloaded from the serverb. Dataset B represents previously typed survey records with most of the original paper forms no longer available. The resulting merged Dataset C represents the integration of two datasets with priority given to Dataset A where duplicate records were found, and supplemented with information from Dataset B for those records, where paper forms were not retrievable.
Fig. 2Dataset C field data collection locations. CBSD foliar symptoms are classified as present (red) or absent (blue). Dataset C is a union of the Dataset A (cross) that contains information at plant level in each field and is supplemented with additional information from the dataset B (circle) data.
Fig. 3Three levels of administrative division of Uganda in 2014. Shapefiles are obtained from the Geo-Information Services Division, Uganda Bureau of Statistics.
Variables collected in the survey with the distinction between variables derived from Datasets A and B, and collected at plant or field level.
| Variable | Resolution | Verification level | |
|---|---|---|---|
| Dataset A | Dataset B | ||
| date | Field | 1 | — |
| time | Field | 2 | — |
| year | Field | 1 | 2 |
| latitude, longitude | Field | 1 | 2 |
| village | Field | 3 | 3 |
| crop_age_months | Field | 3 | 3 |
| field_size_m2 | Field | 3 | 3 |
| num_neighbouring_fields | Field | 3 | 3 |
| intercrop | Field | 3 | — |
| variety_sampled | Field | 3 | 3 |
| variety_2, variety_3, variety_4 | Field | 3 | 3 |
cbsd_foliar_presence cbsd_foliar_severity | Dataset A: Plant | 1 | — |
cbsd_foliar_incidence cbsd_foliar_severity_mean | Dataset B: Field | — | 2 |
| cbsd_in_other_varieties | Field | 3 | 3 |
adult_whitefly_count/ adult_whitefly_mean | Dataset A: Plant/Dataset B: Field | 1 | 2 |
| county, district, subregion, region | Field | 4 | 3 |
| altitude_masl | Field | 4 | 3 |
Variables were subjected to a post-hoc verification process. 1 represents detailed post-hoc verification of variables using automated screening for plausible ranges of values and random manual checks. 2 represents moderate level of scrutiny ensuring variables are within a plausible range. 3 indicates lack of post-hoc verification process. 4 indicates the variable was derived from external sources based on GPS surveyed field location coordinates: administrative units were derived from shapefiles provided by the Geo-Information Services Division, Uganda Bureau of Statistics. Altitude (masl) was derived from the SRTM 90 meters digital elevation model (DEM) version 4.
Fig. 4The distribution of deviations between matched records in Datasets A and B. A total of 488 out of 1549 (31.5%) matched field records had deviations in reported whitefly means higher than 0.09. Mean deviation = 4.389.
| Measurement(s) | occurrence • cassava brown streak disease • Bemisia tabaci • Observed Incidence |
| Technology Type(s) | digital curation |
| Factor Type(s) | geographic location • year • date • time • variety sampled • field size • intercrop • number of neighbouring fields • crop age • altitude |
| Sample Characteristic - Organism | Manihot esculenta |
| Sample Characteristic - Environment | cultivated environment |
| Sample Characteristic - Location | Africa |