| Literature DB >> 30914076 |
Loes Soetens1,2, Jantien A Backer1, Susan Hahné1, Rob van Binnendijk1, Sigrid Gouma1,3, Jacco Wallinga1,2.
Abstract
IntroductionWith growing amounts of data available, identification of clusters of persons linked to each other by transmission of an infectious disease increasingly relies on automated algorithms. We propose cluster finding to be a two-step process: first, possible transmission clusters are identified using a cluster algorithm, second, the plausibility that the identified clusters represent genuine transmission clusters is evaluated.AimTo introduce visual tools to assess automatically identified clusters.MethodsWe developed tools to visualise: (i) clusters found in dimensions of time, geographical location and genetic data; (ii) nested sub-clusters within identified clusters; (iii) intra-cluster pairwise dissimilarities per dimension; (iv) intra-cluster correlation between dimensions. We applied our tools to notified mumps cases in the Netherlands with available disease onset date (January 2009 - June 2016), geographical information (location of residence), and pathogen sequence data (n = 112). We compared identified clusters to clusters reported by the Netherlands Early Warning Committee (NEWC).ResultsWe identified five mumps clusters. Three clusters were considered plausible. One was questionable because, in phylogenetic analysis, genetic sequences related to it segregated in two groups. One was implausible with no smaller nested clusters, high intra-cluster dissimilarities on all dimensions, and low intra-cluster correlation between dimensions. The NEWC reports concurred with our findings: the plausible/questionable clusters corresponded to reported outbreaks; the implausible cluster did not.ConclusionOur tools for assessing automatically identified clusters allow outbreak investigators to rapidly spot plausible transmission clusters for mumps and other human-to-human transmissible diseases. This fast information processing potentially reduces workload.Entities:
Keywords: algorithms; cluster identification; mumps; phylogenetic analysis; plausibility assessment; transmission
Mesh:
Substances:
Year: 2019 PMID: 30914076 PMCID: PMC6440581 DOI: 10.2807/1560-7917.ES.2019.24.12.1800331
Source DB: PubMed Journal: Euro Surveill ISSN: 1025-496X
Summary of all mumps outbreak reports by the Netherlands Early Warning Committee between January 2009–May 2016 (n = 14)
| No | Date reported | Reported by | Covering time period | Number of cases in report | Age range (years) | Remark/source | Cluster number according to current study |
|---|---|---|---|---|---|---|---|
| 1 | 09 Apr | RIVM | Aug 2007–Apr 2009 | 171 | NR | Start of nationwide mumps epidemic | 4 |
| 2 | 12 Feb | RIVM | Dec 2009–Feb 2012 | 1,264 | NR | Overview of nationwide mumps epidemic | NL |
| 3 | 12 Apr | GGD Gelderland - Midden | Mar 2012 | 22 | 15–26 | Party | 5 |
| 4 | 12 Jul | GGD Hollands - Noorden | Jul 2012 | 3 | 6–8 | School | NL |
| 5 | 12 Aug | GGD Zaanstreek - Waterland | Jul 2012–Aug 2012 | 21 | 16–48 | Unknown | 5 |
| 6 | 13 Feb | Utrecht | Feb 2013 | 8 | NR | Unknown | 5 |
| 7 | 13 Jun | GGD Hollands Noorden | Jun 2013 | 11 | 23–29 | Unknown | NL |
| 8 | 13 Nov | GGD Zaanstreek - Waterland | Sep 2013–Nov 2013 | 16 | 4–47 | All living in Volendam | 3 |
| 9 | 13 Nov | GGD Groningen | Sep 2013–Nov 2013 | 13 | 17–36 | Students | NL |
| 10 | 14 Feb | GGD Zaanstreek - Waterland, GGD Haaglanden | Feb 2014 | 3 | 25–30 | Work in healthcare setting | NL |
| 11 | 15 Apr | GGD Haaglanden | Mar 2015–Apr 2015 | 5 | NR | Sports club | 2 |
| 12 | 15 Jun | GGD Haaglanden | Apr 2015–Jun 2015 | NR | NR | Students linked to school and earlier cluster at sports club | 2 |
| 13 | 16 Mar | GGD Brabant Zuidoost | Feb 2016–Mar 2016 | 6 | 18–40 | Carnival | 1 |
| 14 | 16 Apr | GGD Hart voor Brabant | Mar 2016–Apr 2016 | 6 | 17–23 | Friends/party | 1 |
GGD: Gemeentelijke GezondheidsDienst (Municipal Health Service); NL: no link to a cluster; NR: not reported; RIVM: Rijksinstituut voor Volksgezondheid en Milieu (National institute for public health and the environment).
Figure 1Hierarchical clustering tree of the combined dissimilarities of all dimensionsa for cases of mumps in the Netherlands, January 2009–May 2016 (n = 112 cases)
Figure 2Identified clusters of mumps with cases projected on (a) an epicurve (time), (b) maps of the Netherlands (geographical location) and (c) an arbitrarily rooted maximum likelihood phylogenetic tree of the pathogen sequences (genetics), Netherlands, January 2009–May 2016 (n = 112 cases)
Figure 3Notched boxplots of the pairwise dissimilarities per dimension and cluster of mumps cases, Netherlands, January 2009–May 2016 (n = 112 cases)
Figure 4Matrix plots of the Spearman intra-cluster correlation between the pairwise dissimilarities of all four dimensions for cluster 1–5