| Literature DB >> 31825986 |
Jérôme Ambroise1, Léonid M Irenge1, Jean-François Durant1, Bertrand Bearzatto1, Godfrey Bwire2, O Colin Stine3, Jean-Luc Gala1.
Abstract
BACKGROUND: Multiple-Locus Variable Number of Tandem Repeats (VNTR) Analysis (MLVA) is widely used by laboratory-based surveillance networks for subtyping pathogens causing foodborne and water-borne disease outbreaks. However, Whole Genome Sequencing (WGS) has recently emerged as the new more powerful reference for pathogen subtyping, making a data conversion method necessary which enables the users to compare the MLVA identified by either method. The MLVAType shiny application was designed to extract MLVA profiles of Vibrio cholerae isolates from WGS data while ensuring backward compatibility with traditional MLVA typing methods.Entities:
Year: 2019 PMID: 31825986 PMCID: PMC6905556 DOI: 10.1371/journal.pone.0225848
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Formula used in the current study to compute the number of tandem repeats from the amplicon size.
*: Modifications introduced in the new formula.
| Loci | Motif | Formula used by Kendall et al. | New formula |
|---|---|---|---|
| VC0147 | aacaga | (X-150)/6 | (X-150)/6 |
| VC0437 | gacccta | (X-245)/6 | (X-252)/7* |
| VC1650 | ataatccag | (X-307)/9 | (X-308)/9* |
| VCA0171 | gctgtt | (X-270)/6 | (X-268)/6* |
| VCA0283 | ccagaa | (X-95)/6 | (X-95)/6 |
Fig 1Maximum number of tandem repeats (MNTR).
Example of the maximum number of a 6 nt motif included in a k-mer of 33 nt.
Maximum number of tandem repeats in a k-mer.
| Loci | motif length | k-mer | |||||
|---|---|---|---|---|---|---|---|
| 33 | 55 | 77 | 99 | 127 | 175 | ||
| VC0147 | 6 | 5 | 9 | 12 | 16 | 21 | 29 |
| VC0437 | 7 | 4 | 7 | 11 | 14 | 18 | 25 |
| VC1650 | 9 | 3 | 6 | 8 | 11 | 14 | 19 |
| VCA0171 | 6 | 5 | 9 | 12 | 16 | 21 | 29 |
| VCA0283 | 6 | 5 | 9 | 12 | 16 | 21 | 29 |
Fig 2Screenshot of the MLVAtype shiny application.
Nine V. cholerae genomes from DRC were uploaded and processed with this application.
MLVA profiles consisting in the number of tandem repetition for each loci (VC0147, VC0437, VC1650, VCA0171, and VCA0283) extracted from GeneScan data and from Sanger sequences.
Mismatch is indicated in bold.
| Country | Isolate | Sanger-derived MLVA profile | GeneScan-derived MLVA profile |
|---|---|---|---|
| Uganda | UG010 | (9;3;7;21;26) | (9,3,7,21,26) |
| Uganda | UG020 | (9;3;7;21;27) | (9,3,7,21,27) |
| Uganda | UG026 | (9;3;7;21;28) | (9,3,7,21,28) |
| Uganda | UG040 | (10;7;7;9;17) | (10,7,7,9,17) |
| Uganda | UG042 | (8;7;7;10;21) | (8,7,7,10,21) |
| Uganda | UG046 | (8;7;7;11;21) | (8,7,7,11,21) |
| Uganda | UG054 | (8;7;7;10;21) | ( |
| Uganda | UG060 | (10;7;7;9;17) | (10,7,7,9,17) |
| Uganda | UG071 | (10;7;7;8;18) | (10,7,7,8,18) |
| Uganda | UG086 | (10;7;7;9;18) | (10,7,7,9,18) |
WGS-derived MLVA profiles extracted from DRC and Ugandan genomes assembled with various values of the k-mer parameter and compared to the gold-standard (i.e. Sanger-derived MLVA profile).
*: Maximum Number of Tandem Repeats (MNTR) in the corresponding k-mer. n/a: not applicable. Read lengths obtained with DRC and Ugandan isolates were 300 and 150 nt, respectively.
| Country | Isolate | WGS-derived MLVA profile | Sanger-derived MLVA profile | ||||
|---|---|---|---|---|---|---|---|
| k-mer = 55 (9,7,6,9,9)* | k-mer = 77 (12,11,8,12,12)* | k-mer = 99 (16,14,11,16,16)* | k-mer = 127 (21,18,14,21,21)* | k-mer = 175 (29,25,19,29,29)* | |||
| DRC | CTMA-1402 | (≥9;≥7;≥6;≥9;≥9) | (9;7;7;10;≥12) | (9;7;7;10;≥16) | (9;7;7;10;16) | (9;7;7;10;16) | (9,7,7,10,16) |
| DRC | CTMA-1421 | (≥9;≥7;≥6;≥9;≥9) | (9;7;7;11;≥12) | (9;7;7;11;≥16) | (9;7;7;11;17) | (9;7;7;11;17) | (9,7,7,11,17) |
| DRC | CTMA-1424 | (≥9;≥7;≥6;≥9;≥9) | (10;7;7;11;≥12) | (10;7;7;11;≥16) | (10;7;7;11;16) | (10;7;7;11;16) | (10,7,7,11,16) |
| DRC | CTMA-1426 | (≥9;≥7;≥6;≥9;≥9) | (10;7;6;≥12;≥12) | (10;7;6;≥16;≥16) | (10;7;6;≥21;≥21) | (10;7;6;24;21) | (10,7,6,24,21) |
| DRC | CTMA-1427 | (≥9;≥7;≥6;≥9;≥9) | (10;7;6;≥12;≥12) | (10;7;6;≥16;≥16) | (10;7;6;≥21;≥21) | (10;7;6;24;21) | (10,7,6,24,21) |
| DRC | CTMA-1432 | (≥9;≥7;≥6;≥9;≥9) | (10;7;6;≥12;≥12) | (10;7;6;≥16;≥16) | (10;7;6;16;20) | (10;7;6;16;20) | (10,7,6,16,20) |
| DRC | CTMA-1435 | (≥9;≥7;≥6;≥9;≥9) | (10;7;6;≥12;≥12) | (10;7;6;≥16;≥16) | (10;7;6;≥21;18) | (10;7;6;23;18) | (10,7,6,23,18) |
| DRC | CTMA-1461 | (≥9;≥7;≥6;≥9;≥9) | (11;7;7;≥12;≥12) | (11;7;7;13;≥16) | (11;7;7;13;16) | (11;7;7;13;16) | (11,7,7,13,16) |
| DRC | CTMA-1473 | (≥9;≥7;≥6;≥9;≥9) | (10;7;7;≥12;≥12) | (10;7;7;12;≥16) | (10;7;7;12;16) | (10;7;7;12;16) | (10,7,7,12,16) |
| Uganda | UG010 | (≥9;3;≥6;≥9;≥9) | (9;3;7;≥12;≥12) | (9;3;7;≥16;≥16) | (9;3;7;≥21;≥21) | n/a | (9;3;7;21;26) |
| Uganda | UG020 | (≥9;3;≥6;≥9;≥9) | (9;3;7;≥12;≥12) | (9;3;7;≥16;≥16) | (9;3;7;≥21;≥21) | n/a | (9;3;7;21;27) |
| Uganda | UG026 | (≥9;3;≥6;≥9;≥9) | (9;3;7;≥12;≥12) | (9;3;7;≥16;≥16) | (9;3;7;≥21;≥21) | n/a | (9;3;7;21;28) |
| Uganda | UG040 | (≥9;≥7;≥6;≥9;≥9) | (10;7;7;9;≥12) | (10;7;7;9;≥16) | (10;7;7;9;17) | n/a | (10;7;7;9;17) |
| Uganda | UG042 | (8;≥7;≥6;≥9;≥9) | (8;7;7;10;≥12) | (8;7;7;10;≥16) | (8;7;7;10;≥21) | n/a | (8;7;7;10;21) |
| Uganda | UG046 | (8;≥7;≥6;≥9;≥9) | (8;7;7;11;≥12) | (8;7;7;11;≥16) | (8;7;7;11;≥21) | n/a | (8;7;7;11;21) |
| Uganda | UG054 | (8;≥7;≥6;≥9;≥9) | (8;7;7;10;≥12) | (8;7;7;10;≥16) | (8;7;7;10;≥21) | n/a | (8;7;7;10;21) |
| Uganda | UG060 | (≥9;≥7;≥6;≥9;≥9) | (10;7;7;9;≥12) | (10;7;7;9;≥16) | (10;7;7;9;17) | n/a | (10;7;7;9;17) |
| Uganda | UG071 | (≥9;≥7;≥6;8;≥9) | (10;7;7;8;≥12) | (10;7;7;8;≥16) | (10;7;7;8;18) | n/a | (10;7;7;8;18) |
| Uganda | UG086 | (≥9;≥7;≥6;≥9;≥9) | (10;7;7;9;≥12) | (10;7;7;9;≥16) | (10;7;7;9;18) | n/a | (10;7;7;9;18) |
Fig 3Percentage of correct, incorrect, and censored estimation of the number of tandem repeats.
Each percentage is produced by either GeneScan (GS) or WGS-based approaches; Sanger sequencing results were used as the reference values. *: SPAdes v.3.13.0. **: After modification of SPAdes v.3.13.0 source code.
WGS-derived MLVA profiles extracted with MLVAtype without taking into account the MNTR value.
Mismatches between WGS- and Sanger-derived values are indicated in bold. n/a: not applicable. Read lengths obtained with DRC and Ugandan isolates were 300 and 150 nt, respectively.
| Country | Isolate | WGS-derived MLVA profile | Sanger-derived MLVA profile | |||||
|---|---|---|---|---|---|---|---|---|
| k-mer = 55 | k-mer = 77 | k-mer = 99 | k-mer = 127 | k-mer = 175 | k-mer unsp. | |||
| DRC | CTMA-1402 | (9; | (9;7;7;10; | (9;7;7;10;16) | (9;7;7;10;16) | (9;7;7;10;16) | (9;7;7;10;16) | (9,7,7,10,16) |
| DRC | CTMA-1421 | (9;7; | (9;7;7;11; | (9;7;7;11;17) | (9;7;7;11;17) | (9;7;7;11;17) | (9;7;7;11;17) | (9,7,7,11,17) |
| DRC | CTMA-1424 | ( | (10;7;7;11; | (10;7;7;11;16) | (10;7;7;11;16) | (10;7;7;11;16) | (10;7;7;11;16) | (10,7,7,11,16) |
| DRC | CTMA-1426 | ( | (10;7;6; | (10;7;6; | (10;7;6; | (10;7;6;24;21) | (10;7;6; | (10,7,6,24,21) |
| DRC | CTMA-1427 | ( | (10;7;6; | (10;7;6; | (10;7;6; | (10;7;6;24;21) | (10;7;6; | (10,7,6,24,21) |
| DRC | CTMA-1432 | ( | (10;7;6; | (10;7;6;16; | (10;7;6;16;20) | (10;7;6;16;20) | (10;7;6;16;20) | (10,7,6,16,20) |
| DRC | CTMA-1435 | ( | (10;7;6; | (10;7;6; | (10;7;6; | (10;7;6;23;18) | (10;7;6; | (10,7,6,23,18) |
| DRC | CTMA-1461 | ( | (11;7;7; | (11;7;7;13;16) | (11;7;7;13;16) | (11;7;7;13;16) | (11;7;7;13;16) | (11,7,7,13,16) |
| DRC | CTMA-1473 | ( | (10;7;7;12; | (10;7;7;12;16) | (10;7;7;12;16) | (10;7;7;12;16) | (10;7;7;12;16) | (10,7,7,12,16) |
| Uganda | UG010 | (9;3; | (9;3;7; | (9;3;7; | (9;3;7;21; | n/a | (9;3;7; | (9;3;7;21;26) |
| Uganda | UG020 | (9;3; | (9;3;7; | (9;3;7; | (9;3;7;21; | n/a | (9;3;7; | (9;3;7;21;27) |
| Uganda | UG026 | (9;3; | (9;3;7; | (9;3;7; | (9;3;7;21; | n/a | (9;3;7; | (9;3;7;21;28) |
| Uganda | UG040 | ( | (10;7;7;9; | (10;7;7;9; | (10;7;7;9;17) | n/a | (10;7;7;9; | (10;7;7;9;17) |
| Uganda | UG042 | (8;7; | (8;7;7;10; | (8;7;7;10; | (8;7;7;10;21) | n/a | (8;7;7;10; | (8;7;7;10;21) |
| Uganda | UG046 | (8;7; | (8;7;7;11; | (8;7;7;11; | (8;7;7;11;21) | n/a | (8;7;7;11; | (8;7;7;11;21) |
| Uganda | UG054 | (8;7; | (8;7;7;10; | (8;7;7;10; | (8;7;7;10;21) | n/a | (8;7;7;10; | (8;7;7;10;21) |
| Uganda | UG060 | ( | (10;7;7;9; | (10;7;7;9; | (10;7;7;9;17) | n/a | (10;7;7;9; | (10;7;7;9;17) |
| Uganda | UG071 | ( | (10;7;7;8; | (10;7;7;8; | (10;7;7;8;18) | n/a | (10;7;7;8; | (10;7;7;8;18) |
| Uganda | UG086 | ( | (10;7;7;9; | (10;7;7;9;16) | (10;7;7;9;18) | n/a | (10;7;7;9; | (10;7;7;9;18) |
WGS-derived MLVA profiles extracted using an in silico PCR approach.
Mismatches between WGS- and Sanger-derived values are indicated in bold. U: undetermined. n/a: not applicable. Read lengths obtained with DRC and Ugandan isolates were 300 and 150 nt, respectively.
| Country | Isolate | WGS-derived MLVA profile | Sanger-derived MLVA profile | |||||
|---|---|---|---|---|---|---|---|---|
| k-mer = 55 | k-mer = 77 | k-mer = 99 | k-mer = 127 | k-mer = 175 | k-mer unsp. | |||
| DRC | CTMA-1402 | (9; | (9;7;7;10;U) | (9;7;7;10;16) | (9;7;7;10;16) | (9;7;7;10;16) | (9;7;7;10;16) | (9,7,7,10,16) |
| DRC | CTMA-1421 | (9; | (9;7;7;11;U) | (9;7;7;11;17) | (9;7;7;11;17) | (9;7;7;11;17) | (9;7;7;11;17) | (9,7,7,11,17) |
| DRC | CTMA-1424 | ( | (10;7;7;11; | (10;7;7;11;16) | (10;7;7;11;16) | (10;7;7;11;16) | (10;7;7;11;16) | (10,7,7,11,16) |
| DRC | CTMA-1426 | ( | (10;7;6;U;U) | (10;7;6;U; | (10;7;6; | (10;7;6;24;21) | (10;7;6; | (10,7,6,24,21) |
| DRC | CTMA-1427 | ( | (10;7;6;U;U) | (10;7;6;U; | (10;7;6; | (10;7;6;24;21) | (10;7;6; | (10,7,6,24,21) |
| DRC | CTMA-1432 | ( | (10;7;6;U; | (10;7;6;16; | (10;7;6;16;20) | (10;7;6;16;20) | (10;7;6;16;20) | (10,7,6,16,20) |
| DRC | CTMA-1435 | ( | (10;7;6;U; | (10;7;6;U; | (10;7;6; | (10;7;6;23;18) | (10;7;6; | (10,7,6,23,18) |
| DRC | CTMA-1461 | ( | (11;7;7; | (11;7;7;13;16) | (11;7;7;13;16) | (11;7;7;13;16) | (11;7;7;13;16) | (11,7,7,13,16) |
| DRC | CTMA-1473 | ( | (10;7;7;12;U) | (10;7;7;12;16) | (10;7;7;12;16) | (10;7;7;12;16) | (10;7;7;12;16) | (10,7,7,12,16) |
| Uganda | UG010 | (9;3; | (9;3;7;U;U) | (9;3;7;U;U) | (9;3;7;21;U) | n/a | (9;3;7; | (9;3;7;21;26) |
| Uganda | UG020 | (9;3; | (9;3;7;U;U) | (9;3;7; | (9;3;7;21;U) | n/a | (9;3;7; | (9;3;7;21;27) |
| Uganda | UG026 | (9;3; | (9;3;7;U;U) | (9;3;7; | (9;3;7;21;U) | n/a | (9;3;7; | (9;3;7;21;28) |
| Uganda | UG040 | ( | (10;7;7;9; | (10;7;7;9; | (10;7;7;9;17) | n/a | (10;7;7;9; | (10;7;7;9;17) |
| Uganda | UG042 | (8;7; | (8;7;7;10;U) | (8;7;7;10; | (8;7;7;10;21) | n/a | (8;7;7;10; | (8;7;7;10;21) |
| Uganda | UG046 | (8;7; | (8;7;7;11;U) | (8;7;7;11; | (8;7;7;11;21) | n/a | (8;7;7;11; | (8;7;7;11;21) |
| Uganda | UG054 | (8;7; | (8;7;7;10;U) | (8;7;7;10; | (8;7;7;10;21) | n/a | (8;7;7;10; | (8;7;7;10;21) |
| Uganda | UG060 | ( | (10;7;7;9;U) | (10;7;7;9; | (10;7;7;9;17) | n/a | (10;7;7;9; | (10;7;7;9;17) |
| Uganda | UG071 | ( | (10;7;7;8; | (10;7;7;8; | (10;7;7;8;18) | n/a | (10;7;7;8; | (10;7;7;8;18) |
| Uganda | UG086 | ( | (10;7;7;9; | (10;7;7;9; | (10;7;7;9;18) | n/a | (10;7;7;9; | (10;7;7;9;18) |
Fig 4Quality metrics of genome assemblies.
Number of contigs and N50 metrics, as reported by QUAST for genome assemblies from DRC and Ugandan (UGD) isolates and various selectable default and modified k-mer sizes. *: SPAdes v.3.13.0. **: After modification of the SPAdes v.3.13.0 source code.
Cost of MLVA targeting five loci.
*: Optimal cost if the five loci are amplified in a single PCR (theoretical cost analysis). **: Reported cost of a typing method based on a duplex and a triplex PCR followed by two fragment analysis runs. ***: Cost of commercial sequencing per sample in 2019. **** Reagent and consumable cost of Illumina sequencing per sample [12].
| Cost USD | ||||
|---|---|---|---|---|
| Sanger-based (5 loci) | GeneScan-based (5 loci) optimal cost* | GeneScan-based (5 loci) published protocol** | WGS-based | |
| PCR | 40 (5 x 8) | 8 | 16 (2 x 8) | - |
| Analytical cost | 35 (5 x 7) | 3.4 | 6.8 (2 x 3.4) | 99–150 |
| Total | 22.8 | |||