| Literature DB >> 26697878 |
Mónica J Pajuelo1, María Eguiluz1, Eric Dahlstrom2, David Requena1, Frank Guzmán1, Manuel Ramirez1, Patricia Sheen1, Michael Frace3, Scott Sammons3, Vitaliano Cama4, Sarah Anzick2, Dan Bruno2, Siddhartha Mahanty2, Patricia Wilkins4, Theodore Nash2, Armando Gonzalez5, Héctor H García6,7, Robert H Gilman8, Steve Porcella2, Mirko Zimic1.
Abstract
BACKGROUND: Infections with Taenia solium are the most common cause of adult acquired seizures worldwide, and are the leading cause of epilepsy in developing countries. A better understanding of the genetic diversity of T. solium will improve parasite diagnostics and transmission pathways in endemic areas thereby facilitating the design of future control measures and interventions. Microsatellite markers are useful genome features, which enable strain typing and identification in complex pathogen genomes. Here we describe microsatellite identification and characterization in T. solium, providing information that will assist in global efforts to control this important pathogen.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26697878 PMCID: PMC4689449 DOI: 10.1371/journal.pntd.0004316
Source DB: PubMed Journal: PLoS Negl Trop Dis ISSN: 1935-2727
Fig 1Map of Peru depicting the location of T. solium isolates used for this study.
Map showing the location of Tumbes and Puno, as well as Huancayo (locality of one of the genome source) and Lima, the country´s capital (A). Maps of Tumbes (B) and Puno (C) showing the location of the communities of origin of the tapeworms for this study
Origin of T. solium tapeworm isolates for the evaluation of microsatellites.
| Tapeworm ID | Community | Populations | Region |
|---|---|---|---|
| 6 | San Isidro | Cruz-Isidro-pinos | Tumbes |
| 39 | San Isidro | Cruz-Isidro-pinos | Tumbes |
| 28 | San Isidro | Cruz-Isidro-pinos | Tumbes |
| 38 | La Cruz | Cruz-Isidro-pinos | Tumbes |
| 7 | Los Pinos | Cruz-Isidro-Pinos | Tumbes |
| 19 | Tumbes | Tum-Corrales-San Juan-PenaBlanca | Tumbes |
| 31 | Tumbes | Tum_corrales-san juan_PenaBlanca | Tumbes |
| 20 | Corrales | Tum_corrales-san juan_PenaBlanca | Tumbes |
| 27 | Corrales | Tum_corrales-san juan_PenaBlanca | Tumbes |
| 11 | Corrales | Tum_corrales-san juan_PenaBlanca | Tumbes |
| 12 | Fuerte 5 de Julio | Tum_corrales-san juan_PenaBlanca | Tumbes |
| 5 | San Juan—Virgen | Tum_corrales-san juan_PenaBlanca | Tumbes |
| 25 | Peña Blanca | Tum_corrales-san juan_PenaBlanca | Tumbes |
| 18 | La Choza | Cañaveral–Choza | Tumbes |
| 24 | Cañaveral | Cañaveral–Choza | Tumbes |
| 14 | Pueblo Nuevo | Uñagato-progreso-pueblonuevo | Tumbes |
| 36 | Pueblo Nuevo | Uñagato-progreso-pueblonuevo | Tumbes |
| 8 | Nuevo Progreso | Uñagato-progreso-pueblonuevo | Tumbes |
| 35 | Nuevo Progreso | Uñagato-progreso-pueblonuevo | Tumbes |
| 13 | Uña de Gato | Uñagato-progreso-pueblonuevo | Tumbes |
| 40 | Pharata | Fharata | Puno |
| 1 | Pharata | Fharata | Puno |
| 30 | Pharata | Fharata | Puno |
| 17 | Pharata | Fharata | Puno |
| 21 | Pharata | Fharata | Puno |
| 34 | Pharata | Fharata | Puno |
| 3 | Callata | Callata | Puno |
| 9 | Callata | Callata | Puno |
| 10 | Callata | Callata | Puno |
| 29 | Callata | Callata | Puno |
| 4 | Callata | Callata | Puno |
| 15 | Camicachi | Camicachi | Puno |
| 33 | Camicachi | Camicachi | Puno |
| 37 | Camicachi | Camicachi | Puno |
| 23 | Camicachi | Camicachi | Puno |
| 32 | Camicachi | Camicachi | Puno |
| 16 | Conchaca | Conchaca_Tuturuma | Puno |
| 2 | Conchaca | Conchaca_Tuturuma | Puno |
| 26 | Conchaca | Conchaca_Tuturuma | Puno |
| 22 | Tuturuma | Conchaca_Tuturuma | Puno |
† Populations are the groups that were formed by proximity for the phylogenetic analysis
Taenia solium genomes assemblies’ statistics.
| CDC | NIH | Hybrid | Hybrid | |
|---|---|---|---|---|
| Puno proglottid | Huancayo Cyst | Cyst/proglottid (all contigs) | Cyst/proglottid (GenBank) | |
| Total sequenced bases | 109,898,809 | 114,605,177 | 116,668,703 | 111,029,218 |
| Contigs | 47,475 | 18,361 | 19,727 | 7,979 |
| GC% | 42.96 | 42.83 | 42.86 | 42.80 |
| N50 | 4,839 | 39,744 | 46,836 | 43,923 |
| Shortest contig (nt) | 287 | 51 | 51 | 1000 |
| Largest contig (nt) | 79,438 | 307,365 | 395,362 | 39,5362 |
| Mean contig length | 2,314.9 | 6,241.8 | 5,914.2 | 13,915.2 |
Statistics of the hybrid genome are shown for the full version and for the set of contigs uploaded to GenBank
†Only contigs larger than 1000bp were submitted
Frequency of microsatellites per type found in T. solium partial Genome 1 and partial Genome 2.
| Number of nucleotides | Range of repeats | Number of microsatellites | |
|---|---|---|---|
| Genome 1 | Genome 2 | ||
| Total bp analized | 109’898,809 | 114,605,177 | |
| Mono nucleotide | 10 to 53 | 2,298 | 2,737 |
| Di nucleotide | 6 to 38 | 3,393 | 3,537 |
| Tri nucleotide | 5 to 28 | 2,393 | 3,464 |
| Tetra nucleotide | 5 to 28 | 889 | 1,044 |
| Penta nucleotide | 5 to 18 | 123 | 122 |
| Hexa nucleotide | 5 to 10 | 33 | 32 |
| Total | 9,129 | 9,936 | |
Characteristics of microsatellite loci evaluated.
| Microsatellite | Primer sequences (5'-3') | Repetition motif | Observed size (bp) | Number of Alleles | PIC |
|---|---|---|---|---|---|
| TS_SSR_01 | ACCGGTGGTCGGAATTATTA GTTCTTGCTGAGGTGGTTCC | (CCATT) | 206–226 | 5 | 0.582 |
| TS_SSR_02 | CTCCGTGTCTTGACAGCAAA TGACGAAATGGAACAGTGGA | (ATGA) | 190 | 1 | - |
| TS_SSR_03 | TTTCAAGCACGTGTCAGCAT GCTGGCAGACAGTGAGTAGG | (CATT) | 155 | 1 | - |
| TS_SSR_04 | CAGATGAGGGGATGATGCTT GAACGATCCCAACCTCCATA | (GTT) | 180 | 1 | - |
| TS_SSR_05 | GGGAAAATGCAGTTCAGAGC GGTCTGATGCGAGGTCTAGG | (TAA) | 197 | 1 | - |
| TS_SSR_06 | GACCAAGCCCAACACCTCTA CAAGAATGAACGGGAGCAAC | (GGTA) | 177 | 1 | - |
| TS_SSR_07 | GCACACAAACTGGTCACTCG TGCTATGCGTTTGCTTGTTT | (CAAT) | -- | - | - |
| TS_SSR_08 | TCGTCAGTGTGGGAGAGTGA TGGTTGGATTTGTGCTTTGA | (ACG) | -- | - | - |
| TS_SSR_09 | AAGCCAATGGTGACCAAGAG GCCAGCATAGAAGAGCCTGT | (GGT) | 166–178 | 5 | 0.534 |
| TS_SSR_10 | CGACTCACGGCATTCATCTA TCCAAGACCCTGTGAAATCC | (GT) | 220 | 1 | - |
| TS_SSR_11 | TCATCTTCCCCGTAAGGCTA ACACTCGAAGCGCAGTGTTT | (GA) | 181 | 1 | - |
| TS_SSR_12 | ATCTCGACAGGCTCGAGTTC TCCGAACAGCTTCGAGTTTT | (TG) | 192 | 1 | - |
| TS_SSR_13 | GTAGCGGTAACGGAGTGAGG TCAGGCTGGTAACGTGTCAG | (GT) | 202 | 1 | - |
| TS_SSR_14 | AGCCGGTTCTCAGTTGATTG AATGCACTCATGCCATCTCA | (TG) | 162 | 1 | - |
| TS_SSR_15 | GAAAAGAACGGCATGCAAAT GTTTGGCCATTTTGCCTCTA | (AT) | 165 | 1 | - |
| TS_SSR_16 | CGCTGGACTAGGGTCGAATA CAGCAGAACAACAGCACCAT | (GT) | 160–166 | 4 | 0.472 |
| TS_SSR_17 | GCATTCCGAGGATGAATGAT CGTTTTTCTGCACACTTGGA | (CA) | 160 | 1 | - |
| TS_SSR_18 | AGTTAGCGTGCTTGCTTGGT ATTCCTGTTGCAACCTCCAC | (GT) | 168–180 | 6 | 0.638 |
| TS_SSR_19 | TCCCTTACACCCTTCACGTC AAAGGCGGTAGATTGTGTGC | (TG) | 163 | 1 | - |
| TS_SSR_20 | GGCCATTCAGTACCAACCAT TGTGCATGCCATTGTATGTG | (CT) | 154 | 1 | - |
| TS_SSR_21 | CTATGCCACACCCAACAATG GGCCTTCAAGATCACTCGTC | (GT) | 187 | 1 | - |
| TS_SSR_22 | CCTATTCCACTGGGGTGATG TCGATGAGCTTGCTGTATGTG | (TG) | 178 | 1 | - |
| TS_SSR_23 | CCTTTTTCGGTGAAGTCGAT GCCTCCTTACACACATGCAA | (CA) | 209 | 1 | - |
| TS_SSR_24 | CCCCATTTCCTGTTTCCTCT GCGGTGGCAATATAAGCATT | (CT) | 144 | 1 | - |
| TS_SSR_25 | AGGTGGCGTTATGAATCAGC GCAAACCATCGGATAAAGGA | (AC) | 174 | 1 | - |
| TS_SSR_26 | CGGTTTGCTTTTATGCCAAT AAATGGTCGCCTGAAATGAC | (GAA) | 165 | 1 | - |
| TS_SSR_27 | GAGGTCTCGCCTCATCAAAG TTTCCACTCCCAAAAACTCG | (GAA) | 158–176 | 5 | 0.546 |
| TS_SSR_28 | TGACGCTGGTAAGCTGTTTG GGAACCTTGGCACGAGATAG | (GTA) | 202–226 | 9 | 0.843 |
| TS_SSR_29 | AAAGATGGACGGAAACAGGA GTTGGACGGAGATGTGTGTG | (AGG) | 187 | 1 | - |
| TS_SSR_30 | TGACGTGTCGTCAGGTAGGA CGCATAGCCAGTACTTGTTCC | (TCC) | 190 | 1 | - |
| TS_SSR_31 | GGTTGCTTTTGCTTGTCCTC CACTCTCCACGAGTCCACAA | (TGA) | 157/179 | 1 | - |
| TS_SSR_32 | TGACGTTAACGAGGGTGTTG AGATCTCGCCTTGCAACAAT | (AGC) | 177–210 | 10 | 0.617 |
| TS_SSR_33 | CCAGCGGCATATTACAAAGG ACTCAAAAGCGCCGAAATTA | (AGG) | 130 | 1 | - |
| TS_SSR_34 | ATCACTCCTGTCCCAACTGC GGGTCGATTGGTCAGAGAAA | (CCT) | 182 | 1 | - |
| TS_SSR_35 | GGGCGTGAACTCGAATAAAA GGGGCAGACAAGTGAAAAAG | (CCA) | 170 | 1 | - |
| TS_SSR_36 | GCCCTGATTGTTGCTTTTGT AACGACACGCGGAAAATATC | (TCT) | 175 | 1 | - |
± PIC was calculated only for polymorphic microsatellites
General characteristics of polymorphic microsatellites by region.
| Region | Number of isolates | Number of polymorphic loci | Median number of alleles per locus | Total number of different genotypes |
|---|---|---|---|---|
| Tumbes | 20 | 7 | 3 | 16 |
| Puno | 20 | 7 | 5 | 20 |