| Literature DB >> 29584782 |
Jessica Hicks1, Tod Stuber1, Kristina Lantz1, Matthew Erdman2, Suelee Robbe-Austerman1, Xiaoqiu Huang3.
Abstract
Contagious equine metritis is a disease of worldwide concern in equids. The United States is considered to be free of the disease although sporadic outbreaks have occurred over the last few decades that were thought to be associated with the importation of horses. The objective of this study was to create finished, reference quality genomes that characterize the diversity of Taylorella equigenitalis isolates introduced into the USA, and identify their differences. Five isolates of T. equigenitalis associated with introductions into the USA from unique sources were sequenced using both short and long read chemistries allowing for complete assembly and annotation. These sequences were compared to previously published genomes as well as the short read sequences of the 200 isolates in the National Veterinary Services Laboratories' diagnostic repository to identify unique regions and genes, potential virulence factors, and characterize diversity. The 5 genomes varied in size by up to 100,000 base pairs, but averaged 1.68 megabases. The majority of that diversity in size can be explained by repeat regions and 4 main regions of difference, which ranged in size from 15,000 to 45,000 base pairs. The first region of difference contained mostly hypothetical proteins, the second contained the CRISPR, the third contained primarily hemagglutinin proteins, and the fourth contained primarily segments of a type IV secretion system. As expected and previously reported, little evidence of recombination was found within these genomes. Several additional areas of interest were also observed including a mechanism for streptomycin resistance and other virulence factors. A SNP distance comparison of the T. equigenitalis isolates and Mycobacterium tuberculosis complex (MTBC) showed that relatively, T. equigenitalis was a more diverse species than the entirety of MTBC.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29584782 PMCID: PMC5870977 DOI: 10.1371/journal.pone.0194253
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Taylorella equigenitalis isolate information.
| Isolate | GenBank | Collection Year | Animal Country of Origin | Breed | Gender | Epidemiology Summary |
|---|---|---|---|---|---|---|
| 79–1587 | CP021060 | 1978 | United States | Thoroughbred | Female | Bred to a European stallion |
| 89–0490 | CP021199 | 1989 | Germany | Unknown | Female | No Data |
| 92–0972 | CP021200 | 1992 | Austria | Lipizzaner | Male | No Data |
| 98–0554 | CP021201 | 1998 | Netherlands | Unknown | Female | No Data |
| 09–0932 | CP021246 | 2009 | United States | Quarter Horse | Female | 2009 US outbreak of unknown origin [ |
Summary of isolate information showing relevant collection data and GenBank Accession numbers.
Taylorella equigenitalis isolate attributes.
| Isolate | Length | GC | Streptomycin | PFGE | MLST (ST-CC) | Total Genes | Coding Genes |
|---|---|---|---|---|---|---|---|
| 79–1587 | 1,739,054 | 37.3% | Resistant | TE003 | 1–1 | 1621 | 1546 |
| 89–0490 | 1,649,945 | 37.6% | Susceptible | TE018 | 16–2 | 1527 | 1445 |
| 92–0972 | 1,666,291 | 37.5% | Susceptible | TE011 | 4 –(no cc) | 1534 | 1455 |
| 98–0554 | 1,692,042 | 37.5% | Resistant | TE014 | 17–4 | 1565 | 1486 |
| 09–0932 | 1,635,330 | 37.6% | Resistant | TE004 | 58 - (no cc) | 1508 | 1434 |
| NC_018108.1 | 1,732,123 | 37.3% | Resistant | TE003 | 1–1 | 1619 | 1536 |
| NC_014914.1 | 1,695,860 | 37.4% | Unknown | Unknown | 2–1 | 1577 | 1512 |
Summary of isolate assembly and typing. Shown are length of the completed sequence, GC content, Streptomycin resistance status, PFGE pattern (by in vitro analysis), and MLST (sequence type–clonal complex). [15]
Fig 1Mauve alignment.
This alignment contains the 5 study isolates and NC_018108.1. Overall similarity among the 6 genomes with localized regions of variability is visible. No rearrangements are evident.
Fig 2BRIG alignment.
BRIG alignments of the 5 study isolates and the references. (A) Relative to reference NC_018108.1 four major regions with poor alignment are visible. (B) A possible fifth region with poor alignment becomes visible when isolates are aligned with respect to NC_014914.1.
Fig 3Mean and variance of percent coverage by window.
Mean percent coverage and variance of the percent coverage of each isolate at a minimum depth of 10X using the MiSeq reads from the 200 NVSL diagnostic isolates of T. equigenitalis. Results are given by 35,000 bp window.
Percent identity and coverage calculated from Lastz output performing pairwise comparisons.
| Isolates | Identity | Coverage | Variable Regions Causing Breaks in the Alignment | |||
|---|---|---|---|---|---|---|
| 79–1587 vs 89–0490 | 99.54% | 89–0490 | 91.19% | 79–1587 | 86.56% | 1,3,4 |
| 79–1587 vs 92–0972 | 99.34% | 92–0972 | 97.12% | 79–1587 | 93.11% | 1,2,3,4 |
| 79–1587 vs 98–0554 | 99.57% | 98–0554 | 90.67% | 79–1587 | 88.26% | 1,3,4 |
| 79–1587 vs 09–0932 | 99.49% | 09–0932 | 98.73% | 79–1587 | 92.85% | 1,3,4 |
| 89–0490 vs 09–0932 | 99.36% | 09–0932 | 99.72% | 89–0490 | 98.76% | 3,4 |
| 89–0490 vs 92–0972 | 99.46% | 92–0972 | 99.62% | 89–0490 | 97.82% | 1, 2 |
| 98–0554 vs 89–0490 | 99.54% | 98–0554 | 97.91% | 89–0490 | 100.44% | 1 |
| 98–0554 vs 09–0932 | 99.38% | 09–0932 | 99.57% | 98–0554 | 96.14% | 3,4 |
| 98–0554 vs 92–0972 | 99.57% | 92–0972 | 96.07% | 98–0554 | 94.61% | 1,2,3 |
| 92–0972 vs 09–0932 | 99.36% | 09–0932 | 99.15% | 92–0972 | 97.29% | 1,2,3 |
| 79–1587 vs NC_018108.1 | 99.97% | NC_018108.1 | 97.99% | 79–1587 | 97.58% | None |
| NC_014914.1 vs NC_018108.1 | 99.61% | NC_014914.1 | 98.08% | NC_018108.1 | 96.11% | 1,2 |
| 09–0932 vs NC_014914.1 | 99.55% | 09–0932 | 97.80% | NC_014914.1 | 94.27% | 2,3,4 |
aIsolates aligned using Lastz.
bPercent identity of the sequence aligned between the two isolates.
cPercent coverage of the alignment for each genome.
dVariable regions causing breaks in the alignment are noted.
Fig 4kSNP Maximum Likelihood (ML) tree for T. equigenitalis.
The tree contains the six isolates discussed and NC_016043 (T. asinigenitalis) as an out group.
T. equigenitalis SNP distances from kSNP.
| 09–0932 | 79–1587 | 89–0490 | 92–0972 | 98–0554 | NC_014914 | NC_016043 | NC_018108 | |
|---|---|---|---|---|---|---|---|---|
| 09–0932 | 3439 | 3922 | 4208 | 3855 | 3045 | 7817 | 3430 | |
| 79–1587 | 3439 | 4140 | 4386 | 3924 | 2886 | 7836 | 44 | |
| 89–0490 | 3922 | 4140 | 3896 | 1888 | 3949 | 7741 | 4132 | |
| 92–0972 | 4208 | 4386 | 3896 | 3965 | 4269 | 7793 | 4381 | |
| 98–0554 | 3855 | 3924 | 1888 | 3965 | 3730 | 7821 | 3919 | |
| NC_014914 | 3045 | 2886 | 3949 | 4269 | 3730 | 7861 | 2877 | |
| NC_016043 | 7817 | 7836 | 7741 | 7793 | 7821 | 7861 | 7834 | |
| NC_018108 | 3430 | 44 | 4132 | 4381 | 3919 | 2877 | 7834 |
SNP distances were calculated in kSNP using all called SNPs with NC_016043 (T. asinigenitalis) as an outgroup.