| Literature DB >> 31364699 |
Abdulwahed Fahad Alrefaei1,2, Ross Low3, Neil Hall1,3, Rodrigo Jardim4, Alberto Dávila4, Rick Gerhold5, Shinto John6, Sascha Steinbiss7, Andrew A Cunningham6, Becki Lawson6, Diana Bell1, Kevin Tyler8.
Abstract
In Europe, Trichomonas gallinae recently emerged as a cause of epidemic disease in songbirds. A clonal strain of the parasite, first found in the United Kingdom, has become the predominant strain there and spread to continental Europe. Discriminating this epidemic strain of T. gallinae from other strains necessitated development of multilocus sequence typing (MLST). Development of the MLST was facilitated by the assembly and annotation of a 54.7 Mb draft genome of a cloned stabilate of the A1 European finch epidemic strain (isolated from Greenfinch, Chloris chloris, XT-1081/07 in 2007) containing 21,924 protein coding genes. This enabled construction of a robust 19 locus MLST based on existing typing loci for Trichomonas vaginalis and T. gallinae. Our MLST has the sensitivity to discriminate strains within existing genotypes confidently, and resolves the American finch A1 genotype from the European finch epidemic A1 genotype. Interestingly, one isolate we obtained from a captive black-naped fruit dove Ptilinopsus melanospilus, was not truly T. gallinae but a hybrid of T. gallinae with a distant trichomonad lineage. Phylogenetic analysis of the individual loci in this fruit dove provides evidence of gene flow between distant trichomonad lineages at 2 of the 19 loci examined and may provide precedence for the emergence of other hybrid trichomonad genomes including T. vaginalis.Entities:
Keywords: MLST; emerging infectious disease; genome; hybrid; molecular epidemiology; trichomoniasis; trichomonosis; wild bird
Mesh:
Substances:
Year: 2019 PMID: 31364699 PMCID: PMC6735722 DOI: 10.1093/gbe/evz164
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
List of Case Isolate ID, Bird Species, Year Found, Location, Evidence of Upper Alimentary Tract Lesions Consistent With Trichomonosis, Genotype
| Case No. | Host Species | Year Found | Isolate Origin | Oropharyngeal Lesions | Genotype * |
|---|---|---|---|---|---|
| XT-1081/07 (GF1c) |
Greenfinch
| 2007 |
Norfolk, England Wild | Yes | A1 Europe |
| HF1 |
House finch
| 2006 |
Kentucky, United States Wild | Yes | A1 United States |
| R11 |
Feral pigeon
| 2004 |
Georgia, United States Wild | No | A2 |
| R-1604/13 (BND1c) |
Black-naped Fruit Dove
| 2013 |
UK Zoological Collection Captive | No | M1 |
| 5 UEA |
Feral pigeon
| 2012 |
Norfolk, England Wild | No | C2 |
| Fh49001 |
Woodpigeon
| 2014 |
Norfolk, England Wild | Yes | C4 |
| Norfolk31/15 |
Feral pigeon
| 2015 |
Norfolk, England Wild | No | C8 |
| Norfolk32/15 |
Feral pigeon
| 2015 |
Norfolk, England Wild | No | C9 |
| R-138/14 |
Socorro dove
| 2015 |
UK Zoological Collection Captive | No | C10 |
Note.—Location refers to United Kingdom counties and United States state where the bird was found. *Genotyping scheme according to Chi et al. 2013.
BUSCO Reports for T. gallinae and T. vaginalis Giving a Measure of the Completeness of the Genome and Showing that the T. gallinae Genome and Annotation is Comparable to the T. vaginalis Genome
| BUSCO Content |
|
|
|
|
|---|---|---|---|---|
| Complete | 186 | 207 | 186 | 209 |
| Single copy | 163 | 158 | 154 | 160 |
| Duplicated | 23 | 49 | 32 | 49 |
| Fragmented | 13 | 9 | 15 | 16 |
| Missing | 104 | 87 | 102 | 78 |
| Completeness (%) | 65.67 | 71.28 | 66.33 | 74.25 |
Summary for the Trichomonas gallinae GF1c genome assembly (GenBank Accession Number MRSU00000000)
| Feature | Value |
|---|---|
| Genome | |
| Size of assembly (bp) | 54,799,485 |
| Gene G + C content (%) | 38.77 |
| No. of scaffolds | 11,704 |
| N50 scaffold size (bp) | 20,741 |
| Protein-coding genes | |
| No. of predicted genes | 21,924 |
| Mean gene length (bp) | 1,392 |
| Gene G + C content (%) | 38.25 |
| Gene density (genes/Mb) | 400.08 |
| Nonprotein-coding genes | |
| No. of nonprotein-coding genes | 327 |
| Predicted tRNA genes | 319 |
| Ribosomal RNA (rRNA) | 4 |
| Predicted snRNAs | 4 |
. 1.—MUMmer plot showing collinearity between Trichomonas gallinae and Trichomonas vaginalis. Red points denote collinearity matches >75% sequence identity and blue matches are inverted for both the contigs (A) and the coding sequences (B) for T. gallinae and T. vaginalis.
List of GenBank Accession Numbers for Each Locus of the MLST
| GenotypeGene ID | A1 | A1_US | A2 | M1 | C2 | C4 | C8 | C9 | C10 |
|---|---|---|---|---|---|---|---|---|---|
| TGA-000149300 | MK550746 | MK550747 | MK550748 | MK550749 | MK550750 | MK550751 | MK550752 | MK550753 | MK550754 |
| TGA-001385000 | MK550836 | MK550837 | MK550838 | MK550839 | MK550840 | MK550841 | MK550842 | MK550843 | MK550844 |
| TGA-00112400 | MK550737 | MK550738 | MK550739 | MK550740 | MK550741 | MK550742 | MK550743 | MK550744 | MK550745 |
| TGA-000731500 | MK550791 | MK550792 | MK550793 | MK550794 | MK550795 | MK550796 | MK550797 | MK550798 | MK550799 |
| TGA-000149500 | MK550755 | MK550756 | MK550757 | MK550758 | MK550759 | MK550760 | MK550761 | MK550762 | MK550763 |
| TGA-000080800 | MK550728 | MK550729 | MK550730 | MK550731 | MK550732 | MK550733 | MK550734 | MK550735 | MK550736 |
| TGA-002154000 | MK550872 | MK550873 | MK550874 | MK550875 | MK550876 | MK550877 | MK550878 | MK550879 | MK550880 |
| TGA-001611300 | MK550854 | MK550855 | MK550856 | MK550857 | MK550858 | MK550859 | MK550860 | MK550861 | MK550862 |
| TGA-000024800 | MK550719 | MK550720 | MK550721 | MK550722 | MK550723 | MK550724 | MK550725 | MK550726 | MK550727 |
| TGA-000367600 | MK550764 | MK550765 | MK550766 | MK550767 | MK550768 | MK550769 | MK550770 | MK550771 | MK550772 |
| TGA-000478600 | MK550773 | MK550774 | MK550775 | MK550776 | MK550777 | MK550778 | MK550779 | MK550780 | MK550781 |
| TGA-001175900 | MK550818 | MK550819 | MK550820 | MK550821 | MK550822 | MK550823 | MK550824 | MK550825 | MK550826 |
| TGA-001325800 | MK550827 | MK550828 | MK550829 | MK550830 | MK550831 | MK550832 | MK550833 | MK550834 | MK550835 |
| TGA-002155200 | MK550881 | MK550882 | MK550883 | MK550884 | MK550885 | MK550886 | MK550887 | MK550888 | MK550889 |
| TGA-000818700 | MK550809 | MK550810 | MK550811 | MK550812 | MK550813 | MK550814 | MK550815 | MK550816 | MK550817 |
| TGA-000730800 | MK550782 | MK550783 | MK550784 | MK550785 | MK550786 | MK550787 | MK550788 | MK550789 | MK550790 |
| TGA-000739900 | MK550800 | MK550801 | MK550802 | MK550803 | MK550804 | MK550805 | MK550806 | MK550807 | MK550808 |
| TGA-001849400 | MK550863 | MK550864 | MK550865 | MK550866 | MK550867 | MK550868 | MK550869 | MK550870 | MK550871 |
| TGA-001506800 | MK550845 | MK550846 | MK550847 | MK550848 | MK550849 | MK550850 | MK550851 | MK550852 | MK550853 |
Multilocus Sequence Typing (MLST) Loci Employed
|
|
| Locus ( | Gene Name | Length | SNPs | Nucleotides per SNP |
|---|---|---|---|---|---|---|
| TGA-000149300 | TVAG-087140 | A |
| 808 | 4 | 202 |
| TGA-001385000 | TVAG-400860 | B | Clan MA, family M8, leishmanolysin-like metallopeptidase ( | 682 | 16 | 43 |
| TGA-00112400 | TVAG-005070 | C | Mismatch repair MutL homolog ( | 708 | 59 | 12 |
| TGA-000731500 | TVAG-302400 | D | Mismatch repair MutL homolog ( | 966 | 32 | 30 |
| TGA-000149500 | TVAG-021420 | E | Coronin ( | 904 | 18 | 50 |
| TGA-000080800 | TVAG-364940 | F | Antigenic protein P1, putative ( | 741 | 71 | 10 |
| TGA-002154000 | TVAG-216430 | G | Clan MA, family M8, leishmanolysin-like metallopeptidase (G | 771 | 53 | 15 |
| TGA-001611300 | TVAG-303420 | H | Vesicular mannose-binding lectin, putative ( | 725 | 36 | 20 |
| TGA-000024800 | TVAG-291830 | I | Vesicular mannose-binding lectin, putative, PS( | 736 | 32 | 23 |
| TGA-000367600 | TVAG-228710 | J | Clan CA, family C1, cathepsin L-like cysteine peptidase | 817 | 53 | 15 |
| TGA-000478600 | TVAG-485880 | K | Clan CA, family C1, cathepsin L-like cysteine peptidase | 721 | 80 | 9 |
| TGA-001175900 | TVAG-171780 | L | HIV-1 rev binding protein, putative | 759 | 53 | 14 |
| TGA-001325800 | TVAG-086190 | M | Vesicular mannose-binding lectin, putative | 910 | 40 | 28 |
| TGA-002155200 | TVAG-291970 | N | Multidrug resistance pump, putative | 743 | 50 | 15 |
| TGA-000818700 | TVAG-459080 | O | Aspartic peptidase | 900 | 29 | 31 |
| TGA-000730800 | TVAG-414100 | P | Tropomyosin isoforms1/2, putative | 655 | 33 | 20 |
| TGA-000739900 | TVAG-192620 | Q | Actin depolymerizing factor, putative | 657 | 31 | 21 |
| TGA-001849400 | TVAG-094560 | R | Clan CE, family C48, cysteine peptidase | 867 | 52 | 17 |
| TGA-001506800 | TVAG-309150 | S | Conserved hypothetical protein (with PF03388 Domain) | 700 | 31 | 23 |
. 2.—A concatenated phylogeny of Trichomonas gallinae based on 19 housekeeping genes of MLST created by using maximum likelihood provides resolution for all strains tested. Trichomonas vaginalis is included as an outgroup. Numbers on branches indicate bootstrap values based on 2,000 replicates. (A) ITS ribotypes alone resolve only 3 groups of avian trichomonads A, C, and M. (B) Fe-hydrogenase is able to resolve 8 genotypes from the 10 avian trichomonads strains tested. (C) MLST based on 19 loci (listed in table 3) provides confident resolution of all the avian trichomonad strains tested. A1 (epidemic) encompasses isolates of the European finch epidemic A1 strain.
. 3.—Most markers group the M1 isolate with the A1 (European finch epidemic A1 strain). Phylogenetic trees and split network tree based on concatenated sequence analysis of Trichomonas gallinae strains, with 16 loci of MLST (all except loci A, D, and I) and the Fe-hydrogenase gene.
. 4.—Introgression of Trichomonas gallinae genes. The phylogenetic (A) and split network (B) trees based on concatenated sequence of Loci D and I are incongruous with other loci grouping the M1 isolate with C8 and C9 strains.
. 5.—Introgression of non-Trichomonas gallinae Trichomonad genes. The phylogenetic (A) and split network (B) trees based on concatenated sequence of Locus A and the ITS region are incongruous with other loci placing the M1 isolate outside of the T. gallinae group and thus suggesting a divergent trichomonad as the origin or these two loci.