Literature DB >> 34854270

Toxoplasma gondii virulence prediction using hierarchical cluster analysis based on coding sequences (CDS) of sag1, gra7 and rop18.

Didik T Subekti1, Fitrine Ekawasti2, Muhammad Ibrahim Desem2, Zul Azmi3.   

Abstract

Toxoplasma gondii consists of three genotypes, namely genotype I, II and III. Based on its virulence, T. gondii can be divided into virulent and avirulent strains. This study intends to evaluate an alternative method for predicting T. gondii virulence using hierarchical cluster analysis based on complete coding sequences (CDS) of sag1, gra7 and rop18 genes. Dendrogram was constructed using UPGMA with a Kimura 80 nucleotide distance measurement. The results showed that the prediction errors of T. gondii virulence using sag1, gra7 and rop18 were 7.41%, 6.89% and 9.1%, respectively. Analysis based on CDS of gra7 and rop18 was able to differentiate avirulent strains into genotypes II and III, whereas sag1 failed to differentiate.
© 2021 The Korean Society of Veterinary Science.

Entities:  

Keywords:  Toxoplasma gondii; cluster analysis; genotype; virulence

Mesh:

Substances:

Year:  2021        PMID: 34854270      PMCID: PMC8636655          DOI: 10.4142/jvs.2021.22.e88

Source DB:  PubMed          Journal:  J Vet Sci        ISSN: 1229-845X            Impact factor:   1.672


INTRODUCTION

Toxoplasma gondii is one of the most common zoonotic diseases and is reported to infect nearly one third of the world's population [1]. The successful isolation of various T. gondii strains has led to the knowledge of genetic variations in many isolates. Variations in T. gondii isolates showed distinct pathogenicity in mice and, therefore, T. gondii virulence determination is based on its pathogenicity in mice [2]. T. gondii is categorized as virulent if LD100 = 1 and avirulent if LD50 ≥ 103 when its pathogenicity is tested on mice [2]. Other author divided avirulent into two types, namely intermediate avirulent with LD50 ≥ 103 and avirulent with LD50 ≥105 [3] while the other one classified as low virulence with LD50 ≥ 103 and non virulent with LD50 ≥105 [4]. However, hereinafter both opinions are only called avirulent strains, referring to the previously well-established categorization. [25]. T. gondii is known to have a clonal population based on genetic analysis by polymerase chain reaction–restriction fragment length polymorphism (PCR-RFLP) and microsatellite markers associated with its virulence. Based on its genetic analysis, the clonal population of T. gondii was divided into genotypes I, II and III [5]. Genotype I corresponds to a virulent strain, while genotypes II and III correspond to avirulent strains with LD50 ≥ 103 and LD50 ≥ 105, respectively [234]. Some virulent strains, such as CAST and GPHT, cannot actually be categorized into three existing genotypes (archetypal lineages), so they are called atypical (non-archetypal lineage) [6]. Virulence corresponds to the genotype of T. gondii that has been reported to be associated with clinical manifestations and severity of congenital cases [478]. T. gondii virulence can also be predicted through cell culture by observing their growth rate and their ability to penetrate through the biological barrier [9]. Other researchers also developed various approaches to predicting T. gondii virulence. These approaches include isoenzyme analysis using a six variable enzyme system [10] and serotyping using peptide membrane arrays [11]. However, all of them have produced unsatisfactory results. The mouse bioassay has proven to be the most accurate in establishing virulence, while PCR-RFLP is the preferred method for genotyping. Unfortunately, the PCR-RFLP method requires much effort and is often inconclusive, especially if the parasite strain is atypical. However, there are circumstances where tachyzoite isolation fails to be achieved or cannot be grown due to tachyzoites dying, whereupon a mouse bioassay cannot be implemented. These situations require looking for alternative methods. Therefore, another approach is needed, including hierarchical cluster analysis, which is based on a complete coding sequence (CDS) of sag1, gra7 and rop18 that encode a virulence-associated protein of T. gondii. SAG1 and GRA7 proteins contain different structures between virulent and avirulent strains and are thought to be used to predict T. gondii virulence [10]. PCR-RFLP on sag1 locus has been reported as being able to differentiate between virulent and avirulent strains of T. gondii [12]. Other proteins that have been shown to be virulence factors for T. gondii are GRA7 [13] and ROP18 [14]. Deletion of gra7 or rop18 genes results in partial attenuation of T. gondii [14]. However, if deletion is carried out in both genes, there is a complete attenuation that makes T. gondii become avirulent [14]. Based on this information, sag1, gra7 and rop18 were selected as genetic markers for predicting T. gondii virulence. This study is the first attempt to explore the possibility of using a hierarchical cluster analysis method based on a complete coding sequence of sag1, gra7 and rop18 for predicting T. gondii virulence.

MATERIALS AND METHODS

Virulent and avirulent strain

The virulence and genotypes of each T. gondii strain are obtained from the available genome database. The selected strain is one that has had its virulence status and genotype has been recognized. The genes of each strain are extracted from the available database, namely sag1, gra7 and rop18 (Table 1). Based on its virulence, T. gondii is grouped into virulent and avirulent strains. The genotype is divided into four groups, namely genotype I, II, III and atypical.
Table 1

Summary of Toxoplasma gondii virulence and genotypes used in this study

sag1 CDSgra7 CDSrop18 CDS
StrainsAccession numberVir.Gen.StrainsAccession numberVir.Gen.StrainsAccession numberVir.Gen.
RHJX045360VIRHJX045573.1VIRHJX045330.1VI
GT1JX045361VIGT1JX045574.1VIGT1GQ243202.1VI
ENTJX045418VIENTJX045578.1VIENTJX045328.1VI
VELJX045412VIVELJX045580.1VIVELJX045327.1VI
MORJX045394VIMORJX045579.1VIMORJX045326.1VI
FOUJX045357VIFOUJX045576.1VIFOUJX045332.1VI
ME49JX045362AVIIME49JX045583.1AVIIOH3JX045324.1VI
DEGJX045364AVIIDEGJX045584.1AVIIME49JX045319.1AVII
PIHJX045374AVIIPIHJX045586.1AVIIDEGJX045318.1AVII
WTD1JX045375AVIIBEVEU157141.1AVIIPTGGQ243204.1AVII
WTD3JX045376AVIIWTD1JX045589.1AVIIPIHJX045320.1AVII
RAYJX045379AVIIWTD3JX045595.1AVIIARIJX045322.1AVII
ARIJX045387AVIIRAYJX045590.1AVIIQHOGQ243205.1AVII
R961JX045389AVIIARIJX045611.1AVIICTGJX045346.1AVIII
B41JX045382AVIIR961JX045594.1AVIIVEGJX045348.1AVIII
H44JX045373AVIIB41JX045592.1AVIINEDJX045325.1AVIII
CTGJX045386AVIIIH44JX045588.1AVIISTRLJX045351.1AVIII
VEGJX045384AVIIICTGJX045617.1AVIIIM7741JX045353.1AVIII
STRLJX045385AVIIIVEGJX045618.1AVIIIP89JX045347.1AVAtyp.
M7741JX045366AVIIINEDDQ459455.2AVIIIRUBJX045336.1VAtyp.
B73JX045365AVIIISTRLJX045621.1AVIIIBOFJX045331.1VAtyp.
RODJX045405AVIIIM7741JX045623.1AVIIICASTJX045345.1VAtyp.
GPHTJX045415.1VAtyp.B73JX045622.1AVIII
BOFJX045400VAtyp.RODJX045619.1AVIII
CASTJX045358VAtyp.P89JX045616.1AVAtyp.
RUBJX045356VAtyp.RUBDQ459450.2VAtyp.
P89JX045409AVAtyp.GPHTJX045582.1VAtyp.
BOFJX045575.1VAtyp.
CASTJX045613.1VAtyp.

CDS, coding sequence; Atyp., atypical strains; Vir., virulence; Gen., genotypes; V, virulent; AV, avirulent.

CDS, coding sequence; Atyp., atypical strains; Vir., virulence; Gen., genotypes; V, virulent; AV, avirulent.

Selection of sag1, gra7 and rop18 genes

The three desired genes were obtained from the GenBank database provided by National Center of Biotechnology Information (NCBI, US National Library of Medicine, https://www.ncbi.nlm.nih.gov/genbank/). The sag1, gra7 and rop18 genes used must meet two criteria. First, the genes must be in a full-length sequence of CDS, which becomes a template for mature protein translation. Therefore, all partial sequences will be ignored. Second, all of these genes come from reference strains whose virulence and genotype are well known.

Data analysis

Cluster analysis was performed under hierarchical cluster analysis by CLC Sequence Viewer Ver. 8. Dendrogram was constructed by UPGMA (unweighted pair group method with arithmetic mean) with a Kimura 80 nucleotide distance measurement. Successful grouping in dendrograms was evaluated based on the lowest number of individual strains misplaced into the wrong cluster.

RESULTS

The sag1 coding sequence only differentiates T. gondii population into two clusters: virulent and avirulent (Fis. 1A and 2A). Misplacement occurs in a ROD strain that is categorized into the virulent category under genotype I (Fig. 1A), with an error rate of 4.3% (1/23). ROD has been recognized as an avirulent strain [10] and was assigned to genotype III [15]. In a further analysis involving atypical strains, the prediction error rate increased to 7.4% (2/27) due to misplacement of ROD and P89 strains (Fig. 2A). The dendrogram pattern in this study is similar to a previous study that used neighbour joining under Neil's genetic distance measurement [15]. In general, sag1 is only suitable for predicting T. gondii virulence, has an error rate of 4.3%–7.4%, and has even failed to distinguish genotypes in avirulent strains.
Fig. 1

Clustering of archetypal Toxoplasma gondii isolates as constructed by the UPGMA method. (A) sag1 CDS from 23 T. gondii isolates consist of genotypes I, II and III. (B) gra7 CDS from 25 T. gondii isolates consist of genotypes I, II and III. (C) rop18 CDS from 18 T. gondii isolates consist of genotypes I, II and III (red dot = avirulent strain, genotype II; green dot = avirulent strain, genotype III; small arrow = misplacement based on its virulence).

V, virulent; AV, avirulent; I, genotype I; II, genotype II; III, genotype III.

Fig. 2

Clustering of archetypal and non-archetypal Toxoplasma gondii isolates as constructed by the UPGMA method. (A) sag1 CDS from 27 isolates of T. gondii consist of 23 archetypal and 4 atypical. (B) gra7 CDS from 29 isolates of T. gondii consist of 25 archetypal and 4 atypical. (C) rop18 CDS from 22 isolates of T. gondii consist of 18 archetypal and 4 atypical (red dot = avirulent strain, genotype II; green dot = avirulent strain, genotype III; blue dot = atypical strains and there is a dispute over virulence or genotype; small arrow = misplacement based on its virulence).

V, virulent; AV, avirulent; I, genotype I; II, genotype II; III, genotype III.

Clustering of archetypal Toxoplasma gondii isolates as constructed by the UPGMA method. (A) sag1 CDS from 23 T. gondii isolates consist of genotypes I, II and III. (B) gra7 CDS from 25 T. gondii isolates consist of genotypes I, II and III. (C) rop18 CDS from 18 T. gondii isolates consist of genotypes I, II and III (red dot = avirulent strain, genotype II; green dot = avirulent strain, genotype III; small arrow = misplacement based on its virulence).

V, virulent; AV, avirulent; I, genotype I; II, genotype II; III, genotype III.

Clustering of archetypal and non-archetypal Toxoplasma gondii isolates as constructed by the UPGMA method. (A) sag1 CDS from 27 isolates of T. gondii consist of 23 archetypal and 4 atypical. (B) gra7 CDS from 29 isolates of T. gondii consist of 25 archetypal and 4 atypical. (C) rop18 CDS from 22 isolates of T. gondii consist of 18 archetypal and 4 atypical (red dot = avirulent strain, genotype II; green dot = avirulent strain, genotype III; blue dot = atypical strains and there is a dispute over virulence or genotype; small arrow = misplacement based on its virulence).

V, virulent; AV, avirulent; I, genotype I; II, genotype II; III, genotype III. The gra7 coding sequence has been used successfully to classify archetypal isolates from the T. gondii population into three clusters without error (Fig. 1B). Virulent clusters correspond to genotype I, while 2 avirulent clusters correspond to genotypes II and III. These results indirectly support the evidence that GRA7 relates to T. gondii virulence. The addition of several atypical strains in hierarchical cluster analysis using UPGMA led to the misplacement of some isolates (Fig. 2B). Two misplacements (6.9%) occurred, putting virulent strains CAST and RUB into avirulent clusters. One genotype (3.5%) misidentification, i.e., RUB strains that should be atypical (I/III), are grouped into genotype II. In general, gra7 can predict T. gondii virulence with an error rate of about 0%–6.9%. Overall, gra7 is better than sag1 because it can also distinguish genotypes with a prediction error of about 0%–3.5%. The rop18 coding sequence has been used to classify archetypal isolates from the T. gondii population into three clusters with an 11.1% (2/18) error rate due to misplacement of NED (genotype III) and PTG (genotype II) into virulent clusters (Fig. 1C). The addition of several atypical strains in hierarchical cluster analysis using UPGMA did not change the error rate for classifying virulence and genotyping of T. gondii (Fig. 2C). In general, rop18 is able to distinguish the virulence and genotypes of T. gondii with a prediction error rate of about 11.1%.

DISCUSSION

There is still a dispute about the virulence and genotype of RUB, CAST, and P89 strains. RUB and CAST were classified as virulent strains according to an analysis based on CDS of sag1 and rop18 but were identified as avirulent strains based on CDS of gra7. Meanwhile, P89 has been identified as a virulent strain based on CDS of sag1, but gra7 and rop18 have been identified as avirulent strains. The actual status of the P89 strain is still disputed by some researchers, some suggesting P89 is a virulent strain with genotype I [15], but other researchers claim that it is an intermediate virulent strain [10] with an atypical genotype [16]. The intermediate virulent strain is actually an avirulent strain with LD50 ≥ 103 [3]. P89 was identified as genotype III based on multilocus sequence analysis under sag2A and sag3 genetic markers [17]. This evidence supports several reports suggesting that P89 is atypical with the possibility of genetic recombination (type I/III). Therefore, classifying P89 as an avirulent strain and genotype III in this study was normally acceptable as it is in line with the analysis based on CDS of gra7 and rop18. RUB is categorized as a virulent strain [10] with an atypical genotype [16]. RUB can be grouped differently when analyzed using multilocus markers on a PCR-RFLP. RUB is grouped with genotype I when using sag1 and sag2 as locus markers [18]. Conversely, if using sag4 and bsr4 as locus markers, RUB will be in one cluster of genotype III [18]. Similarly, CAST is categorized as a virulent strain and genotype I [15] but other researchers consider its genotype atypical [616]. Based on a multilocus genotyping analysis using several genetic markers, CAST is an atypical genotype grouped into I/III recombinant isolate [6]. Therefore, in this study RUB and CAST were grouped into a virulent strain of genotype I as this result is also in line with the analysis based on CDS of sag1 and rop18. Based on the analysis and discussion above, two aspects can be considered limitations of this approach. First, this analysis requires the complete CDS of the gene. Second, prediction errors may occur if the T. gondii being analyzed is an atypical genotype. However, the frequency of these errors is quite low and is limited to genotype grouping. We consider that this is not a serious problem because it can be reduced by performing a combination analysis using sag1, gra7 and rop18. Overall, this approach has several advantages. First, it requires fewer genes, namely sag1, gra7 and rop18 when compared to PCR-RFLP, which requires multilocus genes, and the results are often inconclusive. Second, this approach is able to predict the virulence of T. gondii satisfactorily and it is easier to conclude if the analysis is based on a combination of both or all three genes. Third, this approach does not depend on live parasites as used in the mouse bioassay. In conclusion, the average misplacement of T. gondii strains according to its virulence by sag1, gra7 and rop18 were 7.41%, 6.89% and 9.1%, respectively. Analysis based on CDS of gra7 and rop18 were able to categorize avirulent strains into genotypes II and III, whereas sag1 failed to differentiate. Conclusions drawn from the analysis based on the CDS of both or all three genes will reduce prediction errors in classifying atypical T. gondii.
  18 in total

Review 1.  Sexual recombination and clonal evolution of virulence in Toxoplasma.

Authors:  Michael E Grigg; Yasuhiro Suzuki
Journal:  Microbes Infect       Date:  2003-06       Impact factor: 2.700

2.  Genotyping of Toxoplasma gondii by multilocus PCR-RFLP markers: a high resolution and simple method for identification of parasites.

Authors:  C Su; X Zhang; J P Dubey
Journal:  Int J Parasitol       Date:  2006-04-18       Impact factor: 3.981

3.  Success and virulence in Toxoplasma as the result of sexual recombination between two distinct ancestries.

Authors:  M E Grigg; S Bonnefoy; A B Hehl; Y Suzuki; J C Boothroyd
Journal:  Science       Date:  2001-10-05       Impact factor: 47.728

Review 4.  Toxoplasma gondii, "new" genotypes and virulence.

Authors:  M L Dardé
Journal:  Parasite       Date:  2008-09       Impact factor: 3.000

5.  Virulent strains of Toxoplasma gondii comprise a single clonal lineage.

Authors:  L D Sibley; J C Boothroyd
Journal:  Nature       Date:  1992-09-03       Impact factor: 49.962

Review 6.  Population structure of Toxoplasma gondii: clonal expansion driven by infrequent recombination and selective sweeps.

Authors:  L David Sibley; James W Ajioka
Journal:  Annu Rev Microbiol       Date:  2008       Impact factor: 15.500

7.  Genotyping of Toxoplasma gondii Directly from Human and Animal Biological Samples: From Partial Genotypes to a New Genotype.

Authors:  M M Cunha; A C A V Carneiro; J G L Costa; R W A Vitor
Journal:  J Parasitol       Date:  2015-09-29       Impact factor: 1.276

8.  Genotype of 86 Toxoplasma gondii isolates associated with human congenital toxoplasmosis, and correlation with clinical findings.

Authors:  Daniel Ajzenberg; Nadine Cogné; Luc Paris; Marie-Hélène Bessières; Philippe Thulliez; Denis Filisetti; Hervé Pelloux; Pierre Marty; Marie-Laure Dardé
Journal:  J Infect Dis       Date:  2002-08-05       Impact factor: 5.226

9.  ROP18 is a key factor responsible for virulence difference between Toxoplasma gondii and Neospora caninum.

Authors:  Tao Lei; Hui Wang; Jing Liu; Huizhu Nan; Qun Liu
Journal:  PLoS One       Date:  2014-06-13       Impact factor: 3.240

10.  Virulence of atypical Toxoplasma gondii strains isolated in French Guiana in a murine model.

Authors:  Stéphane Simon; Benoit de Thoisy; Aurélien Mercier; Mathieu Nacher; Magalie Demar
Journal:  Parasite       Date:  2019-09-24       Impact factor: 3.000

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.