| Literature DB >> 33028684 |
McKenzie Carter1, Stephanie Gomez1, Sam Gritz1, Stephen Larson1, Eugenia Silva-Herzog2, Hee-Sook Kim3, Danae Schulz4, Galadriel Hovel-Miner5.
Abstract
Trypanosoma brucei is an early branching protozoan parasite that causes human and animal African trypanosomiasis. Forward genetics approaches are powerful tools for uncovering novel aspects of trypanosomatid biology, pathogenesis, and therapeutic approaches against trypanosomiasis. Here, we have generated a T. brucei cloned ORFeome consisting of >90% of the targeted 7,245 genes and used it to make an inducible gain-of-function parasite library broadly applicable to large-scale forward genetic screens. We conducted a proof-of-principle genetic screen to identify genes whose expression promotes survival in melarsoprol, a critical drug of last resort. The 57 genes identified as overrepresented in melarsoprol survivor populations included the gene encoding the rate-limiting enzyme for the biosynthesis of an established drug target (trypanothione), validating the tool. In addition, novel genes associated with gene expression, flagellum localization, and mitochondrion localization were identified, and a subset of those genes increased melarsoprol resistance upon overexpression in culture. These findings offer new insights into trypanosomatid basic biology, implications for drug targets, and direct or indirect drug resistance mechanisms. This study generated a T. brucei ORFeome and gain-of-function parasite library, demonstrated the library's usefulness in forward genetic screening, and identified novel aspects of melarsoprol resistance that will be the subject of future investigations. These powerful genetic tools can be used to broadly advance trypanosomatid research.IMPORTANCE Trypanosomatid parasites threaten the health of more than 1 billion people worldwide. Because their genomes are highly diverged from those of well-established eukaryotes, conservation is not always useful in assigning gene functions. However, it is precisely among the trypanosomatid-specific genes that ideal therapeutic targets might be found. Forward genetics approaches are an effective way to identify novel gene functions. We used an ORFeome approach to clone a large percentage of Trypanosoma brucei genes and generate a gain-of-function parasite library. This library was used in a genetic screen to identify genes that promote resistance to the clinically significant yet highly toxic drug melarsoprol. Hits arising from the screen demonstrated the library's usefulness in identifying known pathways and uncovered novel aspects of resistance mediated by proteins localized to the flagellum and mitochondrion. The powerful new genetic tools generated herein are expected to promote advances in trypanosomatid biology and therapeutic development in the years to come.Entities:
Keywords: ORFeome; Trypanosomazzm321990; drug resistance mechanisms; forward genetics; parasitology; redox signaling
Mesh:
Substances:
Year: 2020 PMID: 33028684 PMCID: PMC7568655 DOI: 10.1128/mSphere.00769-20
Source DB: PubMed Journal: mSphere ISSN: 2379-5042 Impact factor: 4.389
FIG 1Generating a T. brucei ORFeome. (A) ORFeome cloning strategy: attB site addition to T. brucei ORFs during PCR amplification, BP Gateway cloning into pDONR221 to generate the pENTR ORFeome, and LR Gateway cloning into T. brucei-specific pDEST (pSUN6) (see Fig. S1 in the supplemental material) to generate the complete pTrypLib ORFeome. (B) Assessment of PCR amplification by SYBR green relative fluorescence units (RFU). Each color represents one of the 21 384-well plates, and each dot represents a PCR in a single well as measured by SYBR green RFU. (C) Percentages of PCR-positive wells (SYBR assessment) for each of the 21 384-well plates, from the first time amplified. (D) Agarose gel bands from each of the original 21 384-well PCR plates pooled prior to gel extraction and cloning, from the first time amplified, compared to 1-kb DNA ladder.
384-well oligo plates
| Plate name | Min length (bp) | Max length (bp) | No. of ORFs per plate |
|---|---|---|---|
| 0_hypothetical | 102 | 375 | 373 |
| 0_known | 147 | 591 | 315 |
| 1_hypothetical | 375 | 522 | 374 |
| 1_known | 594 | 849 | 365 |
| 2_hypothetical | 522 | 666 | 372 |
| 2_known | 849 | 1056 | 363 |
| 3_hypothetical | 669 | 822 | 382 |
| 3_known | 1056 | 1287 | 367 |
| 4_hypothetical | 822 | 993 | 372 |
| 4_known | 1287 | 1524 | 348 |
| 5_hypothetical | 993 | 1155 | 365 |
| 5_known | 1524 | 1857 | 342 |
| 6_hypothetical | 1158 | 1365 | 375 |
| 6_known | 1857 | 2337 | 362 |
| 7_hypothetical | 1368 | 1635 | 378 |
| 7_known | 2340 | 3504 | 376 |
| 8_hypothetical | 1635 | 1953 | 381 |
| 9_hypothetical | 1953 | 2508 | 381 |
| 10_hypothetical | 2508 | 3501 | 381 |
| hypothetical_last | 3504 | 4488 | 135 |
| known_last | 3507 | 4497 | 138 |
| Total | 7245 |
FIG 2Assessment of pENTR and pTrypLib plasmid libraries. (A) Bar graph showing the number of targeted ORFs with zero detectable aligned reads from the first round of cloning (pENTR_1 and pTrypLib_1) and after both rounds of cloning (pENTR_Final and pTrypLib_Final) using analyses generated from both uniquely and multiply aligned reads. (B) Histograms showing the distribution of ORF lengths for the target gene list (red) and the set of ORFs with zero detectable aligned reads after both rounds of cloning (labeled as missing). Analyses from unique (dark blue) and multiply (light blue) aligned reads are shown. (Inset graph) Target ORF lengths have been left out to better visualize the lengths of the missing ORFs. (C) Histogram showing the distribution of normalized read counts for each ORF in the pooled pTrypLib plasmid libraries (uniquely aligned reads shown) (see Fig. S2 for both uniquely and multiply aligned reads).
FIG 3Generation and validation of the T. brucei GoF library. (A) Transfection of pTrypLib ORFeome into parental landing pad (LP) cell line harboring pRPaSce* plasmid for I-SceI-induced enzymatic cleavage of a single rDNA spacer site to increase transfection efficiency, as previously reported (6). (B) Inducible expression of a low-complexity GoF library measured by RT-qPCR following 12 and 24 h of doxycycline induction compared to that in uninduced cells (no Dox). (C) Generation of the pTrypLib ORFeome-based GoF parasite library. Graph shows the recovery of GoF library-harboring cells (blue line) compared to that from mock transfection (red line) in blasticidin (BSD) (“BSD recover” indicates recovery of the selected GoF library) added at time 0, 12 h posttransfection. *, cells spun and resuspended in 300 ml HMI-9; †, addition of 500 ml HMI-9; ○, time of GoF library harvest. (D) Assessment of the number of ORFeome genes present in the GoF library following initial transfection [GoF Lib (BSD recover), blasticidin-recovered population] and following freeze-thaw and 3 days of growth to generate GoF_L, which was then used to generate NGS libraries using two alternative protocols (see Materials and Methods) resulting in GoF_L1 and GoF_L2.
FIG 4Isolation of melarsoprol survivor populations from a GoF screen. (A) Growth of landing pad (LP) parental cell line in 17 nM (blue line), 26 nM (purple line), 35 nM (red line), or no (black line) melarsoprol (black line). (B) GoF library screen in 35 nM melarsoprol treatment: LP cell line, solid red line; uninduced GoF library, red dotted line; induced GoF library, red dashed line. Dotted and dashed lines overlap. (C) GoF genetic screen in 17 nM melarsoprol. Timeline at the bottom of the graph indicates days on which either Dox (+Dox), melarsoprol (+Drug), or both (+/+) were added. All cultures (other than GoF_L) were continuously grown in the presence of 17 nM melarsoprol. On days 3, 6, and 7, the triplicate cultures were centrifuged and resuspended in fresh medium with melarsoprol and Dox for induction (noted as spin +/+). Biological triplicate cultures are as follows: GoF_L, untreated GoF library-harboring cells grown for 3 days (black line); LP parental cell line, solid blue line; uninduced GoF library (no Dox), blue triangles on dotted line; induced GoF library (+ Dox), blue circles on dashed line, harvested on day 9 (red circle on blue line) to produce MEL1. On day 7, biological triplicates from GoF_MEL1 +Dox (blue circles on dashed line) were split into two sets of triplicate samples, both in 17 nM melarsoprol, one of which was not further induced (no Dox, green triangles on dotted line). The other continued to be induced (+Dox, green squares on dashed green line) and was harvested on day 10 to produce MEL2 (red circle indicates harvest). (D) Principal-component analysis (PCA) comparing GoF_L libraries (L1 and L2) (see Materials and Methods) with libraries arising following continuous melarsoprol selection (MEL1 and MEL2). (E) Table of sample names and NGS sequencing samples with full description.
FIG 5Identification of significantly overrepresented ORFs in melarsoprol GoF survivor populations. (A) The numbers of genes with changes of >1.5-, 2-, or 4-fold for comparisons among all three replicates (Rep1, Rep2, and Rep3) of GoF_L2. (B) Hit-calling pipeline to identify genes overrepresented in melarsoprol survivor populations. (C) Volcano plot showing the −log10 P adjusted values versus log2 fold change in normalized counts for the comparison of melarsoprol-selected MEL1/GoF_L2 for each ORF in the targeted library. Blue dots represent overrepresented ORFs common to all four comparisons described in panel D. (D) Venn diagram illustrates the significantly overrepresented genes common to each comparison between GoF_L (GoF_L1 and GoF_L2) and melarsoprol treated (MEL1 and MEL2) and shared among all comparisons between replicates, resulting in 57 overrepresented hits identified in melarsoprol survivor populations compared to those in GoF_L populations.
Hits overrepresented in melarsoprol survivors (comparison of GoF_L2 vs. MEL1)
| Gene ID | Description (Tryptag 062920) | Localization (TrypTag or proteome) | PubMed | Category | Fold change overrepresented | DESeq |
|---|---|---|---|---|---|---|
|
| Hypothetical protein, conserved | Cytoplasm (points, weak), endocytic | Endocytic+ | 597 | 3.77E-42 | |
|
| Hypothetical protein, conserved | Strong mitochondrial signal | Mitochondrial* | 350 | 1.81E-39 | |
|
| Hypothetical protein, conserved, XAC1 | Cytoplasm | 26784394 | Gene Expression# | 322 | 2.83E-86 |
|
| Gamma-glutamylcysteine synthetase, GSH1 | Nucleoplasm | 8663359 | Biosynthetic# | 191 | 2.19E-43 |
|
| Eukaryotic translation initiation factor eIF2Â | Strong cytoplasm signal | 24945722 | Gene Expression# | 126 | 4.50E-25 |
|
| Dynein light chain lc6 | Strong flagellum axoneme signal | Flagellar+ | 120 | 1.30E-26 | |
|
| Ring finger domain containing protein, ubiqutin ligasê | Strong endocytic system signal | Endocytic+ | 113 | 3.13E-40 | |
|
| Hypothetical protein, conserved | Nuclear localization (mass spectrometry) | Nuclear+ | 97 | 5.89E-48 | |
|
| Translation initiation factor EIF-2b alpha subunit̂ | Cytoplasm | Gene Expression# | 81 | 1.14E-48 | |
|
| Stress granule protein | Localization to starvation stress granules | 26187993 | Stress Granule# | 53 | 2.04E-39 |
|
| Zinc finger CCCH domain-containing protein 8, ZC3H8 | Cytoplasm (points) | 26784394 | Gene Expression# | 53 | 2.45E-13 |
|
| Mediator of RNA polymerase II transcription subunit 7, MED-T7 | Nucleoplasm (points) | 20876299 | Gene Expression# | 49 | 1.98E-18 |
|
| Ring finger domain containing protein̂ | Moderate nuclear lumen signal, weak cytoplasm signal | Nuclear+ | 47 | 7.14E-27 | |
|
| Hypothetical protein, conserved | Nuclear, mass spec | Nuclear○ | 39 | 3.88E-14 | |
|
| tRNA-specific adenosine deaminase, ADAT3 | Cytoplasm (reticulated, weak) | 17483465 | Gene Expression# | 37 | 1.59E-18 |
|
| Hypothetical protein, conserved | Cytoplasm (points, reticulated) | 26784394 | Gene Expression# | 35 | 1.10E-22 |
|
| Hypothetical protein, conserved | Weak axoneme signal | Flagellar+ | 31 | 4.45E-20 | |
|
| Hypothetical protein, conserved | Endocytic, cytoplasm | Endocytic+ | 31 | 9.17E-31 | |
|
| LSU ribosomal protein, mitochondrial̂ | Kinetoplast, mitochondrion | 18364347 | Mitochondrial* | 31 | 2.16E-34 |
|
| Hypothetical protein, conserved | Kinetoplast, mitochondrion | Mitochondrial* | 31 | 5.61E-28 | |
|
| Exosome-associated protein 3,3' exoribonuclease, putative, EAP3 | Nucleoplasm | Nuclear+ | 28 | 9.55E-27 | |
|
| Zinc finger domain, LSD1 subclasŝ | Cytoplasm (reticulated) | Zinc finger | 25 | 8.84E-47 | |
|
| Hypothetical protein, conserved | Endocytic, cytoplasm | Endocytic+ | 24 | 3.59E-12 | |
|
| Hypothetical protein, conserved | Cytoplasm (patchy, points) | Mitochondrial & ER○ | 23 | 1.14E-23 | |
|
| 3-Oxoacyl-ACP reductasê | Mitochondrion, kinetoplast (strong) | 17166831 | Mitochondrial# | 23 | 2.16E-34 |
|
| Phosphoglycerate mutase, iPGAM̂ | Mitochondrion (75%), kinetoplast (75%) | Mitochondrial* | 23 | 6.79E-05 | |
|
| AN1-like zinc finger-containing protein̂ | Cytoplasm | Zinc finger | 20 | 1.50E-30 | |
|
| Conserved protein | Endocytic, cytoplasm (weak) | Endocytic+ | 19 | 2.60E-21 | |
|
| Hypothetical protein, conserved | Cytoplasm, flagellar cytoplasm, | Flagellar + | 19 | 3.79E-27 | |
|
| Isy1-like splicing family | Nucleoplasm | 9250687 | Gene Expression# | 16 | 8.36E-27 |
|
| Mitochondrial ribosomal protein S18̂ | Cytoplasm (reticulated) | 18951088 | Mitochondrial# | 16 | 3.68E-10 |
|
| Ring finger domain-containing protein̂ | Flagellar pocket (ring) | Flagellar + | 16 | 1.50E-18 | |
|
| Hypothetical protein, conserved | Cytoplasm (points) | 24945722 | Gene Expression | 15 | 1.78E-09 |
|
| DUF2407 ubiquitin-like domain-containing protein̂ | Flagellum matrix proteome (BSF) | 24741115 | Flagellar* | 14 | 2.78E-12 |
|
| CMGC/CLK family protein kinasê | Nucleoplasm, cytoplasm | 24453978 | Kinase | 14 | 2.82E-35 |
|
| Hypothetical protein, conserved | Nucleus, cytoplasm (reticulated) | Nuclear+ | 13 | 2.19E-05 | |
|
| Zinc finger domain, LSD1 subclasŝ | Tagging not successful, ND | Mitochondrial & ER○ | 13 | 7.88E-22 | |
|
| Hypothetical protein, conserved | Paraflagellar rod | Flagellar* | 13 | 6.29E-06 | |
|
| Hypothetical protein, conserved | Nucleoplasm | Nuclear+ | 13 | 1.28E-11 | |
|
| Temperature-dependent protein affecting M2 dsRNA replication̂ | Cytoplasm (weak) | 20592024 | Gene Expression# | 12 | 6.62E-16 |
|
| Small nuclear ribonucleoprotein SmD3, putative, SmD3 | Nucleoplasm | 10900267 | Gene Expression# | 11 | 4.78E-11 |
|
| Haemolysin-III related̂ | Cytoplasm | Pathogenesis | 10 | 8.31E-27 | |
|
| Heat shock 70-kDa protein, HSP70̂ | Cell tip (anterior), cytoplasm, flagellar cytoplasm | 30506377 | Flagellar * | 10 | 3.15E-26 |
|
| Divalent cation transporter̂ | Cytoplasm (reticulated) | Trafficking | 9 | 7.16E-54 | |
|
| Splicing factor 3B subunit 10, SF3b10̂ | Nucleoplasm | Gene Expression | 9 | 1.80E-49 | |
|
| Archaic translocase of outer membrane 14-kDa subunit, ATOM14 | Mitochondrion | 22267727 | Mitochondrial# | 8 | 9.21E-11 |
|
| Hypothetical protein, conserved | Basal body | Flagellar + | 8 | 5.86E-10 | |
|
| Leucine-rich repeat-containing protein | Hook complex | Flagellar + | 8 | 1.61E-10 | |
|
| Small GTP-binding protein RAB6̂ | Golgi apparatus | Golgi+ | 8 | 7.70E-22 | |
|
| Hypothetical protein, conserved | Plasma membrane (posterior) | 18242729 | Glycosomal# | 8 | 2.28E-09 |
|
| Hypothetical protein, conserved | Cytoplasm | 24945722 | Gene Expression# | 7 | 7.02E-19 |
|
| Hypothetical protein, conserved | Mitochondrion | Mitochondrial+ | 7 | 8.48E-21 | |
|
| U5Cwc21 small nuclear ribonucleoprotein | Nucleoplasm | 19429779 | Gene Expression# | 7 | 8.83E-10 |
|
| Hypothetical protein, conserved | Not in the genome version used for tagging | ND | 7 | 9.18E-10 | |
|
| Translation initiation factor IF-2̂ | Cytoplasm | Gene Expression | 6 | 1.12E-11 | |
|
| Conserved protein, unknown function | Cytoplasm (reticulated) | 26784394 | Gene Expression# | 6 | 1.34E-05 |
|
| Hypothetical protein, conserved | Axoneme [50%], cytoplasm | Flagellar + | 6 | 3.88E-10 |
Tryptag descriptions containing the word “putative” have been replaced with ^. Category data sources: +, TrypTag; ○, proteome; *, both; #, published.
FIG 6Melarsoprol resistance following gene overexpression. Induced expression of four hits (red lines), in comparison to those in parental cells (SM, black lines), during melarsoprol treatment with cell viability measured by alamarBlue assay to measure the resulting EC50: (A) Tb927.10.12370. (B) Tb927.11.590. (C) Tb927.7.2780. (D) Tb927.9.15020. (E) Relative EC50s following overexpression of each gene for at least 3 biological replicates. P values were derived from one-way analysis of variance (ANOVA) with Dennett’s multiple-comparison test. *, P < 0.05; **, P < 0.01; ***, P < 0.001.
FIG 7Categorization of GoF Hits. Hits arising from melarsoprol survival screening are shown proximal to bloodstream-form T. brucei cell cartoon. Rectangular boxes indicate the number of hits occurring in each major GoF screen category (see Table 2 for details). Italicized gene names in boxes are shown for genes whose induced expression promoted melarsoprol resistance (Fig. 6). The cell diagram also highlights the flagellum and flagellar pocket with the melarsoprol transporter AQP2 localized as seen in bloodstream form (34, 64). Trypanothione (T[SH]2) biosynthesis and redox pathways are loosely depicted as follows: T(SH)2 biosynthesis is highly simplified showing the rate-limiting enzyme GSH1, which was identified in the melarsoprol GoF screen; T(SH)2 provides reducing equivalents to tryparedoxin (Tpx), which is used to reduce disulfides (not shown), peroxidases (Px), and ribonucleotide reductase (RR) for the reduction of hydroperoxides and generation of dNTPs, respectively. T(SH)2 and Tpx are also utilized in the mitochondrion for redox reactions that include reduction of peroxidases (mPx). Melarsoprol uptake, conversion to melarsen oxide, binding with T(SH)2 to from the stable adduct MelT, and its inhibition of trypanothione reductase (TR), which prevents the conversion of trypanothione disulfide back to T(SH)2, are all indicated in red. Green and red spheres at the flagellar pocket indicate import and export pathways, respectively.