The genetic incorporation of
non-canonical amino acids (NCAAs) into recombinant proteins has provided
researchers with powerful tools for the study of protein structure
and function. The ability to genetically incorporate NCAAs with bioorthogonal
functional groups, coupled with click chemistry, allows for the introduction
of site-selective protein modifications that have been used for fluorescence
labeling[1−8]and photo-cross-linking,[9−13] as well as for the study of posttranslational modifications that
are traditionally difficult or impossible to obtain in bacterial expression
systems.[14−18] Other analytic probes that have been site-selectively introduced
using the genetic incorporation approach include infrared, Raman,
NMR, and EPR spectroscopic probes and heavy atoms for X-ray crystallographic
studies.[19−21] Generally, NCAAs are introduced into a protein through
the suppression of an in frame nonsense or quadruplet codon by a suppressor
tRNA coupled with an evolved aminoacyl-tRNA synthetase (aaRS). One
widely used synthetase for this purpose is the tyrosyl-tRNA synthetase
(MjTyrRS) derived from Methanocaldococcus
jannaschii.[22] Indeed, directed
evolution of MjTyrRS, coupled with a suppressor tRNACUATyr, for the suppression
of the TAG (amber) codon has been highly successful at the incorporation
of a number of NCAAs into proteins in Escherichia coli. Despite the success of engineering the MjTyrRS-tRNACUATyr pair, its
weakness lies in that it cannot be used to incorporate NCAAs into
eukaryotic organisms owing to the fact that tRNACUATyr contains a recognition element
that allows for the aminoacylation of tRNACUATyr with canonical amino acids by the
endogenous aaRSs in eukaryotic organisms.[23,24] Alternatives to the MjTyrRS-tRNACUATyr pair include the EcTyrRS-tRNACUATyr pair, which was derived from E. coli tyrosyl-tRNA
synthetase-tRNATyr pair, the EcLeuRS-tRNACUALeu pair derived
from E. coli leucyl-tRNA synthetase-tRNALeu pair, and the pyrrolysyl-tRNA synthetase (PylRS)-tRNACUAPyl pair, which
naturally exists in certain methanogenic archaea and some bacteria.
Both EcTyrRS-tRNACUATyr and EcLeuRS-tRNACUALeu pairs can
be applied in eukaryotic cells but not in bacterial systems.[11,25,26] However, tRNACUAPyl is a naturally occurring
amber suppressor tRNA that is specifically recognized by PylRS but
does not cross-interact with endogenous aaRSs in both bacterial and
eukaryotic cells.[27−31]To date,
the PylRS-tRNACUAPyl pair has been successfully used for the
genetic incorporation of NCAAs into recombinant proteins expressed
in bacteria and a variety of eukaryotes including mammalian cells
and multicellular organisms.[32,33] While the ability of
the PylRS-tRNACUAPyl pair to be used in both bacterial and eukaryotic cells is advantageous
in that it allows for the same NCAA to be used in different model
systems, the method still has limitations. The incorporation of a
new NCAA typically requires selection of large libraries of aaRS with
randomized active site mutations. Schultz and co-workers have developed
a powerful method for the directed evolution of aaRSs involving both
positive and negative selections based on the ability to suppress
a nonsense mutation in the presence of the desired NCAA.[34] However, this process still requires multiple
rounds of selection for each new substrate. Therefore, a single aaRS
with broad substrate promiscuity is ideal to facilitate rapid progress
and widespread use of the genetic NCAA incorporation technique. We
have previously reported the ability of a single rationally designed
PylRS mutant (PylRS(N346A/C348A)), coupled with tRNACUAPyl, to genetically incorporate
a wide variety of phenylalanine derivatives with large para and small meta substitutions at amber mutation
sites in living cells.[35−37] More recent studies indicate that this mutant is
also able to recognize more than 10 phenylalanine derivatives with
large meta substitutions (Tuley and Liu, unpublished
data). PylRS(N346A/C348A) displays high activities toward these NCAAs
while remaining relatively orthogonal toward canonical amino acids.
Here we reveal that the same enzyme is also able to mediate the genetic
incorporation of seven ortho-substituted phenylalanine
derivatives, which brings the total substrate availability of this
enzyme close to 40 and leads to the conclusion that PylRS(N346A/C348A)
is able to recognize phenylalanine derivatives with a substitution
at any side-chain aromatic position as a substrate. To our knowledge,
PylRS(N346A/C348A) is the only aaRS with such a high substrate promiscuity.
We also demonstrate, for the first time, the ability of PylRS(N346A/C348A)
to facilitate the genetic incorporation of phenylalanine derivatives
in mammalian cells. Aside from o-nitro-phenylalanine,[38] which has been genetically encoded in E. coli using an evolved MjTyrRS-tRNACUATyr pair, this
constitutes the first report of the genetic incorporation for all
of these ortho-substituted phenylalanine derivatives.[39] One of the ortho-substituted
phenylalanine derivatives, o-cyano-phenylalanine
(oCNF) is a small, environmentally sensitive, fluorescent
probe that can be used as a sensor to probe the local environment
of proteins, and we applied this to study protein folding/unfolding.Wang and co-workers previously determined the crystal structure
of a PylRS mutant (OmeRS) in complex with O-methyl-tyrosine.[40] Figure 1 presents the
active site structure of this enzyme–substrate complex superimposed
on the wild-type PylRS with pyrrolysyl-AMP bound. The two overlaid
structures clearly show that the N346 side chain is adjacent to the ortho position of the O-methyl-tyrosine
side-chain phenyl group. Removing the N346 side chain would potentially
leave space for adding an additional substitution to this position.
Since PylRS(N346A/C348A) is expected to bind to a phenylalanine derivative
in a similar fashion as OmeRS, we suspect that PylRS(N346A/C348A)
may well recognize an ortho-substituted phenylalanine
derivative and, together with tRNACUAPyl, mediate its incorporation at an amber mutation
site. We exploited an E. coli cell system harboring
plasmids pEVOL-pylT-PylRS(N346A/C348A) and pET-pylT-sfGFPS2TAG to
assess the ability of the PylRS(N346A/C348A)-tRNACUAPyl pair to incorporate ortho-substituted phenylalanine derivatives in response
to an amber mutation. The plasmid pEVOL-pylT-PylRS(N346A/C348A) contains
genes coding for PylRS(N346A/C348A) and tRNACUAPyl, while pET-pylT-sfGFPS2TAG contains
genes coding for tRNACUAPyl and superfolder green fluorescent protein (sfGFP) with
an amber mutation at its S2 position and a 6 × His tag at the
C-terminus.[35] Growing cells in GMML medium
(a minimal medium supplemented with 1% glycerol and 0.3 mM l-leucine) led to translation termination at the amber mutation site
of sfGFP; therefore no full-length sfGFP was expressed. However, supplementing
the medium with 2 mM of one of seven ortho-substituted
phenylalanine derivatives (1–7, shown
in Figure 2A) promoted overexpression of full-length
sfGFP (Figure 2B). Incorporation of NCAAs was
confirmed via electrospray ionization mass spectrometry (ESI-MS) (Figure 2C). For each of the seven NCAAs, the detected molecular
weight agreed well with the theoretical value (Table 1). Except for 6, this is the first report of
the genetic incorporation of these NCAAs in living cells, and for
all of the NCAAs this is the first report of their incorporation using
an aaRS derive from PylRS.
Figure 1
Superimposed structures of the OmeRS complex
with O-methyltyrosine and the PylRS complex with
pyrrolysyl-adenosyl monophosphate
(Pyl-AMP). The structures are based on PDB entries 2Q7H and 3QTC. The PylRS complex
with Pyl-AMP is shown in orange for the protein carbon atoms and pink
for the Pyl-AMP carbon atoms. Four mutated residues in OmeRS and the p-methoxy-phenylalanine ligand are shown in cyan for the
carbon atoms. Letters in parentheses indicate the four mutated residues
in OmeRS.
Figure 2
(A) Structures of 1–7. (B) Site-specific
incorporation of 1–7 into sfGFP at
its S2 site. N/A indicates no NCAA was available in the medium. ND
represents nondetected. (C) Deconvoluted ESI-MS of sfGFP with NCAAs 1–7 incorporated at the S2 position. sfGFP-X in which X is one of 1–7 represents the specific sfGFP variant with X incorporated at its S2 site.
Table 1
ESI-MS Characterization of sfGFP Variants
Incorporated with Different NCAAs
NCAA
calculated mass (Da)
detected mass (Da)a
1
27764
27761
2
27808
27804
3
27855
27855
4
27743
27741
5
27759
27759
6
27774
27773
7
27754
27751
With an error of ±1 Da.
Superimposed structures of the OmeRS complex
with O-methyltyrosine and the PylRS complex with
pyrrolysyl-adenosyl monophosphate
(Pyl-AMP). The structures are based on PDB entries 2Q7H and 3QTC. The PylRS complex
with Pyl-AMP is shown in orange for the protein carbon atoms and pink
for the Pyl-AMPcarbon atoms. Four mutated residues in OmeRS and the p-methoxy-phenylalanine ligand are shown in cyan for the
carbon atoms. Letters in parentheses indicate the four mutated residues
in OmeRS.(A) Structures of 1–7. (B) Site-specific
incorporation of 1–7 into sfGFP at
its S2 site. N/A indicates no NCAA was available in the medium. ND
represents nondetected. (C) Deconvoluted ESI-MS of sfGFP with NCAAs 1–7 incorporated at the S2 position. sfGFP-X in which X is one of 1–7 represents the specific sfGFP variant with X incorporated at its S2 site.With an error of ±1 Da.Along with 1–7, two additional
amino acids were tested, namely, o-trifluoromethyl-phenylalanine
(8) and o-fluoro-phenylalanine (9) (Supplementary Figure 1). In
GMML supplemented with 2 mM NCAA 8, we obtained minimal
expression of full-length sfGFP. The mass spectra of the purified
protein suggested the incorporation of both 8 and phenylalanine
at the S2 site (Supplementary Figure 2),
indicating the recognition of 8 by PylRS(N346A/C348A)
is not significantly better than that of phenylalanine itself. Minimal
incorporation of phenylalanine at an amber codon is a typical observation
for evolved MjTyrRS-tRNACUATyr and PylRS-tRNACUAPyl pairs.[41] In a previous study, we found that while PylRS(N346A/C348A)
was able to incorporate m-fluoro-phenylalanine at
an amber codon, ESI-MS analysis of the expressed protein indicated
that this NCAA was also incorporated at phenylalanine codons.[36] Given the size similarity of fluorine and a
proton, we attributed the miss-incorporation of m-fluoro-phenylalanine at phenylalanine sites to an inability of the
endogenous phenylalanyl-tRNA synthetase to distinguish between the
two substrates. Similarly, when testing the incorporation of 9 at the S2 site of sfGFP, we found that the miss-incorporation
rate was high. Indeed, major ESI-MS peaks of the purified protein
suggest the incorporation of 9 into sfGFP at multiple
sites (Supplementary Figure 3). Taken together,
these observations suggest that while the PylRS(N346A/C348A)-tRNACUAPyl pair is able
to direct the genetic incorporation of a monofluorinated phenylalanine
at amber codons, E. coli cells with a more stringent
phenylalanyl-tRNA synthetase that excludes monofluorinated phenylalanine
as a substrate will need to be engineered for cleaner incorporation.In one of our previous publications, we showed that PylRS(N346A/C348A)
recognizes phenylalanine and mediates its incorporation at an amber
mutation site in coordination with tRNACUAPyl.[35] However,
this background incorporation was suppressed when a NCAA that serves
as a better substrate of PylRS(N346A/C348A) was provided. For instance,
the PylRS(N346A/C348A)-tRNACUAPyl pair induced significant background phenylalanine
incorporation at an amber mutation at F27 of sfGFP in LB medium. However,
providing m-trifluoromethyl-phenylalanine obviated
this background incorporation, and the expressed sfGFP had quantitative
occupancy of m-trifluoromethyl-phenylalanine at F27.[36] We used the same system to test the incorporation
efficiencies of seven ortho-substituted phenylalanine
derivatives (2 mM) in LB. Our data show that, except for 5, all NCAAs led to sfGFP expression significantly better (1.8–3.9-fold)
than the background (Supplementary Figure 4). Further, although the sfGFP expression yield in the presence of 5 was comparable to the background, the major peak in the
ESI-MS spectrum of purified sfGFP indicated occupancy of 5 at the designated amber mutation site (Supplementary
Figure 5), indicating 5 was efficiently incorporated.
Presumably, further optimization studies that modify expression conditions,
such as increasing the concentration of 5, may lead to
a higher incorporation efficiency. These data demonstrate that the
practical use of the PylRS(N346A/C348A)-tRNACUAPyl pair in LB for the genetic incorporation
of NCAAs is dependent on the NCAA identities and their concentrations.Next we asked whether the PylRS(N346A/C348A)-tRNACUAPyl pair could be used to incorporate
NCAAs into proteins in mammalian cells. Given that rich media containing
high concentrations of canonical amino acids is required for mammalian
cell growth, PylRS(N346A/C348A) must be more reactive toward a NCAA
than phenylalanine in the medium for its practical use in mammalian
cells. To test the ability of the PylRS(N346A/C348A)-tRNACUAPyl pair to facilitate
the selective incorporation of NCAAs into proteins in mammalian cells,
HEK293T cells were transfected with plasmids containing genes encoding
a C-terminally GFP-fused epidermal growth factor receptor (EGFR-GFP)
with an amber mutation at the N128 position as well as PylRS(N346A/C348A)
and tRNACUAPyl.[42] Cells in cultures supplemented with
5 mM of NCAAs 1 or 3–7 displayed much greater GFP fluorescence intensities compared to
cells in the absence of a NCAA, suggesting enhanced suppression of
the amber codon in the presence of NCAAs (Figure 3). Supplementing cultures with 5 mM 2 led to
cell death. This is likely due to the insolubility of 2 in the growth medium. As expected, a low level of background protein
expression was observed in the control culture, which was probably
attributable to background phenylalanine incorporation at the amber
mutation.
Figure 3
Expression of EGFP-N128→X in HEK293T cells
cotransfected with the plasmid containing genes encoding PylRS(N346A/C348A)
and tRNACUAPyl. Cell cultures were supplemented with 5 mM of 1–7 or no NCAA (NA). Supplementing the cultures with 5 mM of
compound 2 resulted in nearly 100% cell mortality. The
relative ratio of expressed EGFP proteins compared to control based
on the fluorescent intensity is 2.6, 4.3, 3.2, 4.1, 2.2, and 4.9 for
NCAAs 1, 2, 4, 5, 6, and 7, respectively.
Expression of EGFP-N128→X in HEK293T cells
cotransfected with the plasmid containing genes encoding PylRS(N346A/C348A)
and tRNACUAPyl. Cell cultures were supplemented with 5 mM of 1–7 or no NCAA (NA). Supplementing the cultures with 5 mM of
compound 2 resulted in nearly 100% cell mortality. The
relative ratio of expressed EGFP proteins compared to control based
on the fluorescent intensity is 2.6, 4.3, 3.2, 4.1, 2.2, and 4.9 for
NCAAs 1, 2, 4, 5, 6, and 7, respectively.Of the seven ortho-substituted
phenylalanine substrates
available to PylRS(N346A/C348A), 3 has the potential
to be used for protein labeling via the palladium-catalyzed Suzuki–Miyaura
cross-coupling reaction,[37,43] and 6 can
be used to cleave the polypeptide backbone upon irradiation at 365
nm, as a fluorescent distance probe due to its ability to quench the
fluorescence of tryptophan, and as an IR probe due to the strong vibrational
absorption of the nitro group.[38,44,45]7 could also be used as an IR probe due to the strong
vibrational absorption of the nitrile group[19] and as a sensor of protein local environment, protein folding/dynamics,
and protein–ligand interactions due to the environmentally
sensitive fluorescent features of the benzonitrile moiety. Indeed,
an isomeric NCAA of oCNF, p-cyano-phenylalanine
(pCNF), has been widely used as a reporter of protein
local environment to study folding/dynamic analysis and protein–peptide
interactions.[46,47] Additionally, pCNF can be selectively excited in the presence of both tryptophan
and tyrosine and has considerable spectral overlap with both fluorophores;
therefore, its ability to form a Förster resonance energy transfer
(FRET) with these residues was also recently explored.[48,49]oCNF could potentially serve as an alternative
to pCNF and, because of its naturally occurring orthogonal
tRNACUAPyl,
could be used in eukaryotic systems. Since the intermediate polarity
of the nitrile group is well tolerated in both polar and nonpolar
environments and the nitrile group is relatively small, replacing
phenylalanine with oCNF will minimally disturb the
protein’s structure and folding pattern.[49] Our preliminary measurements revealed that 7 has absorption and fluorescence spectra similar to those of pCNF and displays a nearly 10-fold increase in fluorescence
when going from a hydrophobic to hydrophilic solvent (Supplementary Figure 6), suggesting that 7 could be used as a probe of protein local environment.To test whether 7 could be used as a fluorescent probe
of protein local environment, two sfGFP proteins were expressed: sfGFP-N135→7 and sfGFP-F27→7 with 7 incorporated
at N135 and F27, respectively. N135 is solvent exposed and 7 is expected to be highly fluorescent at this site; however, F27
is buried within the hydrophobic interior of the protein, and therefore 7 incorporated at this position should display reduced fluorescence.
Previous measurements indicate that mutations at these two positions
do not significantly alter the folding of sfGFP.[36] The pEVOL-pylT-PylRS(N346A/C348A) plasmid was cotransformed
into E. coli Top10 cells along with pBAD-sfGFP that
contained sfGFP with an amber mutation at either the 27th or 135th
position. Cultures were grown in a synthetic autoinduction medium[50] supplemented with 2 mM NCAA 7 to
afford full-length sfGFP-N135→7 and sfGFP-F27→7 in yields of 140 and 220 mg/L, respectively. No protein
was detected in the absence of 7 (Supplementary Figure 7). Figure 4 shows
the fluorescence spectra of the folded proteins in phosphate buffered
saline at pH 7.5 when excited at 240 nm. In the folded state, sfGFP-F27→7 displays very little fluorescence, consistent with the positioning
of 7 in the hydrophobic interior of the protein. However,
sfGFP-N135→7 displays nearly an 8-fold increase
in fluorescence intensity consistent with 7 being solvent
exposed in this protein. As a control, we also measured the fluorescence
of the wild-type protein under the same conditions. The wild-type
protein displayed fluorescence intensity similar to that of sfGFP-F27→7 with a slightly red-shifted spectrum. These results indicate
that oCNF can be selectively excited in the presence
of both tyrosine and tryptophan, as sfGFP contains one tryptophan
residue and nine tyrosine residues, and demonstrate that 7 serves as an effective indicator of local environment within a protein.
Figure 4
Fluorescence
spectra of sfGFP-F27→7 (red line),
sfGFP-N135→7 (blue line), and wild-type sfGFP
(black line) excited at 240 nm. Proteins used were 20 μM in
phosphate buffered saline at pH 7.5. The difference in fluorescence
is due to the increased solvent exposure of 7 in sfGFP-N135→7. The structure of sfGFP is presented in the left side of
the figure with F27 and N135 labeled.
Fluorescence
spectra of sfGFP-F27→7 (red line),
sfGFP-N135→7 (blue line), and wild-type sfGFP
(black line) excited at 240 nm. Proteins used were 20 μM in
phosphate buffered saline at pH 7.5. The difference in fluorescence
is due to the increased solvent exposure of 7 in sfGFP-N135→7. The structure of sfGFP is presented in the left side of
the figure with F27 and N135 labeled.Next we demonstrated the use of 7 for the study
of
protein folding using the chromodomain of M-phase phosphoprotein 8
(MPP8). MPP8 has been shown to bind specifically to methylated lysine
9 of histone 3 and overexpression of MPP8 is linked to increase tumor
metastasis.[51] The crystal structure of
MPP8 revealed that F59 is located in the hydrophobic cage that forms
a binding pocket for the methylated lysine (Figure 5A).[52] Therefore, in the folded
protein, this residue is not exposed to the solvent and an increase
in the fluorescence of 7 would be expected as the protein
is denatured. The chromodomain of MPP8, containing an amber mutation
at F59, was expressed as an N-terminal fusion protein to sfGFP in E. coli Top10 cells that also expressed the PylRS(N346/C348A)-tRNACUAPyl pair. Fusion
to sfGFP allowed for high levels of the soluble MPP8 to be expressed.[53] The cells were grown in the synthetic autoinduction
medium supplemented with 2 mM NCAA 7 to afford the full
length fusion protein (Supplementary Figure 8). Figure 5B displays the fluorescence spectra
of MPP8 in the presence of varying concentrations of urea. At low
concentrations of the denaturant, a peak corresponding to the fluorescence
of 7 was not detected, consistent with this residue being
buried in the hydrophobic cage. As the concentration of urea is increased
from 0 to 8 molar, an increase in 7 fluorescence at 297
nm was observed accompanied by a decrease in the tryptophan fluorescence
of the protein, which is consistent with both 7 and tryptophan
becoming solvent exposed. No increase in fluorescence at 297 nm was
observed for the wild-type protein (Supplementary
Figure 9). Figure 5C shows the two-state
unfolding curve for MPP8-F59→7 obtained by exciting
the protein at 240 nm and measuring the resulting fluorescence at
297 nm.
Figure 5
(A) The structure of MPP8 (based on the PDB entry 3QO2). F59 is labeled.
(B) Fluorescence spectra of the MPP8-F59→7 at
varied concentrations of urea. (C) Fluorescence intensity of MPP8-F59→7 measured at 297 nm as a function of the urea concentration.
Proteins used were 2.5 μM in phosphate buffered saline at pH
7.5. The excitation wavelength was 240 nm. A seven point, second order
Savitsky–Golay filter was applied to Figure 4B to refine curves.
(A) The structure of MPP8 (based on the PDB entry 3QO2). F59 is labeled.
(B) Fluorescence spectra of the MPP8-F59→7 at
varied concentrations of urea. (C) Fluorescence intensity of MPP8-F59→7 measured at 297 nm as a function of the urea concentration.
Proteins used were 2.5 μM in phosphate buffered saline at pH
7.5. The excitation wavelength was 240 nm. A seven point, second order
Savitsky–Golay filter was applied to Figure 4B to refine curves.In conclusion, we have demonstrated an expanded ability of
the
PylRS(N346A/C348A) mutant to incorporate phenylalanine derivatives
with substitutions at the ortho position. The enzyme
demonstrates remarkable substrate promiscuity while retaining relative
orthogonality toward canonical amino acids, making it a valuable tool
for the genetic incorporation of NCAAs. Nearly 40 phenylalanine derivatives
have been hitherto genetically encoded at amber codons using the PylRS(N346A/C348A)-tRNACUAPyl pair. In addition,
we have demonstrated the ability of PylRS(N346A/C348A) to facilitate
the genetic incorporation of NCAAs in mammalian cell lines. Finally,
we have demonstrated the ability of oCNF to act as
a sensitive fluorescent probe of protein local environment and protein
folding. oCNF shows significant differences in fluorescence
intensity between hydrophobic and hydrophilic environments, and the
nitrile substitution at the ortho position makes
it an extremely small and noninvasive substitution in protein studies.
We believe this advancement will greatly promote the adoption of the
genetic NCAA incorporation approach in a broader biochemistry research
field.
Authors: Yane-Shih Wang; Bo Wu; Zhiyong Wang; Ying Huang; Wei Wan; William K Russell; Pei-Jing Pai; Yin N Moe; David H Russell; Wenshe R Liu Journal: Mol Biosyst Date: 2010-03-30
Authors: Sherry K Blight; Ross C Larue; Anirban Mahapatra; David G Longstaff; Edward Chang; Gang Zhao; Patrick T Kang; Kari B Green-Church; Michael K Chan; Joseph A Krzycki Journal: Nature Date: 2004-08-25 Impact factor: 49.962
Authors: Jason W Chin; Stephen W Santoro; Andrew B Martin; David S King; Lei Wang; Peter G Schultz Journal: J Am Chem Soc Date: 2002-08-07 Impact factor: 15.419
Authors: Carla Polycarpo; Alexandre Ambrogelly; Amélie Bérubé; SusAnn M Winbush; James A McCloskey; Pamela F Crain; John L Wood; Dieter Söll Journal: Proc Natl Acad Sci U S A Date: 2004-08-16 Impact factor: 11.205
Authors: Jason W Chin; T Ashton Cropp; J Christopher Anderson; Mridul Mukherji; Zhiwen Zhang; Peter G Schultz Journal: Science Date: 2003-08-15 Impact factor: 47.728
Authors: Jason W Chin; Andrew B Martin; David S King; Lei Wang; Peter G Schultz Journal: Proc Natl Acad Sci U S A Date: 2002-08-01 Impact factor: 11.205
Authors: Nidhi Kakkar; Jessica G Perez; Wenshe R Liu; Michael C Jewett; Wilfred A van der Donk Journal: ACS Chem Biol Date: 2018-02-21 Impact factor: 5.100