| Literature DB >> 32649182 |
Zachary Wu1, Kevin K Yang1, Michael J Liszka2, Alycia Lee3, Alina Batzilla2, David Wernick2, David P Weiner2, Frances H Arnold1.
Abstract
Short (15-30 residue) chains of amino acids at the amino termini of expressed proteins known as signal peptides (SPs) specify secretion in living cells. We trained an attention-based neural network, the Transformer model, on data from all available organisms in Swiss-Prot to generate SP sequences. Experimental testing demonstrates that the model-generated SPs are functional: when appended to enzymes expressed in an industrial Bacillus subtilis strain, the SPs lead to secreted activity that is competitive with industrially used SPs. Additionally, the model-generated SPs are diverse in sequence, sharing as little as 58% sequence identity to the closest known native signal peptide and 73% ± 9% on average.Entities:
Keywords: Bacillus subtilis; machine learning; protein design; secretion; signal peptides
Mesh:
Substances:
Year: 2020 PMID: 32649182 DOI: 10.1021/acssynbio.0c00219
Source DB: PubMed Journal: ACS Synth Biol ISSN: 2161-5063 Impact factor: 5.110