What Is Protein Engineering?
Protein engineering is a powerful biotechnological process that focuses on creating new enzymes or proteins and improving the functions of existing ones by manipulating their natural macromolecular architecture.1
Each protein contains a unique genetically encoded sequence of amino acids. Protein synthesis occurs through translation and is based on mRNA codons.2 Scientists use recombinant DNA technology to modify codons and develop diverse proteins with potentially enriched activities.3
Genetic engineering technologies that enable cloning of any gene found in nature and DNA chemical synthesis have immensely contributed to the protein engineering field. In addition, technological advancements such as x-ray crystallography and computer modeling help researchers design amino acid sequences that fold into precise 3D structures, synthesizing proteins with specific properties.4
Protein Engineering Methods
Protein engineering encompasses multiple strategies including rational design, directed evolution, semirational design, peptidomimetics, and de novo protein design. Scientists use these strategies to develop novel proteins or optimize existing protein properties that are relevant to medicine and biotechnology.5 Researchers then screen newly developed protein variants to identify those with desirable functions. For this, they have developed efficient screening methods such as fluorescence activated cell sorting (FACS) and phage display technology to examine large libraries of synthetic proteins and enzymes.6
Rational method
Rational design is the classical protein engineering method that involves site directed mutagenesis.7 Scientists perform specific point mutations via insertions or deletions in the coding sequence based on structural and functional knowledge of the target protein. Typically, they mutate coding regions that correspond to a protein’s activity.
A key limitation of the rational method is that researchers must know a protein’s structural, functional, and molecular information. Although the rational protein design approach offers an increased possibility of beneficial alterations, it is not easy to accurately predict the sequence-structure-function relationship, particularly at the single amino acid level.7 However, artificial intelligence (AI) has substantially improved protein structure prediction based on amino acid sequence, which is vital for rational design strategies and newer engineering methods, such as semirational and de novo protein design.
In comparison to other methods such as directed evolution, rational design is less time consuming as it does not require large library screening. Scientists use this strategy to engineer protein-based vaccines, antibodies, and enzymes with high thermal stability and catalytic efficiency to meet industrial demands.8
Directed evolution
In 2018, Frances H. Arnold won the Nobel Prize in Chemistry for the directed evolution of enzymes. The prize was shared with George P. Smith and Sir Gregory P. Winter for the phage display of peptides and antibodies. The directed evolution method is a robust protein engineering technique that generates random mutations in a gene of interest, followed by rapid protein variant selection based on favorable properties for specific applications.7
Scientists commonly use error-prone polymerase chain reaction (EP-PCR) to generate random mutations throughout a gene or gene region.7 This method does not require any prior information regarding the protein’s structure and mechanisms, as it mimics the process of natural evolution. The success of the directed evolution method lies in generating mutant libraries of significant size and diversity.
Semirational protein design
Semirational protein design is a combination of rational and directed evolution methods.9 Scientists consider this strategy more effective because they can use computational or bioinformatic modeling to obtain information on the protein’s function and structure and, therefore, select the most promising protein region to change.
This results in a small but high-quality library. The semirational protein design approach provides researchers with an increased opportunity to select biocatalysts with a wider substrate range, specificity, selectivity, and stability without compromising on their catalytic efficiency.
Peptidomimetics
Peptidomimetics is the design and synthesis of metabolically stable peptide analogs that mimic or block natural enzyme or peptide functions.5 This approach employs a variety of biological techniques including solid phase synthesis of nonpeptide libraries that extend the range of amino acid sequences incorporated into engineered proteins.10 Peptidomimetics also uses combinatorial approaches that employ multiple synthetic biology techniques and result in rapid protein variant generation.
De novo protein design
In 2024, David Baker won the Nobel Prize in Chemistry for computational protein design and the prize was shared with Demis Hassabis, and John Jumper for protein structure prediction.
Scientists use de novo protein design to synthesize proteins with specific structural and functional properties from scratch.11 For example, researchers use this strategy to generate proteins that fold into a particular topology, bind to a specific target, or contain a particular catalytic site. Machine learning models such as denoising diffusion probabilistic models (DDPM) enable photorealistic image generation to visualize protein folding and support de novo protein design.12 Researchers have improved diffusion models by integrating powerful structure prediction methods such as RoseTTAFold (RF) and AlphaFold2. The RF-based diffusion model can design diverse functional proteins from simple molecular specifications.11
Autonomous Protein Engineering Systems
Because the creation of new proteins with improved or novel functions can be slow and labor intensive, scientists have developed autonomous platforms to accelerate the process.13 These platforms computationally estimate mutation effects, particularly when screening by combinatorial techniques is difficult. Furthermore, screening methods such as mass spectrometry (MS), however efficient and specific they are, require time-consuming sample preparation steps. This shortcoming is overcome by recently developed autosamplers that use electrospray ionization (ESI) for fast sample preparation.14
Additionally, robot scientists and self-driving laboratories combine laboratory experiments and automated learning and reasoning to accelerate new biomolecule design. For example, the fully autonomous protein engineering platform Self-driving Autonomous Machines for Protein Landscape Exploration (SAMPLE) is equipped with AI programs that learn protein sequence–function relationships and design new proteins. Subsequently, a fully automated robotic system performs experiments to test the designed proteins and provide feedback.15
Protein Engineering Applications
As industrial enzymes are often sourced from mesophilic organisms, they are typically active in moderate reaction conditions.16 However, an ideal industrial enzyme must withstand harsh conditions such as extreme temperature, pH, and salinity. Scientists use protein engineering techniques to improve the properties of industrially important enzymes such as lipases, esterases, amylases, proteases, xylanases, and cellulases for high specificity, thermostability, and catalytic efficiency.17 There are numerous protein engineering applications, including biocatalysts for food and industry, medicine, and the environment. Additionally, remarkable progress in protein engineering over the past decade has improved therapeutics by enabling researchers to produce antivirals, vaccine antigens, and drug-delivery nanovehicles.18
Table 1: Protein engineering applications19-22
Application Area | Examples of Engineered Enzymes and Proteins | Mutagenesis Approach | Mutant Properties |
Basic protein science | Semirational approach | Stability | |
Detergent industry | Alkaline proteases | Site directed mutagenesis and/or random mutagenesis | High activity at alkaline pH and low temperatures |
Food industry | α-amylase | Site directed mutagenesis | Thermostability |
Medicine | Insulin | Site directed mutagenesis | Fast acting monomeric insulin |
Agriculture | 5-enolpyruvyl-shikimate-3-phosphatesynthase | EP-PCR | Enhanced kinetic properties and confer herbicide tolerance (glyphosate) |
Tissue engineering | Site directed mutagenesis | Enhanced elasticity and self-assembling properties | |
Nanobiotechnology | Site directed mutagenesis | Highly-conductive protein nanowires |
Protein Engineering Challenges
Protein design not only offers multiple opportunities in terms of applications, but also presents challenges due to knowledge gaps around folding mechanisms, which are the physiochemical principles underlying protein stability and interactions with the environment. Computational methods allow scientists to generate 3D protein structures, which help elucidate the process of protein folding; however, it is not easy to manipulate the factors that determine protein conformation for targeted purposes.
Furthermore, it is difficult to accurately predict the protein conformational changes that happen during the process of binding with other molecules.18 This information is vital to determine how designed proteins respond to the environment. Researchers focus on overcoming these challenges by using machine learning tools and computational design methods to generate new proteins with favorable properties.
References
- Li C, et al. Protein engineering for improving and diversifying natural product biosynthesis. Trends Biotechnol. 2020;38(7):729-744.
- Schimmel P, Alexander RW. Protein synthesis. In: Encyclopedia of Physical Science and Technology. Academic Press;2003:219-240.
- Morbioli GG, et al. Recombinant drugs-on-a-chip: The usage of capillary electrophoresis and trends in miniaturized systems – A review. Analytica Chimica Acta. 2016;935:44-57.
- Kuhlman B, Bradley P. Advances in protein structure prediction and design. Nat Rev Mol Cell Biol. 2019;20:681-697.
- Turanli-Yildiz B, et al. Protein engineering methods and applications. InTech. 2012.
- Xiao H, et al. High throughput screening and selection methods for directed enzyme evolution. Ind Eng Chem Res. 2015;54(16):4011-4020.
- Sellés Vidal L, et al. A primer to directed evolution: current methodologies and future directions. RSC Chem Biol. 2023;4(4):271-291.
- Liu R, et al. Advances in protein engineering and its application in synthetic biology. In: New Frontiers and Applications of Synthetic Biology. Academic Press;2022:147-158.
- Chica RA, et al. Semi-rational approaches to engineering enzyme activity: Combining the benefits of directed evolution and rational design.Curr Opin Biotechnol. 2005;16(4):378-384.
- Vagner J, et al. Peptidomimetics, a synthetic tool of drug discovery. Curr Opin Chem Biol. 2008;12(3):292-296.
- Watson JL, et al. De novo design of protein structure and function with RFdiffusion. Nature. 2023;620(7976):1089-1100.
- Guo Z, et al. Diffusion models in bioinformatics and computational biology. Nat Rev Bioeng. 2024;2(2):136-154.
- Setiawan D, et al. Recent advances in automated protein design and its future challenges. Expert Opin Drug Discov. 2018;13(7):587-604.
- Alexovič M, et al. Recent advances in robotic protein sample preparation for clinical analysis and other biomedical applications. Clinica Chimica Acta. 2020;507:104-116.
- Rapp JT, et al. Self-driving laboratories to autonomously navigate the protein fitness landscape.Nat Chem Eng. 2024;1(1):97-107.
- Mesbah NM. Industrial biotechnology based on enzymes from extreme environments. Front Bioeng Biotechnol. 2022;10:870083.
- Rigoldi F, et al. Review: Engineering of thermostable enzymes for industrial applications. APL Bioeng. 2018;2(1):011501.
- Listov D, et al. Opportunities and challenges in design and optimization of protein function.Nat Rev Mol Cell Biol. 2024;25(8):639-653.
- Goh MK, et al. Trends and tips in protein engineering, a review. J Teknol 2021;59(1).
- Yurkova MS, Fedorov AN. GroEL-A versatile chaperone for engineering and a plethora of applications.Biomolecules. 2022;12(5):607.
- Wang Y, et al. Protein-engineered functional materials. Adv Healthc Mater. 2019;8(11):e1801374.
- Shapiro DM, et al. Protein nanowires with tunable functionality and programmable self-assembly using sequence-controlled synthesis. Nat Commun. 2022;13(1):1-10.