scispace - formally typeset
Search or ask a question
Journal ArticleDOI

Synthesis of native proteins by chemical ligation.

01 Jan 2000-Annual Review of Biochemistry (Annual Reviews 4139 El Camino Way, P.O. Box 10139, Palo Alto, CA 94303-0139, USA)-Vol. 69, Iss: 1, pp 923-960
TL;DR: The facile access to novel analogs provided by chemical protein synthesis has led to original insights into the molecular basis of protein function in a number of systems.
Abstract: ▪ Abstract In just a few short years, the chemical ligation of unprotected peptide segments in aqueous solution has established itself as the most practical method for the total synthesis of native proteins. A wide range of proteins has been prepared. These synthetic molecules have led to the elucidation of gene function, to the discovery of novel biology, and to the determination of new three-dimensional protein structures by both NMR and X-ray crystallography. The facile access to novel analogs provided by chemical protein synthesis has led to original insights into the molecular basis of protein function in a number of systems. Chemical protein synthesis has also enabled the systematic development of proteins with enhanced potency and specificity as candidate therapeutic agents.

Summary (7 min read)

INTRODUCTION: Protein Science in the Postgenome Era

  • An important current objective in biomedical research is to understand the molecular basis of the numerous and intricate biological activities of proteins and therefore to be able to predict and control these activities.
  • These attempts to incorporate other amino acids have had very limited successobtaining adequate amounts of pure protein from the cell-free translation systems can be a significant challenge (8) , and many unnatural amino acids are simply not compatible with ribosomal polypeptide synthesis (9) .
  • The use of synthetic chemistry promises the unlimited variation of the covalent structure of a polypeptide chain with the objective of understanding the molecular basis of protein function.

DOMAINS: Building Blocks of the Protein World

  • Because proteins are large molecules, applying chemical synthesis to them is a considerable challenge.
  • Furthermore, the biological functions of proteins originate in the tertiary structure of the protein molecule-that is, in the precise three-dimensional folded structure of the polypeptide chain.
  • Protein domains are defined as autonomous units of folding and, frequently, of function (17, 18) .
  • The challenge confronting the chemist is, first, the total synthesis of folded domains and then the ability to stitch these domains together to build complex protein molecules.

CHEMICAL PROTEIN SYNTHESIS: The State of the Art in 1990

  • Since last reviewed in this journal (19) , total chemical synthesis of native proteins has made a number of important contributions to biomedical research.
  • The existence of this virally encoded aspartyl proteinase had been postulated based on an analysis of viral nucleic acid sequence data, and molecular genetic studies had indicated that its action in processing the gag-pol polyprotein was essential to the viral life cycle (21) .
  • The first preparations of the enzyme of defined molecular composition were produced by chemical synthesis (22) , using a highly optimized version of stepwise solid-phase peptide synthesis (19) .
  • In a strikingly important contribution, total chemical synthesis was also used to prepare large amounts of homogeneous enzyme for the determination of the original crystal structures of the HIV-1 protease molecule .
  • These structural data were made freely available to the research community and formed the foundation for the successful worldwide programs of structure-based drug design (28) that led to the development of the highly effective protease inhibitor class of acquired immune deficiency syndrome therapeutic agents (29) .

SYNTHETIC-PEPTIDE CHEMISTRY: Useful but Bounded

  • At the start of the decade of the 1990s total chemical synthesis, by the standard methods of peptide chemistry of even a small protein molecule remained a daunting task, often requiring large teams and taking years to complete, with no guarantee of success.
  • Classical solution synthetic chemistry involves the preparation of fully protected peptide segments and their subsequent condensation in organic solvents for the convergent synthesis of large polypeptides (36) .
  • This poor solubility made such protected peptide segments difficult to work with, and the low concentrations attainable for reacting segments often led to slow and incomplete reactions (37, 38) .
  • This ingenious chemical synthesis method, the progenitor of all polymersupported organic chemistry, was introduced in 1963 by Merrifield (13) .
  • Such methods have not found widespread use.

CHEMICAL LIGATION OF UNPROTECTED PEPTIDE SEGMENTS

  • As recently as 1991 (48) , the challenge remained: namely, to develop methods that enable the general application of the tools of chemistry to the world of the protein molecule.
  • Based on this premise, in the early 1990s the principle of chemoselective reaction (49) was adapted to enable the use of unprotected peptide segments in chemical protein synthesis (50) .
  • The product polypeptide is obtained directly in final form.
  • A variety of ligation chemistries has been used (Table 1 ), and the chemical ligation of unprotected peptide segments has provided access to a range of protein targets.

NATIVE CHEMICAL LIGATION

  • Only a single reaction product is obtained, even in the presence of additional Cys residues in either segment.
  • Simply mixing together two peptide segments that contain correctly designed, mutually reactive functionalities led to the formation of a single polypeptide product containing a native peptide bond at the ligation site.
  • This highly chemoselective reaction is performed in aqueous solution at neutral pH under denaturing conditions.
  • Detailed studies of mechanistic aspects of the native chemical ligation reaction have been published (63, 65) .

Conformationally Assisted Ligation

  • In some cases, folding conditions can be used to accelerate the rate of native chemical ligation (76) .
  • In these cases, the revealed N and C terminals of the peptide fragments are located in close proximity at the site of chain scission.
  • This greatly increases the collision frequency, and a weakly activated C-terminal group such as a thioester can be used to religate the fragments.
  • Total synthesis of proteins using this approach has been demonstrated with the chymotrypsin inhibitor CI2 .
  • When two synthetic peptide segments spanning the CI2 molecule, one incorporating a C-terminal thioester and the other an N-terminal cysteine, are mixed together under folding conditions, conformationally-assisted ligation proceeds in <2 min, compared with several hours for chemical ligation under denaturing conditions (76) .

SCOPE OF NATIVE CHEMICAL LIGATION FOR THE SYNTHESIS OF PROTEINS

  • The first applications of native chemical ligation were to small, Cys-rich proteins such as disulfide-cross-linked secretory proteins or the zinc-finger proteins.
  • In all, >300 biologically active proteins from >20 different families have been successfully prepared by total chemical synthesis with this method.
  • These are still early results in what will surely be more widespread application of the method, but they demonstrate routine synthetic access to single-domain proteins and suggest that native chemical ligation will provide the basis of a general synthetic access to the world of proteins.

FOLDING SYNTHETIC PROTEINS

  • The activity of a protein molecule originates in the precise tertiary structure of the folded polypeptide chain.
  • (Top) Ribbon structure of the crystalline dimer.
  • The successful syntheses of such proteins suggests that this ability to accurately fold synthetic polypeptide chains may hold true both for single domains and for more complex proteins.
  • Control of the folding process can be particularly important in the production of proteins that are toxic to the cell, such as proteolytic enzymes.
  • This control over enzymatic activity was one of the key features of the success of chemical protein synthesis in the early work on the HIV-1 protease (20) .

Noncoded Amino Acids

  • Demonstrating the power of the chemical protein synthesis method, large amounts of each protein analog were made, purified, and fully characterized.
  • Noncoded amino acids are frequently found in native proteins in vivo.
  • One common modification of this type is γ -carboxy-glutamic acid (Gla), found for example in the eponymous Gla domains in plasma proteins.
  • This multidomain protein consists of an N-terminal Gla domain that contains 11 Gla residues, which is followed by a thrombin-sensitive region, three epidermal growth factor domains, and a sex hormone-binding globulinlike region.
  • Folding of this polypeptide chain produced a three-domain protein, microProtein S, that displayed anticoagulant cofactor activity.

Precise Covalent Modification

  • The ability to prepare native proteins by total synthesis, using chemical ligation of unprotected peptide segments, provides a convenient and general route to sitespecific modification of the protein molecule.
  • The full range of synthetic peptide and peptidomimetic chemistry (86) is at the command of the researcher who wants to make precise and controlled changes in a protein's covalent structure.
  • Such changes are not limited by the genetic code or by the strictures of the ribosomal machinery.
  • An early example of the utility of this approach was the total chemical synthesis of (BTD) HIV-1 protease (87), a protein in which the Gly-Gly sequence found in a β-turn in the native protein (20) was replaced by the sterically constrained bicyclic compound BTD, a rigid mimetic of type II β-turn geometry (88) .
  • The resulting enzyme showed full activity and a significantly enhanced thermostability (87) .

Site-Specific Tagged Proteins

  • Chemical synthesis enables the specific labeling of a protein molecule at unique site(s).
  • Such specific modification is less likely to perturb the structure or activity of the protein than uncontrolled reaction with labeling reagents that stochastically target all amino or other particular functional groups in the protein.
  • Fluorescently tagged proteins are extremely useful tools for biology and drug discovery, and synthesis of native proteins by chemical ligation allows the facile incorporation of fluorescent dye molecules at any desired position in a protein molecule.
  • Dye chelator-labeled proteins have been made for time-resolved fluorescence studies, in which it is possible to largely eliminate background emission by use of suitably "time-gated" detection (C Hunter, G Kochendoerfer, 89a).
  • Finally, total chemical synthesis allows the ready introduction of affinity tags, such as biotin, at precise sites in the protein molecule, while preserving biological activity, again something that is straightforward with chemistry.

Backbone Engineering

  • Another intriguing example of site-specific modification of the protein molecule, enabled by chemical ligation, is the covalent modification of the polypeptide backbone itself.
  • A functionally important peptide bond (i.e. backbone amide) in the HIV-1 protease molecule was site-specifically replaced by a thioester moiety in each monomer of the homodimeric protein molecule, to investigate the direct involvement of that specific peptide bond in the mechanism of action of the enzyme, as suggested by the X-ray crystallographic data (20, 56) .
  • The results of these studies showed that the two flap regions of the homodimeric native HIV-1 protease molecule work analogously to the single flap moiety in the two-domain, single-polypeptide chain, cell-encoded aspartyl proteinases (90) .
  • More recently, an engineered backbone structure was introduced into bovine pancreatic trypsin inhibitor, by replacing one Cys residue involved in forming a disulfide bond with an NGly, to investigate the effects of such a substitution on the folding, activity, structure, and stability of the resulting protein molecule (92) .
  • The side chain of the Cys residue has effectively been moved to the backbone amide N atom.

Protein Medicinal Chemistry

  • Synthetic access enables the systematic application of the principles of medicinal chemistry to the protein molecule itself.
  • An example is the total chemical synthesis of the potent anti-HIV molecule AOP-RANTES (79) .
  • The chemical protein analog NNY-RANTES, which resulted from the first phase of this program, is >30-fold more effective as an anti-HIV compound and has been shown to prevent HIV infection at low nanomolar concentrations in the huPBL-SCID mouse model for acuired immune deficiency syndrome (93) .
  • NNY-RANTES is the most potent known anti-HIV compound.
  • It is believed to work by inhibiting receptor recycling (94) , thus clearing CCR5 from the surface of peripheral blood cells, a mechanism distinct from current clinical therapies for acquired immune deficiency syndrome.

Rapid Access to Functional Gene Products

  • In the past few years, an important new application has emerged for chemical protein synthesis-to enable rapid access to functional wild-type protein molecules directly from gene sequence data .
  • Success of the genome projects has resulted in the discovery of >100,000 new proteins (1).
  • The probable roles of many of these predicted proteins can be tentatively assigned by analogy to proteins of known function, using bioinformatics.
  • Nevertheless, the precise biochemical properties of a mature gene product can only be assessed at the level of the protein molecule itself.
  • Synthesis of native proteins by chemical ligation of unprotected peptides can provide access in a matter of days to large (10 + mg) amounts of functional protein molecules of exquisite homogeneity, based directly on gene sequence data.

Structural Biology

  • Facile access to the large (i.e. multiple tens of milligram) amounts of high-purity preparations produced by chemical protein synthesis can be of great utility for studies of protein structure by NMR spectroscopy and by X-ray crystallography.
  • New methods for NMR spectroscopy have considerably enhanced the speed with which the structure of small (i.e. <200-aa-residue) proteins can be determined.
  • Examples include, the chemokine SDF-1α (81), the chemical protein analog AOP-RANTES , and the mirror-image enzyme molecule D-HIV-1 protease .
  • Such high-throughput structure determination will require access to great numbers of proteins in high purity and large amount.
  • Chemical protein synthesis by the methods described here is well suited to provide the proteins needed for genomic structural biology.

Expressed Protein Ligation

  • From its inception, the native chemical ligation method was also envisioned for use with peptides that are produced by recombinant means (62) .
  • There are now multiple examples of the chemical ligation of recombinant peptides.
  • These recombinant products can be reacted with synthetic peptide-thioesters to generate native polypeptides of hybrid biological and chemical origin.
  • More recently, intein-based protein expression vectors have been adapted to generate polypeptide thioesters by recombinant means for use in native chemical ligation (104, 105) .
  • With the approaches described above, both the peptidethioester and the N-terminal Cys peptide can be of recombinant origin.

Solid-Phase Protein Synthesis

  • The principles of polymer-supported organic synthesis (13, 19, 111) have been applied to the chemical ligation of unprotected peptide segments in aqueous solution [(112); Figure 17 ].
  • In solid-phase chemical ligation, unprotected peptide segments of 35-50 amino acids (i.e. ∼5 kDa each) are used as building blocks to assemble the target polymer-bound polypeptide by consecutive ligation on a water-compatible polymer support.
  • Strategies for segment condensation in both the N-to-C and C-to-N directions have been used successfully for solid-phase protein synthesis (112) and alternative linker chemistries developed (112a).
  • Target molecules have been constructed from as many as eight peptide segments by solid-phase chemical ligation [e.g. the polypeptide of the tissue plasminagen activator catalytic domain; M w 25,000 (W Lu, unpublished data), and the polypeptide chain of the enzyme secretory PLA2 GV has beenassembled in a single day Figure 16 Expressed protein ligation (104) .
  • This process uses intein-mediated (73) preparation of a recombinant peptide-α thioester, which is then reacted with a Cys-peptide segment by native chemical ligation to prepare the desired product.

Membrane Proteins

  • An important aspect of the study of proteins which have been predicted from gene sequence data is the integral membrane class of proteins.
  • Computer-aided analysis of the predicted open reading frames from a number of completely sequenced (13, 19, 111) are applied to solid-phase protein synthesis.
  • After removal of the Cys-protecting group (PG), successive rounds of ligation can be carried out to give the polymer-bound target polypeptide.
  • Yet integral membrane proteins are difficult to express at high levels by recombinant-DNA-based methods and have proven hard to isolate in homogeneous form in chemically defined media (114) .
  • The M2 protein had previously proven refractory to multiple attempts at expression by recombinant-DNA-based methods (W Degrado, personal communication), but was readily obtained by chemical ligation of unprotected synthetic peptides.

Glycoprotein Synthesis

  • Recently, the Bertozzi laboratory (116) reported the first total synthesis of a glycoprotein, using native chemical ligation in conjunction with innovative methods for the synthesis of glycopeptide-α thioesters.
  • One of the most important applications of chemical protein synthesis will be the systematic preparation of glycoforms of gylcosylated proteins as homogeneous molecular species of defined covalent structure, to establish the role of the carbohydrate moiety in the biological function of the glycoprotein.

FUTURE DEVELOPMENTS Ligation Sites

  • In its current form, native ligation chemistry uses a Cys residue at the site of formation of the new peptide bond joining two unprotected peptide segments.
  • The work of Muir and coworkers is illustrative of this expedient but effective approach (66, 104, 110) .
  • Also, biological researchers frequently insert Cys residues into a polypeptide chain to investigate the structurefunction relationships in a protein molecule (119) or as a site for the introduction of a spectroscopic probe, such as an electron spin resonance label (120) .
  • This proven utility of arbitrarily introduced Cys residues provides considerable flexibility in synthetic design for the preparation of functional protein molecules by native chemical ligation at Cys.
  • Additionally, it would be desirable to have the option to use thioester-mediated chemical ligation at residues other than Cys.

Size of Protein Targets

  • To date, it has proved possible to make every protein that has been attempted by the chemical ligation of unprotected peptide segments in aqueous solution, even integral membrane proteins.
  • Some targets are significantly more work than others-especially if there are multiple intermediate ligation products to handle.
  • The recently developed solid-phase protein synthesis method (see above), using polymer-supported chemical ligation (112), provides a very effective means for the ready isolation of these intermediate products and will significantly simplify syntheses requiring ligation of multiple segments.
  • The work of their own and others' laboratories, including the laboratories of Offord (University of Geneva, Switzerland) and Muir (Rockefeller University, New York, NY), has failed to show any inherent size limitations for application of the chemical ligation method, up to several-hundred kilodaltons in the latter case (122) .
  • Folding of chemically synthesized polypeptide chains to form native proteins, in which significant problems might have been anticipated, is usually straightforward for the domain size proteins made to date.

Chemical Synthesis of Peptide Segments

  • Virtually any target protein can be prepared by total chemical synthesis, provided that a suitable set of high-purity peptide-thioester segments is available.
  • Ironically, for many researchers the most challenging aspect of applying the chemical ligation method to proteins is making the peptide segments.
  • To date, the principal constraint on widespread application of the native ligation method has been the lack of methods for the facile chemical synthesis of unprotected peptide-α thioester segments.
  • The need to make large numbers of analogs of thousands of native proteins by chemical ligation, and hence to prepare many tens-of-thousands of peptide segments, provides an unprecedented impetus for the development of efficient methods of peptide synthesis.
  • The authors can look forward with confidence to the development of radically improved methods for the rapid, costeffective preparation of large numbers of unprotected peptide-thioester segments for use in chemical protein synthesis (124) .

SUMMARY AND CONCLUSIONS

  • Total synthesis by the chemical ligation of unprotected peptide segments can now provide general access to native proteins of ≤30 kDa in size.
  • This size range encompasses the structural and functional domains that are the modular building blocks of function in the protein world, from enzymes to receptors, from signal transduction adaptor molecules to large multisubunit protein assemblies.
  • This will enable the dissection at the level of the protein molecule of important biochemistry, such as the intracellular signal transduction pathways.
  • The stage is now set for the application of the tools of chemistry to the entire universe of proteins.
  • "Where nucleic acids are the codes, proteins are the substance of life" (125) .

Did you find this useful? Give us your feedback

Figures (20)

Content maybe subject to copyright    Report

P1: FDC/FGE P2: FDC
July 12, 2000 11:26 AR102 CHAP29
Annu. Rev. Biochem. 2000. 69:923–60
Copyright
c
2000 by Annual Reviews. All rights reserved
SYNTHESIS OF NATIVE PROTEINS
BY
CHEMICAL LIGATION
Philip E. Dawson
1
and Stephen B. H. Kent
2
1
The Scripps Research Institute, La Jolla, California 92037; e-mail: dawson@scripps.edu;
2
Gryphon Sciences, South San Francisco, California 94080; e-mail: skent@gryphonsci.
com
Key Words chemical protein synthesis, thioester, protein, peptide, solid phase
synthesis, polymer-supported synthesis, protein engineering
Abstract In just a few short years, the chemical ligation of unprotected peptide
segments in aqueous solution has established itself as the most practical method for the
total synthesis of native proteins. A wide range of proteins has been prepared. These
synthetic molecules have led to the elucidation of gene function, to the discovery of
novel biology, and to the determination of new three-dimensional protein structures by
both NMR and X-ray crystallography. The facile access to novel analogs provided by
chemicalproteinsynthesishasledtooriginalinsightsintothemolecularbasisofprotein
function in a number of systems. Chemical protein synthesis has also enabled the
systematic developmentof proteins with enhanced potency and specificity as candidate
therapeutic agents.
CONTENTS
INTRODUCTION: Protein Science in the Postgenome Era ................... 924
DOMAINS: Building Blocks of the Protein World
......................... 925
CHEMICAL PROTEIN SYNTHESIS: The State of the Art in 1990
.............926
SYNTHETIC-PEPTIDE CHEMISTRY: Useful but Bounded
.................926
CHEMICAL LIGATION OF UNPROTECTED PEPTIDE SEGMENTS
.........929
NATIVE CHEMICAL LIGATION
.................................... 933
BIOCHEMICAL PEPTIDE LIGATION
................................ 935
Protein Splicing
................................................ 935
Conformationally Assisted Ligation
.................................. 938
SCOPE OF NATIVE CHEMICAL LIGATION FOR THE SYNTHESIS
OF PROTEINS
................................................. 939
At the time of the invitation, as now, Stephen Kent is President and Chief Scientist at
Gryphon Sciences. Gryphon Sciences is focused on the development and sale of enhanced
protein therapeutics using chemical protein synthesis. The core technology of the company
is largely the subject matter of the chapter we have submitted.
0066-4154/00/0707-0923/$14.00
923

P1: FDC/FGE P2: FDC
July 12, 2000 11:26 AR102 CHAP29
924 DAWSON
¥
KENT
FOLDING SYNTHETIC PROTEINS ..................................940
CASE STUDIES IN THE APPLICATION OF CHEMICAL
PROTEIN SYNTHESIS
........................................... 944
Noncoded Amino Acids
.......................................... 944
Precise Covalent Modification
...................................... 945
Site-Specific Tagged Proteins
...................................... 945
Backbone Engineering
........................................... 946
Protein Medicinal Chemistry
....................................... 946
Rapid Access to Functional Gene Products
............................. 947
Structural Biology
.............................................. 947
CURRENT DEVELOPMENTS
...................................... 949
Expressed Protein Ligation
........................................ 949
Solid-Phase Protein Synthesis
...................................... 950
Membrane Proteins
............................................. 951
Glycoprotein Synthesis
........................................... 953
FUTURE DEVELOPMENTS
........................................ 954
Ligation Sites
.................................................. 954
Size of Protein Targets
........................................... 955
Chemical Synthesis of Peptide Segments
.............................. 956
SUMMARY AND CONCLUSIONS
................................... 956
INTRODUCTION: Protein Science in the Postgenome Era
An important current objective in biomedical research is to understand the molecu-
larbasis ofthe numerousand intricatebiological activitiesof proteins and therefore
to be able to predict and control these activities. The importance of this goal is
dramatically increased today because of the explosive success of the genome-
sequencing projects, which have revealed hundreds of thousands of new proteins,
but only as predicted sequence data (1). For the biologist, elucidation of the bi-
ological function of a predicted protein molecule is thus a challenge of great
significance. In the words of Freeman Dyson, “[In the post-genome era], proteins
will emerge as the big problem and the big opportunity. When this revolution oc-
curs, it will have a more profound effect than the Human Genome Project on the
future of science and medicine” (2).
For the past 20 years, most studies of the molecular basis of protein action have
been carried out by recombinant DNA-based expression of proteins in genetically
engineered cells (3). From its introduction, this powerful method revolutionized
the study of proteins by enabling the production of large amounts of proteins of
defined molecular composition and by allowing the systematic variation of the
amino acid sequence of proteins (4). Expression of proteins in engineered cells
is now a mature technology, and its scope and limitations are well understood:
(a) Small proteins (i.e. <30 kDa) are easier to express than large, multidomain
proteins; (b) folding of large-protein molecules can also be a challenge; (c) product

P1: FDC/FGE P2: FDC
July 12, 2000 11:26 AR102 CHAP29
CHEMICAL PROTEIN SYNTHESIS 925
heterogeneity is frequently a problem, caused by uncontrolled processing of the
nascent polypeptide in the cell; and (d) the overexpression of proteins that are toxic
to the cell, such as proteases, can be difficult (5).
Additionally, because the cell is used as a protein factory, such molecular bi-
ology studies are inherently limited to the 20 genetically encoded amino acids.
Efforts have been made to use cell-free synthesis to expand the repertoire of ri-
bosomal synthesis to include noncoded amino acids as building blocks (6, 7).
These attempts to incorporate other amino acids have had very limited success—
obtaining adequate amounts of pure protein from the cell-free translation systems
can be a significant challenge (8), and many unnatural amino acids are simply not
compatible with ribosomal polypeptide synthesis (9).
Chemical synthesis is an attractive alternative to biological methods of protein
production. The use of synthetic chemistry promises the unlimited variation of
the covalent structure of a polypeptide chain with the objective of understanding
the molecular basis of protein function. Chemistry also promises the ability to
systematically tune the properties of a protein molecule in a completely general
fashion.
This vision was one of the prime imperatives of organic chemistry in the time of
Emil Fischer at the beginning of the 20th century. In a 1905 letter to Adolf Baeyer,
Fischer wrote, “My entire yearning is directed toward the first synthetic enzyme.
If its preparation falls into my lap with the synthesis of a natural protein material,
I will consider my mission fulfilled” (10). In the decades since then, the challenge
of applying the methods of chemistry to the study of protein action has stimulated
numerous advances in synthetic methods. Historically, these advances included
the use of novel reversible protecting groups (11), novel activation methods for the
formation of covalent bonds (12), and even polymer-supported synthesis (13), all
of which sprang from the drive to apply the science of chemistry to the study of
proteins.
DOMAINS: Building Blocks of the Protein World
Because proteins are large molecules, applying chemical synthesis to them is a
considerable challenge. Furthermore, the biological functions of proteins orig-
inate in the tertiary structure of the protein molecule—that is, in the precise
three-dimensional folded structure of the polypeptide chain. The typical protein
molecule is 30 kDa in size and consists of two 15-kDa domains (14–16);
each domain has a polypeptide chain length of 130 (±40) amino acids (14–16).
Protein domains are defined as autonomous units of folding and, frequently, of
function (17, 18). As such, domains are the building blocks of the protein world.
The challenge confronting the chemist is, first, the total synthesis of folded do-
mains and then the ability to stitch these domains together to build complexprotein
molecules.

P1: FDC/FGE P2: FDC
July 12, 2000 11:26 AR102 CHAP29
926 DAWSON
¥
KENT
CHEMICAL PROTEIN SYNTHESIS: The State
of the Art in 1990
Since last reviewed in this journal (19), total chemical synthesis of native pro-
teins has made a number of important contributions to biomedical research. It is
notable that the Kent laboratory at the California Institute of Technology used total
chemical synthesis based on predicted gene sequence data to carry out pioneering
studies of human immunodeficiency virus 1 (HIV-1) protease enzyme (20). The
existence of this virally encoded aspartyl proteinase had been postulated based on
an analysis of viral nucleic acid sequence data, and molecular genetic studies had
indicated that its action in processing the gag-pol polyprotein was essential to the
viral life cycle (21). For this reason, the HIV-1 protease was, early on, proposed as
an important target for drug development. The first preparations of the enzyme of
defined molecular composition were produced by chemical synthesis (22), using a
highly optimized version of stepwise solid-phase peptide synthesis (19). This work
proved that the active form of the HIV-1 protease was a homodimer consisting of
two identical 99-residue polypeptide chains, and it showed that the chemically
synthesized enzyme accurately processed the putative cleavage sites in the viral
gag-pol translation product (22).
In a strikingly important contribution, total chemical synthesis was also used to
prepare large amounts of homogeneous enzyme for the determination of the orig-
inal crystal structures of the HIV-1 protease molecule (Figure 1). The structure of
the unliganded synthetic enzyme (23) corrected a seriously flawed low-resolution
structure (24) that had been obtained by using protein derived from recombinant
expression in Escherichia coli. Even more significantly, use of chemically synthe-
sized enzyme provided the first high-resolution cocrystal structures of the HIV-1
protease molecule complexed with substrate-derived inhibitors (25–27). These
structural data were made freely available to the research community and formed
the foundation for the successful worldwide programs of structure-based drug de-
sign (28) that led to the development of the highly effective protease inhibitor class
of acquired immune deficiency syndrome therapeutic agents (29).
SYNTHETIC-PEPTIDE CHEMISTRY: Useful but Bounded
Despite successful syntheses of the HIV-1 protease (22) and of a limited number
of other proteins (30–35), at the start of the decade of the 1990s total chem-
ical synthesis, by the standard methods of peptide chemistry of even a small
protein molecule remained a daunting task, often requiring large teams and tak-
ing years to complete, with no guarantee of success. The routine, reproducible
preparation of synthetic polypeptides of defined chemical structure was limited
to products of 50 amino acid residues (19; Figure 2). This size limitation ap-
plied equally to synthesis by solution or by solid-phase methods, but for differing
reasons.

P1: FDC/FGE P2: FDC
July 12, 2000 11:26 AR102 CHAP29
CHEMICAL PROTEIN SYNTHESIS 927
Figure 1 Crystal structures of chemically synthesized HIV-1 protease. These were the
original high-resolution structures (23, 25–27) of this protein and guided the subsequent
drug design programs. The synthetic protein preparation used for X-ray crystallography
contained
L-α-amino-n-butyricacidresiduesinplaceofthetwoCysresiduesineachsubunit.
(Left) Molscript representation of the synthetic enzyme in complex with the substrate-
derived inhibitor MVT101 (25). (Right upper panel) 2Fo-Fc electron density map for the
side chains of the unnatural amino acids used to replace the two Cys residues in each subunit
of the synthetic enzyme (23). (Lower panel) Side chains of the
L-α-amino-n-butyric acid
residues superimposed on the mercury atoms from Cys-containing enzyme (24) that has
been crystallized in the same space group. This shows that the side chains of the unnatural
amino acid have the same conformation as the natural Cys side chains. (Adapted from
References 23 and 25).
Figure 2 Historical progress in the size of synthetically accessible polypeptides.

Citations
More filters
Journal ArticleDOI
TL;DR: The bioorthogonal chemical reactions developed to date are described and how they can be used to study biomolecules.
Abstract: The study of biomolecules in their native environments is a challenging task because of the vast complexity of cellular systems. Technologies developed in the last few years for the selective modification of biological species in living systems have yielded new insights into cellular processes. Key to these new techniques are bioorthogonal chemical reactions, whose components must react rapidly and selectively with each other under physiological conditions in the presence of the plethora of functionality necessary to sustain life. Herein we describe the bioorthogonal chemical reactions developed to date and how they can be used to study biomolecules.

2,537 citations

Journal ArticleDOI
TL;DR: The development of new orthogonal aminoacyl-tRNA synthetase/tRNA pairs has led to the addition of approximately 70 unnatural amino acids to the genetic codes of Escherichia coli, yeast, and mammalian cells, which provide new opportunities to generate proteins with enhanced or novel properties and probes of protein structure and function.
Abstract: The development of new orthogonal aminoacyl-tRNA synthetase/tRNA pairs has led to the addition of approximately 70 unnatural amino acids (UAAs) to the genetic codes of Escherichia coli, yeast, and mammalian cells. These UAAs represent a wide range of structures and functions not found in the canonical 20 amino acids and thus provide new opportunities to generate proteins with enhanced or novel properties and probes of protein structure and function.

1,554 citations


Cites background from "Synthesis of native proteins by che..."

  • ...Similarly, p-azidophenylalanine (7), which absorbs in a clear spectral window at ∼2100 cm−1 and is sensitive to changes in electrostatic environment, was incorporated site specifically at various sites in rhodopsin....

    [...]

  • ...The resulting thioester linkage undergoes an acyl rearrangement to form a native peptide bond, joining the two smaller peptides (7)....

    [...]

Journal ArticleDOI
TL;DR: Chemistries that Facilitate Nanotechnology Kim E. Sapsford,† W. Russ Algar, Lorenzo Berti, Kelly Boeneman Gemmill,‡ Brendan J. Casey,† Eunkeu Oh, Michael H. Stewart, and Igor L. Medintz .
Abstract: Chemistries that Facilitate Nanotechnology Kim E. Sapsford,† W. Russ Algar, Lorenzo Berti, Kelly Boeneman Gemmill,‡ Brendan J. Casey,† Eunkeu Oh, Michael H. Stewart, and Igor L. Medintz*,‡ †Division of Biology, Department of Chemistry and Materials Science, Office of Science and Engineering Laboratories, U.S. Food and Drug Administration, Silver Spring, Maryland 20993, United States ‡Center for Bio/Molecular Science and Engineering Code 6900 and Division of Optical Sciences Code 5611, U.S. Naval Research Laboratory, Washington, D.C. 20375, United States College of Science, George Mason University, 4400 University Drive, Fairfax, Virginia 22030, United States Department of Biochemistry and Molecular Medicine, University of California, Davis, School of Medicine, Sacramento, California 95817, United States Sotera Defense Solutions, Crofton, Maryland 21114, United States

1,169 citations

Journal ArticleDOI
15 Aug 2003-Science
TL;DR: A general and rapid route for the addition of unnatural amino acids to the genetic code of Saccharomyces cerevisiae is described, providing a gateway to the systematic expansion of the genetic codes of multicellular eukaryotes.
Abstract: We describe a general and rapid route for the addition of unnatural amino acids to the genetic code of Saccharomyces cerevisiae. Five amino acids have been incorporated into proteins efficiently and with high fidelity in response to the nonsense codon TAG. The side chains of these amino acids contain a keto group, which can be uniquely modified in vitro and in vivo with a wide range of chemical probes and reagents; a heavy atom-containing amino acid for structural studies; and photocrosslinkers for cellular studies of protein interactions. This methodology not only removes the constraints imposed by the genetic code on our ability to manipulate protein structure and function in yeast, it provides a gateway to the systematic expansion of the genetic codes of multicellular eukaryotes.

857 citations

References
More filters
Book
01 Jan 2003
TL;DR: In this paper, the authors present a method for the synthesis of organic compounds using Spectroscopic methods and Spectral Spectral Methods (SSTM) with a focus on alicyclic and aliphatic compounds.
Abstract: 1. Organic synthesis 2. Experimental techniques. 3. Spectroscopic methods. 4. Solvents and reagents. 5. Aliphatic compounds. 6. Aromatic compounds. 7. Selected alicyclic compounds. 8. Selected heterocyclic compounds. 9. Investigation and characterisation of organic compounds. 10. Physical constants of organic compounds.

6,578 citations

PatentDOI
04 May 1995-Science
TL;DR: The technique of native chemical ligation is employable for chemically synthesizing full length proteins as discussed by the authors, which are chemically identical to proteins produced by cell free synthesis, and can be refolded and/or oxidized to form native disulfide-containing protein molecules.
Abstract: Proteins of moderate size having native peptide backbones are produced by a method of native chemical ligation. Native chemical ligation employs a chemoselective reaction of two unprotected peptide segments to produce a transient thioester-linked intermediate. The transient thioester-linked intermediate then spontaneously undergoes a rearrangement to provide the full length ligation product having a native peptide bond at the ligation site. Full length ligation products are chemically identical to proteins produced by cell free synthesis. Full length ligation products may be refolded and/or oxidized, as allowed, to form native disulfide-containing protein molecules. The technique of native chemical ligation is employable for chemically synthesizing full length proteins.

3,347 citations

Journal ArticleDOI
John M. Coffin1
27 Jan 1995-Science
TL;DR: Results lead to a simple steady-state model in which infection, cell death, and cell replacement are in balance, and imply that the unique feature of HIV is the extraordinarily large number of replication cycles that occur during infection of a single individual.
Abstract: Several recent reports indicate that the long, clinically latent phase that characterizes human immunodeficiency virus (HIV) infection of humans is not a period of viral inactivity, but an active process in which cells are being infected and dying at a high rate and in large numbers. These results lead to a simple steady-state model in which infection, cell death, and cell replacement are in balance, and imply that the unique feature of HIV is the extraordinarily large number of replication cycles that occur during infection of a single individual. This turnover drives both the pathogenic process and (even more than mutation rate) the development of genetic variation. This variation includes the inevitable and, in principle, predictable accumulation of mutations such as those conferring resistance to antiviral drugs whose presence before therapy must be considered in the design of therapeutic strategies.

2,020 citations

Journal ArticleDOI
TL;DR: Detailed statistical analyses of integral membrane proteins of the helix‐bundle class from eubacterial, archaean, and eukaryotic organisms for which genome‐wide sequence data are available suggest that uni‐cellular organisms appear to prefer proteins with 6 and 12 transmembrane segments, whereas Caenorhabditis elegans and Homo sapiens have a slight preference for proteins with seven transmemBRane segments.
Abstract: We have carried out detailed statistical analyses of integral membrane proteins of the helix-bundle class from eubacterial, archaean, and eukaryotic organisms for which genome-wide sequence data are available. Twenty to 30% of all ORFs are predicted to encode membrane proteins, with the larger genomes containing a higher fraction than the smaller ones. Although there is a general tendency that proteins with a smaller number of transmembrane segments are more prevalent than those with many, uni-cellular organisms appear to prefer proteins with 6 and 12 transmembrane segments, whereas Caenorhabditis elegans and Homo sapiens have a slight preference for proteins with seven transmembrane segments. In all organisms, there is a tendency that membrane proteins either have many transmembrane segments with short connecting loops or few transmembrane segments with large extra-membraneous domains. Membrane proteins from all organisms studied, except possibly the archaeon Methanococcus jannaschii, follow the so-called "positive-inside" rule; i.e., they tend to have a higher frequency of positively charged residues in cytoplasmic than in extra-cytoplasmic segments.

1,519 citations

Frequently Asked Questions (1)
Q1. What are the contributions in this paper?

The facile access to novel analogs provided by chemical protein synthesis has led to original insights into the molecular basis of protein function in a number of systems.