Advertisement

If you have an ACS member number, please enter it here so we can link this account to your membership. (optional)

ACS values your privacy. By submitting your information, you are gaining access to C&EN and subscribing to our weekly newsletter. We use the information you provide to make your reading experience better, and we will never sell your data to third party members.

ENJOY UNLIMITED ACCES TO C&EN

Environment

Understanding The Workings Of Life

Pinpointing three-dimensional arrangements of proteins has been crucial for understanding the chemistry of living cells

by Sarah Everts
September 9, 2013 | A version of this story appeared in Volume 91, Issue 36

CORRECTION: This story was updated on Sept. 13, 2013, to correct errors: The computers used by Kendrew’s and Perutz’ teams were one-off machines built at the University of Cambridge, called EDSAC-1 and EDSAC-2; they were not commercially available. Also, punched-card computers were in existence before computers that used paper tape to input data, as did EDSAC-1 and -2.

In 1937, a year into his doctoral research, Max F. Perutz was bored. The young Austrian scientist was at Cambridge University, using X-ray crystallography to study mining waste. He was working to solve the crystal structures of mineral fragments in slag heaps, and the samples were “horrible things to work with,” Perutz told Vega Institute historians in a video shot in 2001, a year before he died.

Look at proteins, from hand-drawn masterpieces to electron-density map sculptures. Click here.

PIONEERS
[+]Enlarge
Credit: MRC Laboratory of Molecular Biology
Kendrew (left) and Perutz, the scientists who determined the crystal structures of myoglobin and hemoglobin, with a model of myoglobin in the 1960s.
Kendrew and Perutz with molecular models.
Credit: MRC Laboratory of Molecular Biology
Kendrew (left) and Perutz, the scientists who determined the crystal structures of myoglobin and hemoglobin, with a model of myoglobin in the 1960s.

In the summer of 1937, Perutz took a trip to Prague to visit relatives. In a conversation with a biochemist cousin, Perutz talked about finding a better, more stimulating project. The cousin then made a suggestion that would change Perutz’ life and establish the field of protein structural biology. “He said why don’t [you] take on hemoglobin?”

At the time, top-notch crystallographers were studying the structures of vitamins, amino acids, and peptides, molecules containing hundreds of atoms. Hemoglobin had nearly 10,000 atoms. Nobody had solved the structure of such a large molecule.

But Perutz was already convinced that “if we didn’t understand the structure of proteins we would never understand how life works.” At the time “there was just one method, X-ray crystallography, which offered any hope—but even that was doubtful,” he said.

Perutz’ attempt to solve the structure of hemoglobin would take 22 years.

Before he finally succeeded, in 1959, Perutz’ Cambridge colleagues James D. Watson and Francis H. C. Crick would begin and end their work to solve the structure of DNA, with essential data from Rosalind Franklin. Another colleague, John Kendrew, would start a project to solve the protein structure of myoglobin and finish it in 1957, beating Perutz to the punch by two years, in part by using some of the essential methodology Perutz had developed.

Yet if Perutz ever despaired that his efforts might be in vain—and he did—many thousands of researchers today are grateful for his perseverance as well as for the methods he developed. Visualizing the structures of proteins at atomic resolution underpins the molecular understanding of biology. It is vital for drug research, and it is crucial for engineering enzymes to do industrial work, from fuel refinement to pulp and paper production.

When Perutz finally caught his first glimpse of hemoglobin at 5.5-Å resolution (Nature 1960, DOI: 10.1038/185416a0), he was awestruck. “I finally saw this thing I had been working on for 22 years,” Perutz recalled. “It was like reaching the top of a mountain after a very hard climb and falling in love at the same time,” Perutz said. “That intensity of joy—maybe you find it only in science, when nature reveals its great secrets.”

In contrast, the famous structure of DNA was solved in about a year and a half because the molecule’s basic architecture is simple and symmetrical. “The DNA structure was a pen and pencil operation,” explains Wayne Hendrickson, a structural biology pioneer at Columbia University. At the time it was solved, researchers were proposing theoretical structures for DNA and calculating the X-ray diffraction patterns that would result from crystals of those proposed structures. The theoretical diffraction patterns were then compared with real diffraction data, Hendrickson says. “If you were close, you knew you had the right structure,” he says, which is how Watson and Crick succeeded.

TIMBER TROPHY
[+]Enlarge
Credit: MRC Laboratory of Molecular Biology
Max Perutz carved this balsa wood model of hemoglobin in 1959. Two of the protein’s four subunits are in white, the other two in black. The red objects are heme groups, which are used by the protein to bind oxygen.
Early balsa wood model of hemoglobin. Max Perutz, 1959.
Credit: MRC Laboratory of Molecular Biology
Max Perutz carved this balsa wood model of hemoglobin in 1959. Two of the protein’s four subunits are in white, the other two in black. The red objects are heme groups, which are used by the protein to bind oxygen.

Proteins were far more complicated. “It was doomed to think that you could just think your way through the problem to predict the protein’s diffraction patterns,” Hendrickson says.

Perutz, Kendrew, and their teams needed to use computers to mathematically convert X-ray diffraction patterns, using Fourier transforms, into electron-density maps of three-dimensional proteins. From these electron-density maps, researchers could then identify atoms in the structures.

In the mid-1950s, Kendrew’s and Perutz’ teams began using one-off computers built at the University of Cambridge, called EDSAC-1 and EDSAC-2, to analyze the X-ray diffraction data obtained from crystalline proteins, says Michael G. Rossmann, a structural biologist at Purdue University who helped Perutz solve the first structure of hemoglobin. These computers used paper tape to input data and programs.

But computers weren’t the only advance necessary to solve protein structures from diffraction experiments. Sixteen years into the hemoglobin project—an era he called the dark years—Perutz was still stymied by the so-called phase problem. To mathematically convert diffraction data into electron-density data, researchers needed two important pieces of information: the intensity of diffracted light and the phase of that light. Perutz had figured out how to quantify the intensity of the diffraction pattern spots. But computing the phase of incident X-rays foiled Perutz until 1953.

The eventual solution, called isomorphous replacement, involved inserting a heavy atom into the protein crystals. This modified diffraction patterns so that the phase of the incident light could be inferred. Kendrew used the technique to solve myoglobin; hemoglobin yielded shortly thereafter.

And thus began a new field of research. Throughout the 1960s, the structures of carboxypeptidase, papain, chymotrypsin, and ribonuclease began to appear from old and new protein structure labs, while Perutz and Kendrew obtained increasingly better resolution structures of hemoglobin and myoglobin.

MAGIC BOX
[+]Enlarge
Credit: Frederic Richards
The original Richards Box, built by Frederic M. Richards at Oxford University in 1968 to help construct more accurate atomic models of proteins. The device became an essential part of structural biology labs. However, researchers were known to lose months using it to tweak protein structures. Perhaps for this reason, the box was also referred to as Fred’s Folly.
Photograph of the original "Richards box" built by Frederic M. Richards at Oxford in 1968. It uses a half-silvered mirror at 45° to visually superimpose an image of the sheets of crystallographic electron-density contours onto the molecular model of the protein being built in the lower compartment.
Credit: Frederic Richards
The original Richards Box, built by Frederic M. Richards at Oxford University in 1968 to help construct more accurate atomic models of proteins. The device became an essential part of structural biology labs. However, researchers were known to lose months using it to tweak protein structures. Perhaps for this reason, the box was also referred to as Fred’s Folly.

“Every new structure was a surprise,” says Robert Huber of the Max Planck Institute for Biochemistry, in Munich, whose thesis supervisor, Walter Hoppe, asked him to solve the structure of an oxygen-carrying protein in insects during the 1960s. After getting his feet wet in the field, Huber went on to win the 1988 Nobel Prize in Chemistry for solving the first structure of a membrane protein.

Yet the field still faced many challenges, particularly in finding ways to visually represent these complicated protein structures. “It really helped to have some artist’s skills,” says Helen M. Berman, the head of the Research Collaborating for Structural Bioinformatics’ Protein Data Bank at Rutgers University, part of a worldwide repository for protein structure coordinates. Sometimes people would draw structures of the complicated proteins by hand to make stereodiagrams, whereas others carved structures out of balsa wood.

Models of proteins containing thousands of atoms were labors of love. To construct them, researchers would plot two-dimensional sections of the protein’s electron-density data onto transparent plastic sheets. Stacking these plastic sheets together, researchers could get a 3-D representation of the protein’s electron density, from which they would then painstakingly construct atomic models of the protein using wires and screws. To improve proper alignment of the models with the electron-density maps, researchers would build a contraption called a Richards’ box. This box used half-silvered mirrors to superimpose the atomic models being built with the electron-density maps. The whole process “took months and months,” says Ian A. Wilson, a pioneering structural biologist at Scripps Research Institute California who solved the structure of the glycolysis enzyme triose phosphate isomerase as a graduate student at Oxford University.

Once you were done building the model, you began the exhausting process of figuring out the x-y-z coordinates of the protein. Prior to computers, this required rulers and plumb lines.

To illustrate proteins for a journal publication, early structural biologists often hired professional artists. One of the few people who had both scientific know-how and artistic skills was Duke University’s Jane S. Richardson. She invented the now ubiquitous ribbon diagram to simplify representations of secondary structures such as α-helices and β-sheets. These types of representations are still found in most articles that include protein structure images. In 2004, Richardson told a biographer that she was staggered that “a whole generation of scientists sees protein structure through my eyes” (Biol. Physicist 2004,4, 5).

OLD-SCHOOL TALENT
[+]Enlarge
Credit: Jane Richardson
Hand-drawn structure of triose phosphate isomerase by Richardson, who developed the ribbon model of secondary structure.
Hand drawn structure of triose phosphate isomerase.
Credit: Jane Richardson
Hand-drawn structure of triose phosphate isomerase by Richardson, who developed the ribbon model of secondary structure.

In the mid- to late 1970s, computers and their displays got powerful enough that researchers could use them to visualize protein structures in minutes instead of months. Yet programmers never found a better representation for α-helices and β-sheets than Richardson’s ribbon diagrams, so they were incorporated into visualization software as well.

In addition to advances in computer visualization, the 1970s saw the establishment of the Protein Data Bank (PDB), a tool universally used by structural biologists today. But at the time, the repository of 3-D details was extremely controversial.

Several scientists believed that “those who generated the first structure of an important protein should have the privilege to work on it without competition, at least for a certain time, and to determine protein-ligand complexes, particularly for proteins with relevance in pharma,” Huber says. PDB, in contrast, made all of this information available to anyone who asked—if the bank had the structure.

If it didn’t, “you had to write the author asking for coordinates,” Rossmann says. “Some said no.” Around 1981, Rossmann found a way to extract the data he wanted from a scientific paper, as long as its authors included a stereogram of the protein. “In desperation I wrote a computer program that could extract the protein’s 3-D coordinates from the stereogram,” Rossmann says. “PDB supplied the program to anyone who asked.”

According to Berman, everything changed again after 1989. That’s when the International Union of Crystallography published guidelines for submitting structural data to PDB. “As soon as these guidelines were established, funding agencies and journals increasingly required PDB submission as part of receiving funds or getting published,” she says. The number of structures in the bank began to rise exponentially, from 507 in 1990 to 13,596 in 2000.

The growing number of PDB submissions set the stage for a better understanding of the rules of protein structure, says Thomas L. Blundell, a structural biologist at Cambridge University who solved the structure of insulin. Researchers such as Janet Thornton, now the director of the European Bioinformatics Institute at the Wellcome Trust Genome Campus, in En­gland, used PDB to compare, contrast, and statistically analyze the prevalence of certain protein folds or interactions such as salt bridges.

These early structural bioinformaticians also began developing computer tools to superimpose the structures of seemingly unrelated proteins, allowing them to discover that nature sometimes reused useful topologies. The comparisons provided the first evidence that Darwinian evolutionary principles applied at the molecular scale in proteins. For example, some folds, such as α/β-barrels, cylinders of β-strands and α-helices, “kept popping up all over the place,” Thornton says.

In the 1980s, researchers increasingly noticed that evolution sometimes cut-and-pasted amino acid sequences from one protein to another and then slightly adjusted them to do different chemistry. This is a case of divergent evolution: Similar protein structures have related amino acid sequences, likely passed on from evolutionary protein ancestors. Other times, completely different amino acid sequences adopted the same 3-D fold, a case of convergent evolution, where a biomolecular structure or trait is so useful it evolves several times independently.

By the 1990s, protein structural biology had matured into a well-established analytical discipline. Researchers began using recombinant DNA to clone and express proteins at precise lengths, thus reducing the floppy ends that had stymied earlier crystallization attempts. Meanwhile, the use of X-rays produced by synchrotron sources improved the quality of diffraction data and its speed of acquisition. At the same time, Kurt Wüthrich and Richard R. Ernst were improving nuclear magnetic resonance techniques to solve protein structures, sometimes faster than with X-ray crystallization.

Throughout the 2000s, structural biology for simple and small proteins became more highly automated. High-throughput structural biology programs tackled previously unknown proteins uncovered from genome-sequencing projects. High-throughput technology also automated the production of protein crystal samples. Meanwhile, researchers in the field set their sights on solving the structures of large, multidomain cellular machines like the ribosome and complex membrane proteins, such as ion channels. Finally, electron microscopy evolved as a third tool for determining the structure of proteins. In 2013, PDB topped 85,000 protein structures, many solved by a combination of X-ray crystallography, NMR, and electron microscopy.

As pioneers in the field reflect on the biggest difference between current protein structure projects and early ones, many mention the decreasing role of crystallographers and the increasing role of biologists in determining what proteins are studied.

Advertisement

“Initially, people solved the structures they could,” Huber says. Perutz, for instance, was aided back in the 1930s by a physiologist at Cambridge who had already crystallized horse hemoglobin. Now, says Huber, “You pose the biological question first. Then you determine which structures you need to solve in order to answer the question.”

Blundell concurs: “A lot of the early protein structure people were physicists or computer people or chemists and didn’t know much about biology. The prerequisite was that you had crystals.”

Article:

This article has been sent to the following recipient:

0 /1 FREE ARTICLES LEFT THIS MONTH Remaining
Chemistry matters. Join us to get the news you need.