|
|
| Understanding proteinsNew model of protein folding helps researchers handle flood of genomic data All living tissue is made from proteins, and all proteins are made from a combination of the same 20 chemical building blocks, called amino acids. The difference between the proteins that make up bone, blood, hair and eyeballs is largely one of shape.
Genes are the recipes for stringing together amino acids into proteins, but the way in which those strings fold back on themselves determines their shape. So understanding genes’ roles in disease requires understanding how proteins fold.
In a series of recent papers, researchers at MIT’s Computer Science and Artificial Intelligence Laboratory have demonstrated a promising new technique for modeling such protein folding. While not as accurate as some existing techniques, it is much more computationally efficient. Sophisticated, atom-by-atom simulations that run on hundreds of thousands of computers might take months to model a few milliseconds of protein folding. The researchers’ new technique can model the same process in minutes on a single laptop.
Speed is of the essence as the amount of unprocessed genomic data proliferates. “There’s the Broad 1,000 Genomes project, there’s X many species that have been sequenced now, and the sequence data is just vastly outpacing the speed with which you could apply some of these other techniques,” says Charles O’Donnell, a PhD student in the Department of Electrical Engineering and Computer Science who helped develop the new approach. “If you want to make sense of all this high-throughput data that’s coming from this great biotech innovation, then you need something quick.”
Other “quick” methods of simulating protein folding exist, but the MIT researchers’ appears to be more accurate. There is still much we don’t know about the actual structure of proteins, O’Donnell cautions, so that makes assessing the quality of computational methods difficult. But at the 19th Annual International Conference on Intelligent Systems for Molecular Biology (ISMB) in July, the MIT researchers will present a paper demonstrating that for a class of proteins known as amyloids, their technique’s predictions match the currently available data with 81 percent accuracy, whereas high-efficiency techniques previously managed 42 percent at best.
Computational modeling of protein folding has been an active research area for decades, but “it hasn’t been entirely clear whether it was going to be useful or not,” says Susan Lindquist, an MIT professor of biology, recent recipient of the National Medal of Science, and, along with CSAIL’s Bonnie Berger and Srini Devadas, one of O’Donnell’s faculty advisors. “I think that this paper helps realize that goal.”
Quantity over quality
When a protein folds, amino acids far from each other on the protein strand are brought close together, and chemical bonds form between them. That folding, however, brings other amino acids into proximity with each other, and those acids could exert either an attractive or a repulsive force on each other. Predicting a protein’s shape is a matter of figuring out which regions of the strand could have affinities for each other, and whether bringing those regions together would cause unsupportable tensions elsewhere.
Atom-by-atom simulations can model amino acids’ interactions very precisely, but because they’re so computationally complex, they’ve generally been restricted to protein strands with only a couple dozen amino acids, whereas full proteins can comprise hundreds or even thousands. Making the simulations computationally efficient means sacrificing information about the amino acids’ interactions, and most previous attempts have tried to strike a balance between accuracy of representation and simplicity of description.
MIT computer science professors Berger and Devadas, O’Donnell and Jérôme Waldispuhl, a former MIT math instructor who’s now an assistant professor at McGill, adopted a somewhat different approach. They employ what they describe as a “coarse representation” of a protein’s chemical properties, but that allows them to generate a huge number of candidate shapes. Their algorithm then looks for the features that occur most frequently across all the candidates, which it then synthesizes into a small group of likely structures.
Working with collaborators at McGill, Boston College, and the MIT Department of Biology, they’ve applied the technique to several different problems. The paper they’re presenting in July describes their amyloid shape-prediction results, but at the 15th Annual International Conference on Research in Computational Molecular Biology on March 28, they’ll present another paper describing the precise sequence of steps by which different types of proteins — mainly so-called beta-sheet proteins — fold. There, Waldispuhl explains, the trick is that each step in the folding pathway is itself a different shape in the library of candidates, and the algorithm finds a pathway through them. Two years ago at the same conference, the researchers presented an earlier result in which they used their technique to explain the commonalities between proteins with different sequences of amino acids that nonetheless played the same role in certain biological systems, implying that they had structural similarities.
“Protein folding continues to be wide-open problem with desperate need of more rigorous mathematical, statistical and computer-science approaches,” says Sorin Istrail, a professor of computer science at Brown University who specializes in computational biology. What distinguishes the MIT researchers’ work, he says, is its “rigorously mathematical results.” “The world needs to do what Bonnie and Charlie are doing,” Istrail says, “taking one aspect of the problem and building rigorous methods for that particular component.”
| |
| Ancient body clock discovered that helps to keep all living things on time 
The mechanism that controls the internal 24-hour clock of all forms of life from human cells to algae has been identified by scientists. Not only does the research provide important insight into health-related problems linked to individuals with disrupted clocks - such as pilots and shift workers - it also indicates that the 24-hour circadian clock found in human cells is the same as that found in algae and dates back millions of years to early life on Earth. Two new studies out on 27 January, in the journal Nature from the Universities of Cambridge and Edinburgh give insight into the circadian clock which controls patterns of daily and seasonal activity, from sleep cycles to butterfly migrations. One study, from the Institute of Metabolic Science at the University of Cambridge, has for the first time identified 24-hour rhythms in red blood cells. This is significant because circadian rhythms have always been assumed to be linked to DNA and gene activity, but - unlike most of the other cells in the body - red blood cells do not have DNA. Akhilesh Reddy, from the University of Cambridge and lead author of the study, said: "We know that clocks exist in all our cells; they're hard-wired into the cell. Imagine what we'd be like without a clock to guide us through our days. The cell would be in the same position if it didn't have a clock to coordinate its daily activities. "The implications of this for health are manifold. We already know that disrupted clocks - for example, caused by shift-work and jet-lag - are associated with metabolic disorders such as diabetes, mental health problems and even cancer. By furthering our knowledge of how the 24-hour clock in cells works, we hope that the links to these disorders - and others - will be made clearer. This will, in the longer term, lead to new therapies that we couldn't even have thought about a couple of years ago."
For the study, the scientists, funded by the Wellcome Trust, incubated purified red blood cells from healthy volunteers in the dark and at body temperature, and sampled them at regular intervals for several days. They then examined the levels of biochemical markers - proteins called peroxiredoxins - that are produced in high levels in blood and found that they underwent a 24-hour cycle. Peroxiredoxins are found in virtually all known organisms. A further study, by scientists working together at the Universities of Edinburgh and Cambridge, and the Observatoire Oceanologique in Banyuls, France, found a similar 24-hour cycle in marine algae, indicating that internal body clocks have always been important, even for ancient forms of life. The researchers in this study found the rhythms by sampling the peroxiredoxins in algae at regular intervals over several days. When the algae were kept in darkness, their DNA was no longer active, but the algae kept their circadian clocks ticking without active genes. Scientists had thought that the circadian clock was driven by gene activity, but both the algae and the red blood cells kept time without it. Andrew Millar of the University of Edinburgh's School of Biological Sciences, who led the study, said: "This groundbreaking research shows that body clocks are ancient mechanisms that have stayed with us through a billion years of evolution. They must be far more important and sophisticated than we previously realised. More work is needed to determine how and why these clocks developed in people - and most likely all other living things on earth - and what role they play in controlling our bodies." Additional funding for the studies was provided by the Biotechnology and Biological Sciences Research Council, the Engineering and Physical Sciences Research Council, the Medical Research Council, the French Agence Nationale de la Recherche, and the National Institute of Health Research. | |
|
|