Home > About Us > CNIO Publications, Year 2015 > A new algorithm to predict the dynamic language of proteins

CNIO Publications, Year 2015


A new algorithm to predict the dynamic language of proteins

Madrid, 22 October, 2015

Researchers have developed the first computational method based on evolutionary principles to predict the changes in shape that proteins experience to carry out their functions

This method is a step forward in the study of protein dynamics, of great importance for the design of drugs and the investigation of genetic diseases such as cancer

The work, published in the journal ‘Proceedings of the National Academy of Sciences (PNAS)’, was prepared by researchers at the CNIO and University College London (United Kingdom)

Researchers from the Structural Biology Computational Group of the Spanish National Cancer Research Centre (CNIO), led by Alfonso Valencia, in collaboration with a group headed by Francesco Gervasio at the University College London (UK), have developed the first computational method based on evolutionary principles to predict protein dynamics, which explains the changes in the shape or dimensional structure that they experience in order to interact with other compounds or speed up chemical reactions. The study constitutes a major step forward in the computational study of protein dynamics (i.e. their movement), which is crucial for the design of drugs and for the research on genetic diseases, such as cancer, resulting in higher levels of complexity than allowed by current methods. The results have been published this week in the journal Proceedings of the National Academy of Sciences (PNAS).

Proteins are macromolecules that are key to the thousands of cellular functions that take place in a living organism. They are formed by chains of smaller molecules called amino acids that fold forming a three-dimensional structure. It has recently been discovered that by studying the co-evolution of amino acids we can reconstruct the form or structure of these biological compounds in their natural surroundings. “A protein's amino acids can co-evolve, i.e. change in a coordinated way,” says Alfonso Valencia. “By analysing the sequences of a given family of proteins, we can predict physical contacts between amino acids with great precision, in sufficient number to reconstruct the folding of a protein accurately and, therefore, its structure or form.”

However, this structure does not remain static; it goes through changes in such a way that, similar to a dance in which each of the dancers adapts to their partner, it interacts with other biological compounds or with drugs. This is known as protein dynamics, the study of which has proven to be very difficult both with experimental observations and using computational tools.

The question addressed by the researchers at the beginning of the study, when Francesco Gervasio headed the Computational Biophysics Group at the CNIO, was more complex: can we use co-evolutionary studies to predict changes in the shape of proteins and, consequently, the language they establish with their environment?

“We developed a model in which the amino acids that have a strong co-evolutionary relationship attracted each other, without further additional data,” says Simone Marsili, researcher who has also participated in the project. “First, we simulate the folding process and then we can see how the simulations were able to predict the changes in shape of the proteins at different levels of complexity, including those required for kinases to function [these are key proteins in metabolic and cell signalling processes as well as in cell transport, amongst others].”

This new computational method easily integrates experimental and genomic data through the use of the latest sequence analysis and 3D modelling technology. In addition, it demonstrates that genomic data can be a source of useful information to supplement the current tools used to study the structure and dynamics of proteins.

“The ability to predict key features of proteins at this level of complexity will help to understand how the sequence of a protein determines its dynamics and, therefore, its functions,” concludes Valencia. This field of knowledge is key to the study of genetic diseases, such as cancer, or the design of drugs, amongst other uses.

The study has been funded by the Spanish Ministry of Economy and Competitiveness and the Engineering and Physical Sciences Research Council (UK).

The predicted profile for the conformational dynamics of the tyrosine kinase family. Regions highlighted in red correspond to important structural elements involved in protein activation. /CNIO

Reference article:

From residue coevolution to protein conformational ensembles and functional dynamics. Ludovico Sutto, Simone Marsili, Alfonso Valencia, Francesco Luigi Gervasio. PNAS (2015). doi: 10.1073/pnas.1508584112


  • Head of Communications
  • Nuria Noriega
  • Communications Officer
  • Cristina de Martos
  • E-mail comunicacion@cnio.es
  • Phone +(34) 917 328 000