Computational Methods in Life Sciences

Branch 1: Computing in Structural and Cell Biology
  • II3a - Fast Protein-Protein Docking Algorithms

    Background and state of the art
    Since many proteins bring out their biological functions by binding to a specific partner protein at a specific site, predicting and determining the structure of a given complex is one of the most important focuses for molecular biology researchers. This is often called "the protein-protein docking problem". It is not only an important task for understanding the complex interaction network inside the cell but also the key to rational drug design. Of the many docking approaches, grid-based Fast Fourier Transforms (FFT) has been shown to provide the best balance between complexity and accuracy of computation.

    The Project

    Determining experimentally the putative structures of all molecular complexes that one may want to analyse is far from feasible due to its high costs and technical difficulty. In this project we focus on developing a new in silico method to predict conformations of such complexes.
    So far we have been working on a possibility to solve the docking problem with an FFT that is using nonequispaced data (NFFT) and can thus be performed directly with the atom coordinates of the interacting molecules. This significantly improved the computational efficiency and reduced the storage needed for computation in comparison to grid-based algorithms. Docking processes typically consist of two stages, producing a list of putative conformations out of a six-dimensional search space of rotations and translations followed by a refinement step. Special emphasis is laid on the first stage where we use a suitable parameterisation of the three-dimensional rotation group SO(3) along with spherical harmonics to quickly compute the sought correlation function between molecules.

    Investigators
    ClinicalLife SciencesInformatics
    T. PetersInstitut für Chemie

    Contact: thomas.peters (at) chemie.uni-luebeck.de
    J. PrestinInstitut für Mathematik

    Contact: prestin (at) math.uni-luebeck.de
    Key Literature
    • Kovacs, J.A., Chacón, P., Cong, Y., Metwally, E. and Wriggers, W.: Fast Rotational Matching of Rigid Bodies by Fast Fourier Transform Acceleration of Five Degrees of Freedom, Acta Cryst. D 59:1371-1376, 2003.
    • Potts, D. and Steidl, G.: Fast summation at nonequispaced nodes by NFFTs, SIAM J. Sci. Comput., 24:2013-2037, 2003.
  • II3b - Efficient Methods for Exact Solutions of Complex Problems in Molecular Biology

    Background and state of the art
    The aim of this project is to develop algorithmic methods and software tools for central problems in molecular biology. This will be based on results and techniques from the areas efficient algorithms and data structures, information theory, algorithmic learning, data mining and complexity theory. To deal with molecular data in an intelligent way, we plan to investigate the properties of such data - noise, dependencies, hidden information, etc. - and an algorithmic modelling to process it on high-performance parallel computers.

    The Project
    The project is conducted by the Institute for Theoretical Computer Science and the Institute for Biology. The Theoretical Computer Science group has worked on algorithmic methods to process complex data, in particular with inductive learning and data mining techniques. Besides sequential algorithms, speedup methods by massive parallelism have also been investigated. The Biology group has specialised on the analysis of intracellular protein transport. We analyzed data for multiple sequence alignment problems (MSA). Finding an optimal alignment is a very difficult and time consuming task due to the inherent algorithmic complexity. Therefore, most software systems provide only approximate solutions. We have developed new exact sequential algorithms that give better performance. A current PhD project tries to parallelise our new methods in a suitable way such that further significant speedups by parallel processing can be achieved. The algorithms are tested on our high performance shared-memory parallel machine SunFire 15K. An important innovation for our solution is the reduction of the tasks to specific graph theoretical problems that can be solved efficiently also in parallel, for example to distance problems in specific regular graphs.

    Investigators
    ClinicalLife SciencesInformatics
    E. HartmannInstitut für Biologie

    Contact: ennohart (at) molbio.uni-luebeck.de
    R. ReischukInstitut für Theoretische Informatik

    Contact: reischuk (at) tcs.uni-luebeck.de
    Key Literature
    • Jakoby, M., Liskiewicz, R. and Reischuk, R.: Approximating Schedules for Dynamic Graphs Efficiently, Electronic Colloquium on Computation Complexity, Journal on Discrete Algorithms 2:471-500, 2004
  • II3c - Synaptic Plasticity: Regulatory Mechanisms in Receptor Trafficking

    Background and state of the art

    Nerve cell, sketch by S. Ramón y Cajal

    Synaptic plasticity is regarded as the molecular basis of learning and memory. It involves complex molecular machinery with various protein interactions but it is not yet clear how the components give rise to the different aspects of synaptic plasticity.

    The figures give a sketch of the LTP- and LTD-states. The ability of the synapse to stay in different stable stationary states is referred to as multi- or bistability in the literature. The transition from one stationary state to another requires a positive-feedback aspect in the biochemical reactions. Until now, first steps have been done in modelling the synaptic plasticity process. Beside models with a small number of components concerning the positive-feedback mechanism of CaMKII phosphorylation, e.g. by Lisman, there are very few extended mathematical models of biochemical systems containing switches which lead to synaptic plasticity, e.g. by Castellani or Hayer. Moreover, many current experimental strategies consist in a search for putative memory tags, s. Kelleher Ill.

    Stable states LTD

    Stable states LTP

    Within the past few years, an accumulation of observations including the trafficking of AMPA-receptor subunits and the identification of numerous interacting proteins involved in synaptic plasticity have created a new state of knowledge about the molecular mechanisms leading to long-term potentiation (LTP) or long-term depression (LTD) at synapses [1].
    A repeated short-time excitation of an active synapse can fix it in the LTP- respectively in the LTD-state. It is well accepted that the AMPA-receptor sorting and trafficking play crucial roles in synaptic plasticity. The occurrence of the homomeric form of this receptor in the postsynaptic membrane governs the strength of synaptic transmission.

    The Project
    The project deals with the modelling and the simulation of the molecular mechanisms of LTP and LTD in the synaptic transmissions. The base of synaptic plasticity is regarded as a multistable molecular reaction, which stays in distinguished stable states like LTP, LTD and possible further states after having been induced by stress related signals [2].
    Most biochemical reactions described in the literature are monostable. The proposed project aims in the explanation of fundamental mechanisms of multistable reactions both in synaptic plasticity and in more general context. An essential part of such reactions is a positive-feedback loop, which has been found in various biochemical and medical investigations. Using modules known form these investigations, bistable and multistable reactions are created which reproduce the behaviour of the particular application. Starting from a small core system of ordinary differential equations containing multistability, synaptic plasticity is modelled by the enrichment of the core system by further regulatory paths. Occurring parameters are to be identified by a comparison with clinical and experimental data. The resulting nonlinear dynamical system is discussed concerning parameter sensitivity and robustness with respect to the kinetics.
    It is known that AMPA-receptors are trafficking from the cytosol into and away from synapses. Thus, the confirmed and tested ordinary differential equations including the reaction kinetics are completed by active diffusion and other transport processes. In general, the resulting partial differential equations have a highly complex solution behaviour, which has to be carefully discussed and adapted to clinical experiences. The local resolution of the model enables us to explain the marking process of different synapses in one nerve cell.
    Further to the synaptic plasticity, it is assumed that fundamental regulatory mechanisms act in several biochemical applications. With respect to the immense number of possible substances and regulatory paths in extended biochemical systems like the synaptic plasticity, mathematical models allow to check virtually a large number of hypotheses and to estimate a priori the gain of knowledge expected by time-expensive experimental investigations.

    Investigators
    ClinicalLife SciencesInformatics
    E. HartmannInstitut für Biologie

    Contact: ennohart (at) molbio.uni-luebeck.de
    D. LangemannInstitut für Mathematik

    Contact: langemann (at) math.uni-luebeck.de
    A. PetersMedizinische Klinik I

    Contact: achim.peters (at) uk-sh.de
    Key Literature
    • Malinow, R. and Malenka, R.C.: AMPA receptor trafficking and synaptic plasticity, Ann. Rev. Neurosci. 25:103-126, 2002.
    • Langemann, D., Pellerin, L. and Peters, A.: Modelling molecular mechanisms of synaptic plasticity - making sense of AMPA receptor trafficking (in preparation, 2007).
  • II3d - Virus Evolution

    Background and state of the art
    Virus evolution is subject to several restrictions, which include the rather small size of the viral genome and the necessity to replicate in a host cell, i.e. to try go undetected by the immune system of the host. On the other hand, many viruses evolve very fast, as their replicases lack a proof-reading system. Thus, viruses can adapt themselves quickly to changing environmental conditions, such as treatment with antiviral drugs. Understanding virus evolution is not only of academic interest, but is necessary to predict drug-resistance mutations or the epidemic potential of certain viruses. Thus, in order to predict whether or not an outbreak of avian influenza virus constitutes a pandemic threat, it will be useful to have a complete picture of the evolutionary relationships between different influenza virus strains and their subgroups. Such an analysis has not been described for complete genomes of a large number of viral isolates. Analysis of virus evolution is complicated by the frequent occurrence of overlapping reading frames. This phenomenon is observed in retroviruses such as HIV, but also occurs in influenza virus (e.g., the overlap between the NS1 and NS2 genes). Even in large RNA virus genomes such as those of coronaviruses, overlapping reading frames do occur, disproving the current belief that this phenomenon is basically limited to small viral genomes. In fact, SARS coronavirus only became easily transmittable from human to human after a rearrangement of overlapping reading frames among the accessory genes occurred during adaption of this bat virus to the human host. The evolution of overlapping reading frames has not been analyzed in a systematic way so far.

    The project
    The project consists of two parts. First, an analysis of the evolution of influenza virus genes will be carried out, taking all >6,000 sequences of influenza A virus isolates into account that have been deposited in the databases. The relationships between the individual isolates will be analyzed. This will require the development of new algorithms for data sampling and presentation. Correlations of the evolutionary relationships between individual influenza virus genes with three-dimensional structures will be sought in cases where the latter are available. In the second part of the project, a systematic analysis of overlapping viral reading frames will be carried out, starting from the NS gene of influenza virus and continuing with coronaviruses and human immunodeficiency virus. The analysis will involve prediction of the secondary structure of RNA at the frameshift sites, studies of the mutations of each of the overlapping genes, and other aspects, and will be complemented by structural studies of the "overlapping proteins" (within another, experimentally oriented PhD thesis). One of the questions to be answered will be whether the evolution of both genes constitutes a compromise, or whether one of them is being optimized while the other is less-than-ideal.

    Investigators
    ClinicalLife SciencesInformatics
    R. HilgenfeldInstitut für Biochemie

    Contact: hilgenfeld (at) biochem.uni-luebeck.de
    T. MartinetzInstitut für Neuro- und Bioinformatik

    Contact: martinetz (at) informatik.uni-luebeck.de
    Key Literature
    • R. Hilgenfeld, J. Tan, S. Chen, X. Shen & H. Jiang: Structural proteomics of emerging viruses: The examples of SARS-CoV and other coronaviruses. In: Structural Proteomics and Its Impact on the Life Sciences (J. Sussman & I. Silman, eds.), World Scientific, Singapore, 2008, pp. 361-433.