LIVIVO - The Search Portal for Life Sciences

zur deutschen Oberfläche wechseln
Advanced search

Search results

Result 1 - 10 of total 68

Search options

  1. Article ; Online: Session introduction: AI-driven Advances in Modeling of Protein Structure.

    Fidelis, Krzysztof / Grudinin, Sergei

    Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing

    2021  Volume 27, Page(s) 1–9

    Abstract: The last few years mark dramatic improvements in modeling of protein structure. Progress was initially due to breakthroughs in residue-residue contact prediction, first with global statistical models and later with deep learning. These advancements were ... ...

    Abstract The last few years mark dramatic improvements in modeling of protein structure. Progress was initially due to breakthroughs in residue-residue contact prediction, first with global statistical models and later with deep learning. These advancements were then followed by an even broader application of the deep learning techniques to the protein structure modeling itself, first using Convolutional Neural Networks (CNNs) and then switching to Natural Language Processing (NLP), including Attention models, and to Geometric Deep Learning (GDL). The accuracy of protein structure models generated with current state-of-the-art methods rivals that of experimental structures, while models themselves are used to solve structures or to make them more accurate.Looking at the near future of machine learning applications in structural biology, we ask the following questions: Which specific problems should we expect to be solved next? Which new methods will prove to be the most effective? Which actions are likely to stimulate further progress the most? In addressing these questions, we invite the 2022 PSB attendees to actively participate in session discussions.The AI-driven Advances in Modeling of Protein Structure session includes five papers specifically dedicated to:Evaluating the significance of training data selection in machine learning.Geometric pattern transferability, from protein self-interactions to protein-ligand interactions.Supervised versus unsupervised sequence to contact learning, using attention models.Side chain packing using SE(3) transformers.Feature detection in electrostatic representations of ligand binding sites.
    MeSH term(s) Computational Biology ; Deep Learning ; Humans ; Machine Learning ; Neural Networks, Computer ; Proteins
    Chemical Substances Proteins
    Language English
    Publishing date 2021-12-10
    Publishing country United States
    Document type Journal Article
    ISSN 2335-6936
    ISSN (online) 2335-6936
    Database MEDical Literature Analysis and Retrieval System OnLINE

    More links

    Kategorien

  2. Article ; Online: HOPMA: Boosting Protein Functional Dynamics with Colored Contact Maps.

    Laine, Elodie / Grudinin, Sergei

    The journal of physical chemistry. B

    2021  Volume 125, Issue 10, Page(s) 2577–2588

    Abstract: In light of the recent very rapid progress in protein structure prediction, accessing the multitude of functional protein states is becoming more central than ever before. Indeed, proteins are flexible macromolecules, and they often perform their ... ...

    Abstract In light of the recent very rapid progress in protein structure prediction, accessing the multitude of functional protein states is becoming more central than ever before. Indeed, proteins are flexible macromolecules, and they often perform their function by switching between different conformations. However, high-resolution experimental techniques such as X-ray crystallography and cryogenic electron microscopy can catch relatively few protein functional states. Many others are only accessible under physiological conditions in solution. Therefore, there is a pressing need to fill this gap with computational approaches. We present HOPMA, a novel method to predict protein functional states and transitions by using a modified elastic network model. The method exploits patterns in a protein contact map, taking its 3D structure as input, and excludes some disconnected patches from the elastic network. Combined with nonlinear normal mode analysis, this strategy boosts the protein conformational space exploration, especially when the input structure is highly constrained, as we demonstrate on a set of more than 400 transitions. Our results let us envision the discovery of new functional conformations, which were unreachable previously, starting from the experimentally known protein structures. The method is computationally efficient and available at https://github.com/elolaine/HOPMA and https://team.inria.fr/nano-d/software/nolb-normal-modes.
    MeSH term(s) Crystallography, X-Ray ; Macromolecular Substances ; Models, Molecular ; Protein Conformation ; Proteins
    Chemical Substances Macromolecular Substances ; Proteins
    Language English
    Publishing date 2021-03-09
    Publishing country United States
    Document type Journal Article
    ISSN 1520-5207
    ISSN (online) 1520-5207
    DOI 10.1021/acs.jpcb.0c11633
    Database MEDical Literature Analysis and Retrieval System OnLINE

    More links

    Kategorien

  3. Article ; Online: AnAnaS: Software for Analytical Analysis of Symmetries in Protein Structures.

    Pagès, Guillaume / Grudinin, Sergei

    Methods in molecular biology (Clifton, N.J.)

    2020  Volume 2165, Page(s) 245–257

    Abstract: Symmetry is very common among proteins found in structural databases such as the Protein Data Bank (PDB). We present novel software, called AnAnaS, that finds positions and orientations of the symmetry axes in all types of symmetrical protein assemblies. ...

    Abstract Symmetry is very common among proteins found in structural databases such as the Protein Data Bank (PDB). We present novel software, called AnAnaS, that finds positions and orientations of the symmetry axes in all types of symmetrical protein assemblies. It deals with five symmetry groups: cyclic, dihedral, tetrahedral, octahedral, and icosahedral. The software also assesses the quality of symmetry and can detect symmetries in incomplete cyclic assemblies. Internally, AnAnaS comprises discrete and continuous optimization steps and is applicable to assemblies with multiple chains in the asymmetric subunits or to those with pseudosymmetry. The method is very fast as most of the steps are performed analytically.
    MeSH term(s) Isomerism ; Protein Conformation ; Protein Multimerization ; Sequence Analysis, Protein/methods ; Software
    Language English
    Publishing date 2020-07-03
    Publishing country United States
    Document type Journal Article ; Research Support, Non-U.S. Gov't
    ISSN 1940-6029
    ISSN (online) 1940-6029
    DOI 10.1007/978-1-0716-0708-4_14
    Database MEDical Literature Analysis and Retrieval System OnLINE

    More links

    Kategorien

  4. Article ; Online: DeepSymmetry: using 3D convolutional networks for identification of tandem repeats and internal symmetries in protein structures.

    Pagès, Guillaume / Grudinin, Sergei

    Bioinformatics (Oxford, England)

    2019  Volume 35, Issue 24, Page(s) 5113–5120

    Abstract: Motivation: Thanks to the recent advances in structural biology, nowadays 3D structures of various proteins are solved on a routine basis. A large portion of these structures contain structural repetitions or internal symmetries. To understand the ... ...

    Abstract Motivation: Thanks to the recent advances in structural biology, nowadays 3D structures of various proteins are solved on a routine basis. A large portion of these structures contain structural repetitions or internal symmetries. To understand the evolution mechanisms of these proteins and how structural repetitions affect the protein function, we need to be able to detect such proteins very robustly. As deep learning is particularly suited to deal with spatially organized data, we applied it to the detection of proteins with structural repetitions.
    Results: We present DeepSymmetry, a versatile method based on 3D convolutional networks that detects structural repetitions in proteins and their density maps. Our method is designed to identify tandem repeat proteins, proteins with internal symmetries, symmetries in the raw density maps, their symmetry order and also the corresponding symmetry axes. Detection of symmetry axes is based on learning 6D Veronese mappings of 3D vectors, and the median angular error of axis determination is less than one degree. We demonstrate the capabilities of our method on benchmarks with tandem-repeated proteins and also with symmetrical assemblies. For example, we have discovered about 7800 putative tandem repeat proteins in the PDB.
    Availability and implementation: The method is available at https://team.inria.fr/nano-d/software/deepsymmetry. It consists of a C++ executable that transforms molecular structures into volumetric density maps, and a Python code based on the TensorFlow framework for applying the DeepSymmetry model to these maps.
    Supplementary information: Supplementary data are available at Bioinformatics online.
    MeSH term(s) Algorithms ; Proteins ; Software ; Tandem Repeat Sequences
    Chemical Substances Proteins
    Language English
    Publishing date 2019-06-03
    Publishing country England
    Document type Journal Article ; Research Support, Non-U.S. Gov't
    ZDB-ID 1422668-6
    ISSN 1367-4811 ; 1367-4803
    ISSN (online) 1367-4811
    ISSN 1367-4803
    DOI 10.1093/bioinformatics/btz454
    Database MEDical Literature Analysis and Retrieval System OnLINE

    More links

    Kategorien

  5. Article ; Online: Protein sequence-to-structure learning: Is this the end(-to-end revolution)?

    Laine, Elodie / Eismann, Stephan / Elofsson, Arne / Grudinin, Sergei

    Proteins

    2021  Volume 89, Issue 12, Page(s) 1770–1786

    Abstract: The potential of deep learning has been recognized in the protein structure prediction community for some time, and became indisputable after CASP13. In CASP14, deep learning has boosted the field to unanticipated levels reaching near-experimental ... ...

    Abstract The potential of deep learning has been recognized in the protein structure prediction community for some time, and became indisputable after CASP13. In CASP14, deep learning has boosted the field to unanticipated levels reaching near-experimental accuracy. This success comes from advances transferred from other machine learning areas, as well as methods specifically designed to deal with protein sequences and structures, and their abstractions. Novel emerging approaches include (i) geometric learning, that is, learning on representations such as graphs, three-dimensional (3D) Voronoi tessellations, and point clouds; (ii) pretrained protein language models leveraging attention; (iii) equivariant architectures preserving the symmetry of 3D space; (iv) use of large meta-genome databases; (v) combinations of protein representations; and (vi) finally truly end-to-end architectures, that is, differentiable models starting from a sequence and returning a 3D structure. Here, we provide an overview and our opinion of the novel deep learning approaches developed in the last 2 years and widely used in CASP14.
    MeSH term(s) Amino Acid Sequence ; Computational Biology ; Databases, Protein ; Deep Learning ; Protein Conformation ; Proteins/chemistry ; Proteins/metabolism ; Sequence Analysis, Protein ; Software
    Chemical Substances Proteins
    Language English
    Publishing date 2021-09-22
    Publishing country United States
    Document type Journal Article ; Research Support, Non-U.S. Gov't ; Review
    ZDB-ID 806683-8
    ISSN 1097-0134 ; 0887-3585
    ISSN (online) 1097-0134
    ISSN 0887-3585
    DOI 10.1002/prot.26235
    Database MEDical Literature Analysis and Retrieval System OnLINE

    More links

    Kategorien

  6. Article ; Online: Analytical symmetry detection in protein assemblies. II. Dihedral and cubic symmetries.

    Pagès, Guillaume / Grudinin, Sergei

    Journal of structural biology

    2018  Volume 203, Issue 3, Page(s) 185–194

    Abstract: Protein assemblies are often symmetric, as this organization has many advantages compared to individual proteins. Complex protein structures thus very often possess high-order symmetries. Detection and analysis of these symmetries has been a challenging ... ...

    Abstract Protein assemblies are often symmetric, as this organization has many advantages compared to individual proteins. Complex protein structures thus very often possess high-order symmetries. Detection and analysis of these symmetries has been a challenging problem and no efficient algorithms have been developed so far. This paper presents the extension of our cyclic symmetry detection method for higher-order symmetries with multiple symmetry axes. These include dihedral and cubic, i.e., tetrahedral, octahedral, and icosahedral, groups. Our method assesses the quality of a particular symmetry group and also determines all of its symmetry axes with a machine precision. The method comprises discrete and continuous optimization steps and is applicable to assemblies with multiple chains in the asymmetric subunits or to those with pseudo-symmetry. We implemented the method in C++ and exhaustively tested it on all 51,358 symmetric assemblies from the Protein Data Bank (PDB). It allowed us to study structural organization of symmetric assemblies solved by X-ray crystallography, and also to assess the symmetry annotation in the PDB. For example, in 1.6% of the cases we detected a higher symmetry group compared to the PDB annotation, and we also detected several cases with incorrect annotation. The method is available at http://team.inria.fr/nano-d/software/ananas. The graphical user interface of the method built for the SAMSON platform is available at http://samson-connect.net.
    MeSH term(s) Algorithms ; Crystallography, X-Ray ; Databases, Protein ; Protein Conformation ; Proteins/chemistry ; Software
    Chemical Substances Proteins
    Language English
    Publishing date 2018-06-15
    Publishing country United States
    Document type Journal Article ; Research Support, Non-U.S. Gov't
    ZDB-ID 1032718-6
    ISSN 1095-8657 ; 1047-8477
    ISSN (online) 1095-8657
    ISSN 1047-8477
    DOI 10.1016/j.jsb.2018.05.005
    Database MEDical Literature Analysis and Retrieval System OnLINE

    More links

    Kategorien

  7. Article ; Online: Predicting Protein Functional Motions: an Old Recipe with a New Twist.

    Grudinin, Sergei / Laine, Elodie / Hoffmann, Alexandre

    Biophysical journal

    2020  Volume 118, Issue 10, Page(s) 2513–2525

    Abstract: Large macromolecules, including proteins and their complexes, very often adopt multiple conformations. Some of them can be seen experimentally, for example with x-ray crystallography or cryo-electron microscopy. This structural heterogeneity is not ... ...

    Abstract Large macromolecules, including proteins and their complexes, very often adopt multiple conformations. Some of them can be seen experimentally, for example with x-ray crystallography or cryo-electron microscopy. This structural heterogeneity is not occasional and is frequently linked with specific biological function. Thus, the accurate description of macromolecular conformational transitions is crucial for understanding fundamental mechanisms of life's machinery. We report on a real-time method to predict such transitions by extrapolating from instantaneous eigen motions, computed using the normal mode analysis, to a series of twists. We demonstrate the applicability of our approach to the prediction of a wide range of motions, including large collective opening-closing transitions and conformational changes induced by partner binding. We also highlight particularly difficult cases of very small transitions between crystal and solution structures. Our method guarantees preservation of the protein structure during the transition and allows accessing conformations that are unreachable with classical normal mode analysis. We provide practical solutions to describe localized motions with a few low-frequency modes and to relax some geometrical constraints along the predicted transitions. This work opens the way to the systematic description of protein motions, whatever their degree of collectivity. Our method is freely available as a part of the NOn-Linear rigid Block (NOLB) package.
    MeSH term(s) Cryoelectron Microscopy ; Crystallography, X-Ray ; Models, Molecular ; Protein Conformation ; Proteins
    Chemical Substances Proteins
    Language English
    Publishing date 2020-04-04
    Publishing country United States
    Document type Journal Article ; Research Support, Non-U.S. Gov't
    ZDB-ID 218078-9
    ISSN 1542-0086 ; 0006-3495
    ISSN (online) 1542-0086
    ISSN 0006-3495
    DOI 10.1016/j.bpj.2020.03.020
    Database MEDical Literature Analysis and Retrieval System OnLINE

    More links

    Kategorien

  8. Article ; Online: VoroCNN: deep convolutional neural network built on 3D Voronoi tessellation of protein structures.

    Igashov, Ilia / Olechnovič, Kliment / Kadukova, Maria / Venclovas, Česlovas / Grudinin, Sergei

    Bioinformatics (Oxford, England)

    2021  Volume 37, Issue 16, Page(s) 2332–2339

    Abstract: Motivation: Effective use of evolutionary information has recently led to tremendous progress in computational prediction of three-dimensional (3D) structures of proteins and their complexes. Despite the progress, the accuracy of predicted structures ... ...

    Abstract Motivation: Effective use of evolutionary information has recently led to tremendous progress in computational prediction of three-dimensional (3D) structures of proteins and their complexes. Despite the progress, the accuracy of predicted structures tends to vary considerably from case to case. Since the utility of computational models depends on their accuracy, reliable estimates of deviation between predicted and native structures are of utmost importance.
    Results: For the first time, we present a deep convolutional neural network (CNN) constructed on a Voronoi tessellation of 3D molecular structures. Despite the irregular data domain, our data representation allows us to efficiently introduce both convolution and pooling operations and train the network in an end-to-end fashion without precomputed descriptors. The resultant model, VoroCNN, predicts local qualities of 3D protein folds. The prediction results are competitive to state of the art and superior to the previous 3D CNN architectures built for the same task. We also discuss practical applications of VoroCNN, for example, in recognition of protein binding interfaces.
    Availability and implementation: The model, data and evaluation tests are available at https://team.inria.fr/nano-d/software/vorocnn/.
    Supplementary information: Supplementary data are available at Bioinformatics online.
    Language English
    Publishing date 2021-02-10
    Publishing country England
    Document type Journal Article
    ZDB-ID 1422668-6
    ISSN 1367-4811 ; 1367-4803
    ISSN (online) 1367-4811
    ISSN 1367-4803
    DOI 10.1093/bioinformatics/btab118
    Database MEDical Literature Analysis and Retrieval System OnLINE

    More links

    Kategorien

  9. Article ; Online: Docking-based long timescale simulation of cell-size protein systems at atomic resolution.

    Vakser, Ilya A / Grudinin, Sergei / Jenkins, Nathan W / Kundrotas, Petras J / Deeds, Eric J

    Proceedings of the National Academy of Sciences of the United States of America

    2022  Volume 119, Issue 41, Page(s) e2210249119

    Abstract: Computational methodologies are increasingly addressing modeling of the whole cell at the molecular level. Proteins and their interactions are the key component of cellular processes. Techniques for modeling protein interactions, thus far, have included ... ...

    Abstract Computational methodologies are increasingly addressing modeling of the whole cell at the molecular level. Proteins and their interactions are the key component of cellular processes. Techniques for modeling protein interactions, thus far, have included protein docking and molecular simulation. The latter approaches account for the dynamics of the interactions but are relatively slow, if carried out at all-atom resolution, or are significantly coarse grained. Protein docking algorithms are far more efficient in sampling spatial coordinates. However, they do not account for the kinetics of the association (i.e., they do not involve the time coordinate). Our proof-of-concept study bridges the two modeling approaches, developing an approach that can reach unprecedented simulation timescales at all-atom resolution. The global intermolecular energy landscape of a large system of proteins was mapped by the pairwise fast Fourier transform docking and sampled in space and time by Monte Carlo simulations. The simulation protocol was parametrized on existing data and validated on a number of observations from experiments and molecular dynamics simulations. The simulation protocol performed consistently across very different systems of proteins at different protein concentrations. It recapitulated data on the previously observed protein diffusion rates and aggregation. The speed of calculation allows reaching second-long trajectories of protein systems that approach the size of the cells, at atomic resolution.
    MeSH term(s) Algorithms ; Biophysical Phenomena ; Kinetics ; Molecular Dynamics Simulation ; Monte Carlo Method ; Proteins
    Chemical Substances Proteins
    Language English
    Publishing date 2022-10-03
    Publishing country United States
    Document type Journal Article ; Research Support, Non-U.S. Gov't ; Research Support, N.I.H., Extramural ; Research Support, U.S. Gov't, Non-P.H.S.
    ZDB-ID 209104-5
    ISSN 1091-6490 ; 0027-8424
    ISSN (online) 1091-6490
    ISSN 0027-8424
    DOI 10.1073/pnas.2210249119
    Database MEDical Literature Analysis and Retrieval System OnLINE

    More links

    Kategorien

  10. Article ; Online: NOLB: Nonlinear Rigid Block Normal-Mode Analysis Method.

    Hoffmann, Alexandre / Grudinin, Sergei

    Journal of chemical theory and computation

    2017  Volume 13, Issue 5, Page(s) 2123–2134

    Abstract: We present a new conceptually simple and computationally efficient method for nonlinear normal-mode analysis called NOLB. It relies on the rotations-translations of blocks (RTB) theoretical basis developed by Y.-H. Sanejouand and colleagues [ Durand et ... ...

    Abstract We present a new conceptually simple and computationally efficient method for nonlinear normal-mode analysis called NOLB. It relies on the rotations-translations of blocks (RTB) theoretical basis developed by Y.-H. Sanejouand and colleagues [ Durand et al. Biopolymers 1994 , 34 , 759 - 771 . Tama et al. Proteins: Struct., Funct., Bioinf . 2000 , 41 , 1 - 7 ]. We demonstrate how to physically interpret the eigenvalues computed in the RTB basis in terms of angular and linear velocities applied to the rigid blocks and how to construct a nonlinear extrapolation of motion out of these velocities. The key observation of our method is that the angular velocity of a rigid block can be interpreted as the result of an implicit force, such that the motion of the rigid block can be considered as a pure rotation about a certain center. We demonstrate the motions produced with the NOLB method on three different molecular systems and show that some of the lowest frequency normal modes correspond to the biologically relevant motions. For example, NOLB detects the spiral sliding motion of the TALE protein, which is capable of rapid diffusion along its target DNA. Overall, our method produces better structures compared to the standard approach, especially at large deformation amplitudes, as we demonstrate by visual inspection, energy, and topology analyses and also by the MolProbity service validation. Finally, our method is scalable and can be applied to very large molecular systems, such as ribosomes. Standalone executables of the NOLB normal-mode analysis method are available at https://team.inria.fr/nano-d/software/nolb-normal-modes/ . A graphical user interface created for the SAMSON software platform will be made available at https://www.samson-connect.net .
    Language English
    Publishing date 2017-05-09
    Publishing country United States
    Document type Journal Article
    ISSN 1549-9626
    ISSN (online) 1549-9626
    DOI 10.1021/acs.jctc.7b00197
    Database MEDical Literature Analysis and Retrieval System OnLINE

    More links

    Kategorien

To top