LIVIVO - The Search Portal for Life Sciences

zur deutschen Oberfläche wechseln
Advanced search

Your last searches

  1. AU=Xia Kelin
  2. AU="Ria Armunanto"
  3. AU="Hao, Jingjie"
  4. AU="Manhard, John"
  5. AU="C Gräf"
  6. AU="Fisher, Aron B."

Search results

Result 1 - 10 of total 126

Search options

  1. Book: Protein cages

    Ueno, Takafumi / Lim, Sierin / Xia, Kelin

    design, structure, and applications

    (Methods in molecular biology ; 2671 ; Springer protocols)

    2023  

    Author's details edited by Takafumi Ueno, Sierin Lim, Kelin Xia
    Series title Methods in molecular biology ; 2671
    Springer protocols
    Collection
    Keywords Proteins
    Subject code 572.6
    Language English
    Size xii, 422 Seiten, Illustrationen, 26 cm
    Publisher Humana Press
    Publishing place New York, NY
    Publishing country United States
    Document type Book
    HBZ-ID HT030057637
    ISBN 978-1-0716-3221-5 ; 9781071632222 ; 1-0716-3221-3 ; 1071632221
    Database Catalogue ZB MED Medicine, Health

    More links

    Kategorien

  2. Article ; Online: Integration of persistent Laplacian and pre-trained transformer for protein solubility changes upon mutation.

    Wee, JunJie / Chen, Jiahui / Xia, Kelin / Wei, Guo-Wei

    Computers in biology and medicine

    2024  Volume 169, Page(s) 107918

    Abstract: Protein mutations can significantly influence protein solubility, which results in altered protein functions and leads to various diseases. Despite tremendous effort, machine learning prediction of protein solubility changes upon mutation remains a ... ...

    Abstract Protein mutations can significantly influence protein solubility, which results in altered protein functions and leads to various diseases. Despite tremendous effort, machine learning prediction of protein solubility changes upon mutation remains a challenging task as indicated by the poor scores of normalized Correct Prediction Ratio (CPR). Part of the challenge stems from the fact that there is no three-dimensional (3D) structures for the wild-type and mutant proteins. This work integrates persistent Laplacians and pre-trained Transformer for the task. The Transformer, pretrained with hundreds of millions of protein sequences, embeds wild-type and mutant sequences, while persistent Laplacians track the topological invariant change and homotopic shape evolution induced by mutations in 3D protein structures, which are rendered from AlphaFold2. The resulting machine learning model was trained on an extensive data set labeled with three solubility types. Our model outperforms all existing predictive methods and improves the state-of-the-art up to 15%.
    MeSH term(s) Solubility ; Amino Acid Sequence ; Machine Learning ; Mutation
    Language English
    Publishing date 2024-01-03
    Publishing country United States
    Document type Journal Article
    ZDB-ID 127557-4
    ISSN 1879-0534 ; 0010-4825
    ISSN (online) 1879-0534
    ISSN 0010-4825
    DOI 10.1016/j.compbiomed.2024.107918
    Database MEDical Literature Analysis and Retrieval System OnLINE

    More links

    Kategorien

  3. Article ; Online: Coarse-Grained Models for Vault Normal Model Analysis.

    Anand, D Vijay / Wei, Ronald Koh Joon / Xia, Kelin

    Methods in molecular biology (Clifton, N.J.)

    2023  Volume 2671, Page(s) 307–318

    Abstract: Recent experiments have shown that the molecular complex of vault has large conformational changes at its shoulder and cap regions in solution. From the comparison of two configuration structures, it has been found that the shoulder region can twist and ... ...

    Abstract Recent experiments have shown that the molecular complex of vault has large conformational changes at its shoulder and cap regions in solution. From the comparison of two configuration structures, it has been found that the shoulder region can twist and move outward, while the cap region will rotate and push upward correspondingly. To further understand these experimental results, in this paper, we study the vault dynamics for the first time. Since vault has an extremely large-sized structure with around 63,336 C
    MeSH term(s) Anisotropy ; Rotation
    Language English
    Publishing date 2023-06-12
    Publishing country United States
    Document type Journal Article ; Research Support, Non-U.S. Gov't
    ISSN 1940-6029
    ISSN (online) 1940-6029
    DOI 10.1007/978-1-0716-3222-2_17
    Database MEDical Literature Analysis and Retrieval System OnLINE

    More links

    Kategorien

  4. Article ; Online: Sequence-based multiscale modeling for high-throughput chromosome conformation capture (Hi-C) data analysis.

    Kelin Xia

    PLoS ONE, Vol 13, Iss 2, p e

    2018  Volume 0191899

    Abstract: In this paper, we introduce sequence-based multiscale modeling for biomolecular data analysis. We employ spectral clustering method in our modeling and reveal the difference between sequence-based global scale clustering and local scale clustering. ... ...

    Abstract In this paper, we introduce sequence-based multiscale modeling for biomolecular data analysis. We employ spectral clustering method in our modeling and reveal the difference between sequence-based global scale clustering and local scale clustering. Essentially, two types of distances, i.e., Euclidean (or spatial) distance and genomic (or sequential) distance, can be used in data clustering. Clusters from sequence-based global scale models optimize spatial distances, meaning spatially adjacent loci are more likely to be assigned into the same cluster. Sequence-based local scale models, on the other hand, result in clusters that optimize genomic distances. That is to say, in these models, sequentially adjoining loci tend to be cluster together. We propose two sequence-based multiscale models (SeqMMs) for the study of chromosome hierarchical structures, including genomic compartments and topological associated domains (TADs). We find that genomic compartments are determined only by global scale information in the Hi-C data. The removal of all the local interactions within a band region as large as 10 Mb in genomic distance has almost no significant influence on the final compartment results. Further, in TAD analysis, we find that when the sequential scale is small, a tiny variation of diagonal band region in a contact map will result in a great change in the predicted TAD boundaries. When the scale value is larger than a threshold value, the TAD boundaries become very consistent. This threshold value is highly related to TAD sizes. By the comparison of our results with those previously obtained using a spectral clustering model, we find that our method is more robust and reliable. Finally, we demonstrate that almost all TAD boundaries from both clustering methods are local minimum of a TAD summation function.
    Keywords Medicine ; R ; Science ; Q
    Subject code 006
    Language English
    Publishing date 2018-01-01T00:00:00Z
    Publisher Public Library of Science (PLoS)
    Document type Article ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)

    More links

    Kategorien

  5. Article ; Online: Persistent Dirac for molecular representation

    Junjie Wee / Ginestra Bianconi / Kelin Xia

    Scientific Reports, Vol 13, Iss 1, Pp 1-

    2023  Volume 19

    Abstract: Abstract Molecular representations are of fundamental importance for the modeling and analysing molecular systems. The successes in drug design and materials discovery have been greatly contributed by molecular representation models. In this paper, we ... ...

    Abstract Abstract Molecular representations are of fundamental importance for the modeling and analysing molecular systems. The successes in drug design and materials discovery have been greatly contributed by molecular representation models. In this paper, we present a computational framework for molecular representation that is mathematically rigorous and based on the persistent Dirac operator. The properties of the discrete weighted and unweighted Dirac matrix are systematically discussed, and the biological meanings of both homological and non-homological eigenvectors are studied. We also evaluate the impact of various weighting schemes on the weighted Dirac matrix. Additionally, a set of physical persistent attributes that characterize the persistence and variation of spectrum properties of Dirac matrices during a filtration process is proposed to be molecular fingerprints. Our persistent attributes are used to classify molecular configurations of nine different types of organic-inorganic halide perovskites. The combination of persistent attributes with gradient boosting tree model has achieved great success in molecular solvation free energy prediction. The results show that our model is effective in characterizing the molecular structures, demonstrating the power of our molecular representation and featurization approach.
    Keywords Medicine ; R ; Science ; Q
    Subject code 541
    Language English
    Publishing date 2023-07-01T00:00:00Z
    Publisher Nature Portfolio
    Document type Article ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)

    More links

    Kategorien

  6. Article ; Online: Persistent spectral based ensemble learning (PerSpect-EL) for protein-protein binding affinity prediction.

    Wee, JunJie / Xia, Kelin

    Briefings in bioinformatics

    2022  Volume 23, Issue 2

    Abstract: Protein-protein interactions (PPIs) play a significant role in nearly all cellular and biological activities. Data-driven machine learning models have demonstrated great power in PPIs. However, the design of efficient molecular featurization poses a ... ...

    Abstract Protein-protein interactions (PPIs) play a significant role in nearly all cellular and biological activities. Data-driven machine learning models have demonstrated great power in PPIs. However, the design of efficient molecular featurization poses a great challenge for all learning models for PPIs. Here, we propose persistent spectral (PerSpect) based PPI representation and featurization, and PerSpect-based ensemble learning (PerSpect-EL) models for PPI binding affinity prediction, for the first time. In our model, a sequence of Hodge (or combinatorial) Laplacian (HL) matrices at various different scales are generated from a specially designed filtration process. PerSpect attributes, which are statistical and combinatorial properties of spectrum information from these HL matrices, are used as features for PPI characterization. Each PerSpect attribute is input into a 1D convolutional neural network (CNN), and these CNN networks are stacked together in our PerSpect-based ensemble learning models. We systematically test our model on the two most commonly used datasets, i.e. SKEMPI and AB-Bind. It has been found that our model can achieve state-of-the-art results and outperform all existing models to the best of our knowledge.
    MeSH term(s) Machine Learning ; Neural Networks, Computer ; Protein Binding
    Language English
    Publishing date 2022-02-21
    Publishing country England
    Document type Journal Article ; Research Support, Non-U.S. Gov't
    ZDB-ID 2068142-2
    ISSN 1477-4054 ; 1467-5463
    ISSN (online) 1477-4054
    ISSN 1467-5463
    DOI 10.1093/bib/bbac024
    Database MEDical Literature Analysis and Retrieval System OnLINE

    More links

    Kategorien

  7. Article ; Online: Persistent Dirac for molecular representation.

    Wee, Junjie / Bianconi, Ginestra / Xia, Kelin

    Scientific reports

    2023  Volume 13, Issue 1, Page(s) 11183

    Abstract: Molecular representations are of fundamental importance for the modeling and analysing molecular systems. The successes in drug design and materials discovery have been greatly contributed by molecular representation models. In this paper, we present a ... ...

    Abstract Molecular representations are of fundamental importance for the modeling and analysing molecular systems. The successes in drug design and materials discovery have been greatly contributed by molecular representation models. In this paper, we present a computational framework for molecular representation that is mathematically rigorous and based on the persistent Dirac operator. The properties of the discrete weighted and unweighted Dirac matrix are systematically discussed, and the biological meanings of both homological and non-homological eigenvectors are studied. We also evaluate the impact of various weighting schemes on the weighted Dirac matrix. Additionally, a set of physical persistent attributes that characterize the persistence and variation of spectrum properties of Dirac matrices during a filtration process is proposed to be molecular fingerprints. Our persistent attributes are used to classify molecular configurations of nine different types of organic-inorganic halide perovskites. The combination of persistent attributes with gradient boosting tree model has achieved great success in molecular solvation free energy prediction. The results show that our model is effective in characterizing the molecular structures, demonstrating the power of our molecular representation and featurization approach.
    Language English
    Publishing date 2023-07-11
    Publishing country England
    Document type Journal Article
    ZDB-ID 2615211-3
    ISSN 2045-2322 ; 2045-2322
    ISSN (online) 2045-2322
    ISSN 2045-2322
    DOI 10.1038/s41598-023-37853-z
    Database MEDical Literature Analysis and Retrieval System OnLINE

    More links

    Kategorien

  8. Article ; Online: Laplacian Spectra of Persistent Structures in Taiwan, Singapore, and US Stock Markets.

    Yen, Peter Tsung-Wen / Xia, Kelin / Cheong, Siew Ann

    Entropy (Basel, Switzerland)

    2023  Volume 25, Issue 6

    Abstract: An important challenge in the study of complex systems is to identify appropriate effective variables at different times. In this paper, we explain why structures that are persistent with respect to changes in length and time scales are proper effective ... ...

    Abstract An important challenge in the study of complex systems is to identify appropriate effective variables at different times. In this paper, we explain why structures that are persistent with respect to changes in length and time scales are proper effective variables, and illustrate how persistent structures can be identified from the spectra and Fiedler vector of the graph Laplacian at different stages of the topological data analysis (TDA) filtration process for twelve toy models. We then investigated four market crashes, three of which were related to the COVID-19 pandemic. In all four crashes, a persistent gap opens up in the Laplacian spectra when we go from a normal phase to a crash phase. In the crash phase, the persistent structure associated with the gap remains distinguishable up to a characteristic length scale ϵ* where the first non-zero Laplacian eigenvalue changes most rapidly. Before ϵ*, the distribution of components in the Fiedler vector is predominantly bi-modal, and this distribution becomes uni-modal after ϵ*. Our findings hint at the possibility of understanding market crashs in terms of both continuous and discontinuous changes. Beyond the graph Laplacian, we can also employ Hodge Laplacians of higher order for future research.
    Language English
    Publishing date 2023-05-25
    Publishing country Switzerland
    Document type Journal Article
    ZDB-ID 2014734-X
    ISSN 1099-4300 ; 1099-4300
    ISSN (online) 1099-4300
    ISSN 1099-4300
    DOI 10.3390/e25060846
    Database MEDical Literature Analysis and Retrieval System OnLINE

    More links

    Kategorien

  9. Article ; Online: Molecular geometric deep learning.

    Shen, Cong / Luo, Jiawei / Xia, Kelin

    Cell reports methods

    2023  Volume 3, Issue 11, Page(s) 100621

    Abstract: Molecular representation learning plays an important role in molecular property prediction. Existing molecular property prediction models rely on the de facto standard of covalent-bond-based molecular graphs for representing molecular topology at the ... ...

    Abstract Molecular representation learning plays an important role in molecular property prediction. Existing molecular property prediction models rely on the de facto standard of covalent-bond-based molecular graphs for representing molecular topology at the atomic level and totally ignore the non-covalent interactions within the molecule. In this study, we propose a molecular geometric deep learning model to predict the properties of molecules that aims to comprehensively consider the information of covalent and non-covalent interactions of molecules. The essential idea is to incorporate a more general molecular representation into geometric deep learning (GDL) models. We systematically test molecular GDL (Mol-GDL) on fourteen commonly used benchmark datasets. The results show that Mol-GDL can achieve a better performance than state-of-the-art (SOTA) methods. Extensive tests have demonstrated the important role of non-covalent interactions in molecular property prediction and the effectiveness of Mol-GDL models.
    MeSH term(s) Deep Learning ; Benchmarking ; Models, Molecular
    Language English
    Publishing date 2023-10-23
    Publishing country United States
    Document type Journal Article
    ISSN 2667-2375
    ISSN (online) 2667-2375
    DOI 10.1016/j.crmeth.2023.100621
    Database MEDical Literature Analysis and Retrieval System OnLINE

    More links

    Kategorien

  10. Article ; Online: Persistent Homology for RNA Data Analysis.

    Xia, Kelin / Liu, Xiang / Wee, JunJie

    Methods in molecular biology (Clifton, N.J.)

    2023  Volume 2627, Page(s) 211–229

    Abstract: Molecular representations are of great importance for machine learning models in RNA data analysis. Essentially, efficient molecular descriptors or fingerprints that characterize the intrinsic structural and interactional information of RNAs can ... ...

    Abstract Molecular representations are of great importance for machine learning models in RNA data analysis. Essentially, efficient molecular descriptors or fingerprints that characterize the intrinsic structural and interactional information of RNAs can significantly boost the performance of all learning modeling. In this paper, we introduce two persistent models, including persistent homology and persistent spectral, for RNA structure and interaction representations and their applications in RNA data analysis. Different from traditional geometric and graph representations, persistent homology is built on simplicial complex, which is a generalization of graph models to higher-dimensional situations. Hypergraph is a further generalization of simplicial complexes and hypergraph-based embedded persistent homology has been proposed recently. Moreover, persistent spectral models, which combine filtration process with spectral models, including spectral graph, spectral simplicial complex, and spectral hypergraph, are proposed for molecular representation. The persistent attributes for RNAs can be obtained from these two persistent models and further combined with machine learning models for RNA structure, flexibility, dynamics, and function analysis.
    MeSH term(s) RNA/genetics ; Data Analysis
    Chemical Substances RNA (63231-63-0)
    Language English
    Publishing date 2023-03-24
    Publishing country United States
    Document type Journal Article
    ISSN 1940-6029
    ISSN (online) 1940-6029
    DOI 10.1007/978-1-0716-2974-1_12
    Database MEDical Literature Analysis and Retrieval System OnLINE

    More links

    Kategorien

To top