LIVIVO - The Search Portal for Life Sciences

zur deutschen Oberfläche wechseln
Advanced search

Search results

Result 1 - 5 of total 5

Search options

  1. Article: DETIRE: a hybrid deep learning model for identifying viral sequences from metagenomes.

    Miao, Yan / Bian, Jilong / Dong, Guanghui / Dai, Tianhong

    Frontiers in microbiology

    2023  Volume 14, Page(s) 1169791

    Abstract: A metagenome contains all DNA sequences from an environmental sample, including viruses, bacteria, archaea, and eukaryotes. Since viruses are of huge abundance and have caused vast mortality and morbidity to human society in history as a type of major ... ...

    Abstract A metagenome contains all DNA sequences from an environmental sample, including viruses, bacteria, archaea, and eukaryotes. Since viruses are of huge abundance and have caused vast mortality and morbidity to human society in history as a type of major pathogens, detecting viruses from metagenomes plays a crucial role in analyzing the viral component of samples and is the very first step for clinical diagnosis. However, detecting viral fragments directly from the metagenomes is still a tough issue because of the existence of a huge number of short sequences. In this study a hybrid Deep lEarning model for idenTifying vIral sequences fRom mEtagenomes (DETIRE) is proposed to solve the problem. First, the graph-based nucleotide sequence embedding strategy is utilized to enrich the expression of DNA sequences by training an embedding matrix. Then, the spatial and sequential features are extracted by trained CNN and BiLSTM networks, respectively, to enrich the features of short sequences. Finally, the two sets of features are weighted combined for the final decision. Trained by 220,000 sequences of 500 bp subsampled from the Virus and Host RefSeq genomes, DETIRE identifies more short viral sequences (<1,000 bp) than the three latest methods, such as DeepVirFinder, PPR-Meta, and CHEER. DETIRE is freely available at Github (https://github.com/crazyinter/DETIRE).
    Language English
    Publishing date 2023-06-16
    Publishing country Switzerland
    Document type Journal Article
    ZDB-ID 2587354-4
    ISSN 1664-302X
    ISSN 1664-302X
    DOI 10.3389/fmicb.2023.1169791
    Database MEDical Literature Analysis and Retrieval System OnLINE

    More links

    Kategorien

  2. Article ; Online: ACP-ML: A sequence-based method for anticancer peptide prediction.

    Bian, Jilong / Liu, Xuan / Dong, Guanghui / Hou, Chang / Huang, Shan / Zhang, Dandan

    Computers in biology and medicine

    2024  Volume 170, Page(s) 108063

    Abstract: Cancer is a serious malignant tumor and is difficult to cure. Chemotherapy, as a primary treatment for cancer, causes significant harm to normal cells in the body and is often accompanied by serious side effects. Recently, anti-cancer peptides (ACPs) as ... ...

    Abstract Cancer is a serious malignant tumor and is difficult to cure. Chemotherapy, as a primary treatment for cancer, causes significant harm to normal cells in the body and is often accompanied by serious side effects. Recently, anti-cancer peptides (ACPs) as a type of protein for treating cancers dominated research into the development of new anti-tumor drugs because of their ability to specifically target and destroy cancer cells. The screening of proteins with cancer-inhibiting properties from a large pool of proteins is key to the development of anti-tumor drugs. However, it is expensive and inefficient to accurately identify protein functions only through biological experiments due to their complex structure. Therefore, we propose a new prediction model ACP-ML to effectively predict ACPs. In terms of feature extraction, DPC, PseAAC, CTDC, CTDT and CS-Pse-PSSM features were used and the most optimal feature set was selected by comparing combinations of these features. Then, a two-step feature selection process using MRMD and RFE algorithms was performed to determine the most crucial features from the most optimal feature set for identifying ACPs. Furthermore, we assessed the classification accuracy of single learning models and different strategies-based ensemble models through ten-fold cross-validation. Ultimately, a voting-based ensemble learning method is developed to predict ACPs. To validate its effectiveness, two independent test sets were used to perform tests, achieving accuracy of 90.891 % and 92.578 % respectively. Compared with existing anticancer peptide prediction algorithms, the proposed feature processing method is more effective, and the proposed ensemble model ACP-ML exhibits stronger generalization capability and higher accuracy.
    MeSH term(s) Humans ; Computational Biology/methods ; Peptides/chemistry ; Proteins ; Algorithms ; Neoplasms/drug therapy ; Antineoplastic Agents/pharmacology ; Antineoplastic Agents/therapeutic use
    Chemical Substances Peptides ; Proteins ; Antineoplastic Agents
    Language English
    Publishing date 2024-01-28
    Publishing country United States
    Document type Journal Article
    ZDB-ID 127557-4
    ISSN 1879-0534 ; 0010-4825
    ISSN (online) 1879-0534
    ISSN 0010-4825
    DOI 10.1016/j.compbiomed.2024.108063
    Database MEDical Literature Analysis and Retrieval System OnLINE

    More links

    Kategorien

  3. Article ; Online: MCANet: shared-weight-based MultiheadCrossAttention network for drug-target interaction prediction.

    Bian, Jilong / Zhang, Xi / Zhang, Xiying / Xu, Dali / Wang, Guohua

    Briefings in bioinformatics

    2023  Volume 24, Issue 2

    Abstract: Accurate and effective drug-target interaction (DTI) prediction can greatly shorten the drug development lifecycle and reduce the cost of drug development. In the deep-learning-based paradigm for predicting DTI, robust drug and protein feature ... ...

    Abstract Accurate and effective drug-target interaction (DTI) prediction can greatly shorten the drug development lifecycle and reduce the cost of drug development. In the deep-learning-based paradigm for predicting DTI, robust drug and protein feature representations and their interaction features play a key role in improving the accuracy of DTI prediction. Additionally, the class imbalance problem and the overfitting problem in the drug-target dataset can also affect the prediction accuracy, and reducing the consumption of computational resources and speeding up the training process are also critical considerations. In this paper, we propose shared-weight-based MultiheadCrossAttention, a precise and concise attention mechanism that can establish the association between target and drug, making our models more accurate and faster. Then, we use the cross-attention mechanism to construct two models: MCANet and MCANet-B. In MCANet, the cross-attention mechanism is used to extract the interaction features between drugs and proteins for improving the feature representation ability of drugs and proteins, and the PolyLoss loss function is applied to alleviate the overfitting problem and the class imbalance problem in the drug-target dataset. In MCANet-B, the robustness of the model is improved by combining multiple MCANet models and prediction accuracy further increases. We train and evaluate our proposed methods on six public drug-target datasets and achieve state-of-the-art results. In comparison with other baselines, MCANet saves considerable computational resources while maintaining accuracy in the leading position; however, MCANet-B greatly improves prediction accuracy by combining multiple models while maintaining a balance between computational resource consumption and prediction accuracy.
    MeSH term(s) Drug Discovery/methods ; Drug Development ; Proteins/metabolism ; Drug Delivery Systems ; Protein Domains
    Chemical Substances Proteins
    Language English
    Publishing date 2023-03-09
    Publishing country England
    Document type Journal Article ; Research Support, Non-U.S. Gov't
    ZDB-ID 2068142-2
    ISSN 1477-4054 ; 1467-5463
    ISSN (online) 1477-4054
    ISSN 1467-5463
    DOI 10.1093/bib/bbad082
    Database MEDical Literature Analysis and Retrieval System OnLINE

    More links

    Kategorien

  4. Article ; Online: Unsupervised construction of gene regulatory network based on single-cell multi-omics data of colorectal cancer.

    Cui, Lingyu / Li, Hongfei / Bian, Jilong / Wang, Guohua / Liang, Yingjian

    Briefings in bioinformatics

    2023  Volume 24, Issue 2

    Abstract: Identifying gene regulatory networks (GRNs) at the resolution of single cells has long been a great challenge, and the advent of single-cell multi-omics data provides unprecedented opportunities to construct GRNs. Here, we propose a novel strategy to ... ...

    Abstract Identifying gene regulatory networks (GRNs) at the resolution of single cells has long been a great challenge, and the advent of single-cell multi-omics data provides unprecedented opportunities to construct GRNs. Here, we propose a novel strategy to integrate omics datasets of single-cell ribonucleic acid sequencing and single-cell Assay for Transposase-Accessible Chromatin using sequencing, and using an unsupervised learning neural network to divide the samples with high copy number variation scores, which are used to infer the GRN in each gene block. Accuracy validation of proposed strategy shows that approximately 80% of transcription factors are directly associated with cancer, colorectal cancer, malignancy and disease by TRRUST; and most transcription factors are prone to produce multiple transcript variants and lead to tumorigenesis by RegNetwork database, respectively. The source code access are available at: https://github.com/Cuily-v/Colorectal_cancer.
    MeSH term(s) Humans ; Gene Regulatory Networks ; Multiomics ; DNA Copy Number Variations ; Algorithms ; Transcription Factors/genetics ; Colorectal Neoplasms/genetics
    Chemical Substances Transcription Factors
    Language English
    Publishing date 2023-03-27
    Publishing country England
    Document type Journal Article ; Research Support, Non-U.S. Gov't
    ZDB-ID 2068142-2
    ISSN 1477-4054 ; 1467-5463
    ISSN (online) 1477-4054
    ISSN 1467-5463
    DOI 10.1093/bib/bbad011
    Database MEDical Literature Analysis and Retrieval System OnLINE

    More links

    Kategorien

  5. Article ; Online: Feature selection combined with top-down and bottom-up strategies for survival analysis: A case of prognostic prediction in glioblastoma.

    Liu, Yanan / Zhao, Xudong / Bian, Jilong / Wang, Guohua

    Computers in biology and medicine

    2022  Volume 153, Page(s) 106486

    Abstract: Over the last decades, molecular signatures have attracted extensive attention in cancer research. However, most of the reported biomarkers show a weak distinguishing ability in predicting the survival risks of patients. Actually, univariate analysis is ... ...

    Abstract Over the last decades, molecular signatures have attracted extensive attention in cancer research. However, most of the reported biomarkers show a weak distinguishing ability in predicting the survival risks of patients. Actually, univariate analysis is generally considered in regression analysis, which makes the existing statistical methods ineffective. Furthermore, there is too much human involvement in the ways of classifying patients with high and low risk. Last but not least, the participation of therapy after conservative surgery also makes the survival analysis more complex. In order to solve these problems, we propose a solid method of feature selection which combines top-down and bottom-up strategies. The top-down strategy is to randomly extract some genes each time and select candidate genes through cumulative voting. The bottom-up strategy is to fully enumerate the selected genes and to use a clustering algorithm to classify samples. We analyzed glioblastoma data from the Cancer Genome Atlas (TCGA) and got candidate signatures. The results of simulation data, as well as an independent test set the Chinese Glioma Genome Atlas (CGGA), verified the reliability of the method and validity of the selected features.
    MeSH term(s) Humans ; Glioblastoma/genetics ; Gene Expression Profiling/methods ; Prognosis ; Reproducibility of Results ; Survival Analysis
    Language English
    Publishing date 2022-12-29
    Publishing country United States
    Document type Journal Article ; Research Support, Non-U.S. Gov't
    ZDB-ID 127557-4
    ISSN 1879-0534 ; 0010-4825
    ISSN (online) 1879-0534
    ISSN 0010-4825
    DOI 10.1016/j.compbiomed.2022.106486
    Database MEDical Literature Analysis and Retrieval System OnLINE

    More links

    Kategorien

To top