LIVIVO - The Search Portal for Life Sciences

zur deutschen Oberfläche wechseln
Advanced search

Search results

Result 1 - 1 of total 1

Search options

Article ; Online: Predicting gene expression levels from DNA sequences and post-transcriptional information with transformers.

Pipoli, Vittorio / Cappelli, Mattia / Palladini, Alessandro / Peluso, Carlo / Lovino, Marta / Ficarra, Elisa

Computer methods and programs in biomedicine

2022  Volume 225, Page(s) 107035

Abstract: Background and objectives: In the latest years, the prediction of gene expression levels has been crucial due to its potential applications in the clinics. In this context, Xpresso and others methods based on Convolutional Neural Networks and ... ...

Abstract Background and objectives: In the latest years, the prediction of gene expression levels has been crucial due to its potential applications in the clinics. In this context, Xpresso and others methods based on Convolutional Neural Networks and Transformers were firstly proposed to this aim. However, all these methods embed data with a standard one-hot encoding algorithm, resulting in impressively sparse matrices. In addition, post-transcriptional regulation processes, which are of uttermost importance in the gene expression process, are not considered in the model.
Methods: This paper presents Transformer DeepLncLoc, a novel method to predict the abundance of the mRNA (i.e., gene expression levels) by processing gene promoter sequences, managing the problem as a regression task. The model exploits a transformer-based architecture, introducing the DeepLncLoc method to perform the data embedding. Since DeepLncloc is based on word2vec algorithm, it avoids the sparse matrices problem.
Results: Post-transcriptional information related to mRNA stability and transcription factors is included in the model, leading to significantly improved performances compared to the state-of-the-art works. Transformer DeepLncLoc reached 0.76 of R
Conclusion: The Multi-Headed Attention mechanisms which characterizes the transformer methodology is suitable for modeling the interactions between DNA's locations, overcoming the recurrent models. Finally, the integration of the transcription factors data in the pipeline leads to impressive gains in predictive power.
MeSH term(s) Base Sequence ; DNA/genetics ; Gene Expression ; RNA, Messenger/genetics ; Transcription Factors/genetics
Chemical Substances RNA, Messenger ; Transcription Factors ; DNA (9007-49-2)
Language English
Publishing date 2022-08-07
Publishing country Ireland
Document type Journal Article
ZDB-ID 632564-6
ISSN 1872-7565 ; 0169-2607
ISSN (online) 1872-7565
ISSN 0169-2607
DOI 10.1016/j.cmpb.2022.107035
Shelf mark
Zs.B 521: Show issues Location:
Je nach Verfügbarkeit (siehe Angabe bei Bestand)
bis Jg. 2021: Bestellungen von Artikeln über das Online-Bestellformular
ab Jg. 2022: Lesesaal (EG)
Database MEDical Literature Analysis and Retrieval System OnLINE

More links

Kategorien

To top