Article ; Online: miWords: transformer-based composite deep learning for highly accurate discovery of pre-miRNA regions across plant genomes.
2023 Volume 24, Issue 2
Abstract: Discovering pre-microRNAs (miRNAs) is the core of miRNA discovery. Using traditional sequence/structural features, many tools have been published to discover miRNAs. However, in practical applications like genomic annotations, their actual performance ... ...
Abstract | Discovering pre-microRNAs (miRNAs) is the core of miRNA discovery. Using traditional sequence/structural features, many tools have been published to discover miRNAs. However, in practical applications like genomic annotations, their actual performance has been very low. This becomes more grave in plants where unlike animals pre-miRNAs are much more complex and difficult to identify. A huge gap exists between animals and plants for the available software for miRNA discovery and species-specific miRNA information. Here, we present miWords, a composite deep learning system of transformers and convolutional neural networks which sees genome as a pool of sentences made of words with specific occurrence preferences and contexts, to accurately identify pre-miRNA regions across plant genomes. A comprehensive benchmarking was done involving >10 software representing different genre and many experimentally validated datasets. miWords emerged as the best one while breaching accuracy of 98% and performance lead of ~10%. miWords was also evaluated across Arabidopsis genome where also it outperformed the compared tools. As a demonstration, miWords was run across the tea genome, reporting 803 pre-miRNA regions, all validated by small RNA-seq reads from multiple samples, and most of them were functionally supported by the degradome sequencing data. miWords is freely available as stand-alone source codes at https://scbb.ihbt.res.in/miWords/index.php. |
---|---|
MeSH term(s) | Animals ; MicroRNAs/genetics ; MicroRNAs/chemistry ; Deep Learning ; Software ; Genomics ; Genome, Plant ; Arabidopsis/genetics |
Chemical Substances | MicroRNAs |
Language | English |
Publishing date | 2023-03-04 |
Publishing country | England |
Document type | Journal Article ; Research Support, Non-U.S. Gov't |
ZDB-ID | 2068142-2 |
ISSN | 1477-4054 ; 1467-5463 |
ISSN (online) | 1477-4054 |
ISSN | 1467-5463 |
DOI | 10.1093/bib/bbad088 |
Database | MEDical Literature Analysis and Retrieval System OnLINE |
More links
Kategorien
In stock of ZB MED Cologne/Königswinter
Zs.A 6262: Show issues | Location: Je nach Verfügbarkeit (siehe Angabe bei Bestand) bis Jg. 2021: Bestellungen von Artikeln über das Online-Bestellformular ab Jg. 2022: Lesesaal (EG) |
Order via subito
This service is chargeable due to the Delivery terms set by subito. Orders including an article and supplementary material will be classified as separate orders. In these cases, fees will be demanded for each order.