Article: Improved Large-Scale Homology Search by Two-Step Seed Search Using Multiple Reduced Amino Acid Alphabets
Genes. 2021 Sept. 21, v. 12, no. 9
2021
Abstract: Metagenomic analysis, a technique used to comprehensively analyze microorganisms present in the environment, requires performing high-precision homology searches on large amounts of sequencing data, the size of which has increased dramatically with the ... ...
Abstract | Metagenomic analysis, a technique used to comprehensively analyze microorganisms present in the environment, requires performing high-precision homology searches on large amounts of sequencing data, the size of which has increased dramatically with the development of next-generation sequencing. NCBI BLAST is the most widely used software for performing homology searches, but its speed is insufficient for the throughput of current DNA sequencers. In this paper, we propose a new, high-performance homology search algorithm that employs a two-step seed search strategy using multiple reduced amino acid alphabets to identify highly similar subsequences. Additionally, we evaluated the validity of the proposed method against several existing tools. Our method was faster than any other existing program for ≤120,000 queries, while DIAMOND, an existing tool, was the fastest method for >120,000 queries. |
---|---|
Keywords | DNA ; algorithms ; amino acids ; computer software ; metagenomics |
Language | English |
Dates of publication | 2021-0921 |
Publishing place | Multidisciplinary Digital Publishing Institute |
Document type | Article |
ZDB-ID | 2527218-4 |
ISSN | 2073-4425 |
ISSN | 2073-4425 |
DOI | 10.3390/genes12091455 |
Database | NAL-Catalogue (AGRICOLA) |
More links
Kategorien
Order via subito
This service is chargeable due to the Delivery terms set by subito. Orders including an article and supplementary material will be classified as separate orders. In these cases, fees will be demanded for each order.