Article ; Online: pyPGCF: A Python Software for Phylogenomic Analysis, Species Demarcation, Identification of Core, and Fingerprint Proteins of Bacterial Genomes That Are Important for Plants.
Methods in molecular biology (Clifton, N.J.)
2024 Volume 2788, Page(s) 139–155
Abstract: This computational protocol describes how to use pyPGCF, a python software package that runs in the linux environment, in order to analyze bacterial genomes and perform: (i) phylogenomic analysis, (ii) species demarcation, (iii) identification of the ... ...
Abstract | This computational protocol describes how to use pyPGCF, a python software package that runs in the linux environment, in order to analyze bacterial genomes and perform: (i) phylogenomic analysis, (ii) species demarcation, (iii) identification of the core proteins of a bacterial genus and its individual species, (iv) identification of species-specific fingerprint proteins that are found in all strains of a species and, at the same time, are absent from all other species of the genus, (v) functional annotation of the core and fingerprint proteins with eggNOG, and (vi) identification of secondary metabolite biosynthetic gene clusters (smBGCs) with antiSMASH. This software has already been implemented to analyze bacterial genera and species that are important for plants (e.g., Pseudomonas, Bacillus, Streptomyces). In addition, we provide a test dataset and example commands showing how to analyze 165 genomes from 55 species of the genus Bacillus. The main advantages of pyPGCF are that: (i) it uses adjustable orthology cut-offs, (ii) it identifies species-specific fingerprints, and (iii) its computational cost scales linearly with the number of genomes being analyzed. Therefore, pyPGCF is able to deal with a very large number of bacterial genomes, in reasonable timescales, using widely available levels of computing power. |
---|---|
MeSH term(s) | Software ; Genome, Bacterial ; Phylogeny ; Plants/genetics ; Plants/microbiology ; Bacterial Proteins/genetics ; Genomics/methods ; Computational Biology/methods ; Bacteria/genetics ; Bacteria/classification ; Multigene Family ; Species Specificity |
Chemical Substances | Bacterial Proteins |
Language | English |
Publishing date | 2024-04-24 |
Publishing country | United States |
Document type | Journal Article ; Research Support, Non-U.S. Gov't |
ISSN | 1940-6029 |
ISSN (online) | 1940-6029 |
DOI | 10.1007/978-1-0716-3782-1_8 |
Database | MEDical Literature Analysis and Retrieval System OnLINE |
More links
Kategorien
Order via subito
This service is chargeable due to the Delivery terms set by subito. Orders including an article and supplementary material will be classified as separate orders. In these cases, fees will be demanded for each order.
Inter-library loan at ZB MED
Your chosen title can be delivered directly to ZB MED Cologne location if you are registered as a user at ZB MED Cologne.