Artikel ; Online: LDmat: efficiently queryable compression of linkage disequilibrium matrices.
Bioinformatics (Oxford, England)
2023 Band 39, Heft 2
Abstract: Motivation: Linkage disequilibrium (LD) matrices derived from large populations are widely used in population genetics in fine-mapping, LD score regression, and linear mixed models for Genome-wide Association Studies (GWAS). However, these matrices can ... ...
Abstract | Motivation: Linkage disequilibrium (LD) matrices derived from large populations are widely used in population genetics in fine-mapping, LD score regression, and linear mixed models for Genome-wide Association Studies (GWAS). However, these matrices can reach large sizes when they are derived from millions of individuals; hence, moving, sharing and extracting granular information from this large amount of data can be cumbersome. Results: We sought to address the need for compressing and easily querying large LD matrices by developing LDmat. LDmat is a standalone tool to compress large LD matrices in an HDF5 file format and query these compressed matrices. It can extract submatrices corresponding to a sub-region of the genome, a list of select loci, and loci within a minor allele frequency range. LDmat can also rebuild the original file formats from the compressed files. Availability and implementation: LDmat is implemented in python, and can be installed on Unix systems with the command 'pip install ldmat'. It can also be accessed through https://github.com/G2Lab/ldmat and https://pypi.org/project/ldmat/. Supplementary information: Supplementary data are available at Bioinformatics online. |
|||||
---|---|---|---|---|---|---|
Mesh-Begriff(e) | Humans ; Linkage Disequilibrium ; Software ; Genome-Wide Association Study ; Data Compression ; Genome | |||||
Sprache | Englisch | |||||
Erscheinungsdatum | 2023-03-13 | |||||
Erscheinungsland | England | |||||
Dokumenttyp | Journal Article ; Research Support, N.I.H., Extramural | |||||
ZDB-ID | 1422668-6 | |||||
ISSN | 1367-4811 ; 1367-4803 | |||||
ISSN (online) | 1367-4811 | |||||
ISSN | 1367-4803 | |||||
DOI | 10.1093/bioinformatics/btad092 | |||||
Signatur |
|
|||||
Datenquelle | MEDical Literature Analysis and Retrieval System OnLINE |
Zusatzmaterialien
Kategorien
Verfügbar in ZB MED Köln/Königswinter
Zs.A 2374: Hefte anzeigen | Standort: Je nach Verfügbarkeit (siehe Angabe bei Bestand) bis Jg. 1994: Bestellungen von Artikeln über das Online-Bestellformular Jg. 1995 - 2021: Lesesall (2.OG) ab Jg. 2022: Lesesaal (EG) |
Über subito bestellen
Dieser Service ist kostenpflichtig (siehe Lieferbedingungen von subito). Bestellungen, die einen Artikel nebst Supplementary Material umfassen, werden grundsätzlich wie mehrfache Bestellungen bearbeitet. Gebühren fallen in diesen Fällen für jede einzelne Bestellung an.