Article ; Online: Genotype imputation accuracy and the quality metrics of the minor ancestry in multi-ancestry reference panels.
2024 Volume 25, Issue 1
Abstract: Large-scale imputation reference panels are currently available and have contributed to efficient genome-wide association studies through genotype imputation. However, whether large-size multi-ancestry or small-size population-specific reference panels ... ...
Abstract | Large-scale imputation reference panels are currently available and have contributed to efficient genome-wide association studies through genotype imputation. However, whether large-size multi-ancestry or small-size population-specific reference panels are the optimal choices for under-represented populations continues to be debated. We imputed genotypes of East Asian (180k Japanese) subjects using the Trans-Omics for Precision Medicine reference panel and found that the standard imputation quality metric (Rsq) overestimated dosage r2 (squared correlation between imputed dosage and true genotype) particularly in marginal-quality bins. Variance component analysis of Rsq revealed that the increased imputed-genotype certainty (dosages closer to 0, 1 or 2) caused upward bias, indicating some systemic bias in the imputation. Through systematic simulations using different template switching rates (θ value) in the hidden Markov model, we revealed that the lower θ value increased the imputed-genotype certainty and Rsq; however, dosage r2 was insensitive to the θ value, thereby causing a deviation. In simulated reference panels with different sizes and ancestral diversities, the θ value estimates from Minimac decreased with the size of a single ancestry and increased with the ancestral diversity. Thus, Rsq could be deviated from dosage r2 for a subpopulation in the multi-ancestry panel, and the deviation represents different imputed-dosage distributions. Finally, despite the impact of the θ value, distant ancestries in the reference panel contributed only a few additional variants passing a predefined Rsq threshold. We conclude that the θ value substantially impacts the imputed dosage and the imputation quality metric value. |
---|---|
MeSH term(s) | Humans ; Genome-Wide Association Study ; Gene Frequency ; Polymorphism, Single Nucleotide ; Genotype |
Language | English |
Publishing date | 2024-01-14 |
Publishing country | England |
Document type | Journal Article ; Research Support, Non-U.S. Gov't |
ZDB-ID | 2068142-2 |
ISSN | 1477-4054 ; 1467-5463 |
ISSN (online) | 1477-4054 |
ISSN | 1467-5463 |
DOI | 10.1093/bib/bbad509 |
Database | MEDical Literature Analysis and Retrieval System OnLINE |
More links
Kategorien
In stock of ZB MED Cologne/Königswinter
Zs.A 6262: Show issues | Location: Je nach Verfügbarkeit (siehe Angabe bei Bestand) bis Jg. 2021: Bestellungen von Artikeln über das Online-Bestellformular ab Jg. 2022: Lesesaal (EG) |
Order via subito
This service is chargeable due to the Delivery terms set by subito. Orders including an article and supplementary material will be classified as separate orders. In these cases, fees will be demanded for each order.