Article ; Online: Less-is-more: selecting transcription factor binding regions informative for motif inference.
2024 Volume 52, Issue 4, Page(s) e20
Abstract: Numerous statistical methods have emerged for inferring DNA motifs for transcription factors (TFs) from genomic regions. However, the process of selecting informative regions for motif inference remains understudied. Current approaches select regions ... ...
Abstract | Numerous statistical methods have emerged for inferring DNA motifs for transcription factors (TFs) from genomic regions. However, the process of selecting informative regions for motif inference remains understudied. Current approaches select regions with strong ChIP-seq signal for a given TF, assuming that such strong signal primarily results from specific interactions between the TF and its motif. Additionally, these selection approaches do not account for non-target motifs, i.e. motifs of other TFs; they presume the occurrence of these non-target motifs infrequent compared to that of the target motif, and thus assume these have minimal interference with the identification of the target. Leveraging extensive ChIP-seq datasets, we introduced the concept of TF signal 'crowdedness', referred to as C-score, for each genomic region. The C-score helps in highlighting TF signals arising from non-specific interactions. Moreover, by considering the C-score (and adjusting for the length of genomic regions), we can effectively mitigate interference of non-target motifs. Using these tools, we find that in many instances, strong ChIP-seq signal stems mainly from non-specific interactions, and the occurrence of non-target motifs significantly impacts the accurate inference of the target motif. Prioritizing genomic regions with reduced crowdedness and short length markedly improves motif inference. This 'less-is-more' effect suggests that ChIP-seq region selection warrants more attention. |
---|---|
MeSH term(s) | Binding Sites ; Chromatin Immunoprecipitation ; Genomics ; Nucleotide Motifs ; Protein Binding ; Transcription Factors/genetics ; Transcription Factors/metabolism |
Chemical Substances | Transcription Factors |
Language | English |
Publishing date | 2024-01-12 |
Publishing country | England |
Document type | Journal Article |
ZDB-ID | 186809-3 |
ISSN | 1362-4962 ; 1362-4954 ; 0301-5610 ; 0305-1048 |
ISSN (online) | 1362-4962 ; 1362-4954 |
ISSN | 0301-5610 ; 0305-1048 |
DOI | 10.1093/nar/gkad1240 |
Database | MEDical Literature Analysis and Retrieval System OnLINE |
More links
Kategorien
In stock of ZB MED Cologne/Königswinter
Zs.A 1067: Show issues | Location: Je nach Verfügbarkeit (siehe Angabe bei Bestand) bis Jg. 1994: Bestellungen von Artikeln über das Online-Bestellformular Jg. 1995 - 2021: Lesesall (1.OG) ab Jg. 2022: Lesesaal (EG) |
Order via subito
This service is chargeable due to the Delivery terms set by subito. Orders including an article and supplementary material will be classified as separate orders. In these cases, fees will be demanded for each order.