MAGICACT: A tool for predicting transcription factors and cofactors driving gene lists using ENCODE data
2019
Transcriptomic profiling is an immensely powerful hypothesis generating tool. However, accurately predicting the transcription factors (TFs) and cofactors that drive transcriptomic differences between samples represents a challenge and current approaches are limited by high false discovery rates. This is due to the use of TF binding sequence motifs that, due their small size, are found randomly throughout the genome, and do not allow discovery of cofactors. A second limitation is that even the most advanced approaches that use ChIPseq tracks hosted at sites such as the Encyclopedia Of DNA Elements (ENCODE) assign TFs and cofactors to genes via a binary designation of target, or non-target that ignores the intricacies of the biology behind transcriptional regulation. ENCODE archives ChIPseq tracks of 169 TFs and cofactors assayed in 91 cell lines. The algorithm presented herein, Mann whitney Analysis of Gene Cohorts for Associated Cofactors and Transcription factors (MAGICACT), uses ENCODE ChIPseq data to look for statistical enrichment of TFs and cofactors in gene bodies and flanking regions in gene sets. When compared to 2 commonly used web resources, o-Possum and Enrichr, MAGICACT was able to more accurately predict TFs and cofactors that drive gene changes in 3 settings: 1) A cell line expressing or lacking single TF, 2) Breast tumors divided along PAM50 designations and 3) Whole brain samples from WT mice or mice lacking a single TF in a particular neuronal subtype. In summary, MAGICACT is a standalone application that runs on OSX and Windows machines that produces meaningful predictions of which TFs and cofactors are enriched in a gene set.
Keywords:
- Correction
- Cite
- Save
- Machine Reading By IdeaReader
25
References
0
Citations
NaN
KQI