Abstract Modular response analysis (MRA) is a widely used modeling technique to uncover coupling strengths in molecular networks under a steady-state condition by means of perturbation experiments. We propose an extension of this methodology to search genomic data for new associations with a network modeled by MRA and to improve the predictive accuracy of MRA models. These extensions are illustrated by exploring the cross talk between estrogen and retinoic acid receptors, two nuclear receptors implicated in several hormone-driven cancers such as breast. We also present a novel, rigorous and elegant mathematical derivation of MRA equations, which is the foundation of this work and of an R package that is freely available at https://github.com/bioinfo-ircm/aiMeRA/ . This mathematical analysis should facilitate MRA understanding by newcomers. Author summary Estrogen and retinoic acid receptors play an important role in several hormone-driven cancers and share co-regulators and co-repressors that modulate their transcription factor activity. The literature shows evidence for crosstalk between these two receptors and suggests that spatial competition on the promoters could be a mechanism. We used MRA to explore the possibility that key co-repressors, i.e., NRIP1 (RIP140) and LCoR could also mediate crosstalk by exploiting new quantitative (qPCR) and RNA sequencing data. The transcription factor role of the receptors and the availability of genome-wide data enabled us to explore extensions of the MRA methodology to explore genome-wide data sets a posteriori , searching for genes associated with a molecular network that was sampled by perturbation experiments. Despite nearly two decades of use, we felt that MRA lacked a systematic mathematical derivation. We present here an elegant and rather simple analysis that should greatly facilitate newcomers’ understanding of MRA details. Moreover, an easy-to-use R package is released that should make MRA accessible to biology labs without mathematical expertise. Quantitative data are embedded in the R package and RNA sequencing data are available from GEO.
Background
The interactions between proteins and nucleic acids have a fundamental function in many biological processes, including gene transcription, RNA homeostasis, protein translation and pathogen sensing for innate immunity. While our knowledge of the ensemble of proteins that bind individual mRNAs in mammalian cells has been greatly augmented by recent surveys, no systematic study on the non-sequence-specific engagement of native human proteins with various types of nucleic acids has been reported.