Using topic modeling to detect cellular crosstalk in scRNA-seq

2021 
Abstract Cell-cell interactions are vital for numerous biological processes including development, differentiation, and response to inflammation. Currently most methods for studying interactions on scRNA-seq level are based on curated databases of ligands and receptors. While those methods are useful, they are limited to our current biological knowledge. Recent advances in single cell protocols have allowed for physically interacting cells to be captured and as such we have the potential to study interactions in a complimentary way without relying on prior knowledge. We introduce a new method for detecting genes that change as a result of interaction based on Latent Dirichlet Allocation (LDA). We apply our method to synthetic datasets to demonstrate its ability to detect genes that change in an interacting population compared to a reference population. Next, we apply our approach to two datasets of physically interacting cells and identify genes that change as a result of interaction, examples include adhesion and co-stimulatory molecules which confirm physical interaction between cells. For each dataset we produce a ranking of genes that are changing in subpopulations of the interacting cells. In addition to genes, discussed in the original publications we highlight further candidates for interaction in the top 100 and 300 ranked genes. Lastly, we apply our method to a dataset generated by a standard droplet based protocol, not designed to capture interacting cells and discuss its suitability for analysing interactions. We present a method that streamlines the detection of interactions and does not require prior clustering and generation of synthetic reference profiles to detect changes in expression. Author summary While scRNA-seq research is a dynamic area, progress is lacking when it comes to developing methods that allow analysis of interaction that is independent of curated resources of known interacting pairs. Recent advances of sequencing protocols have allowed for interacting cells to be captured. We propose a novel method based on LDA that captures changes in gene expression as a result of interaction. Our method does not require prior information in the form of clustering or generation of synthetic reference profiles. We demonstrate the suitability of our approach by applying it to synthetic and real datasets and manage to capture biologically interesting candidates of interaction.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    30
    References
    0
    Citations
    NaN
    KQI
    []