Systematic protein complex profiling and differential analysis from co-fractionation mass spectrometry data

2020 
Protein complexes, macro-molecular assemblies of two or more proteins, play vital roles in numerous cellular activities and collectively determine the cellular state. Despite the availability of a range of methods for analysing protein complexes, systematic analysis of complexes under multiple conditions has remained challenging. Approaches based on biochemical fractionation of intact, native complexes and correlation of protein profiles have shown promise, for instance in the combination of size exclusion chromatography (SEC) with accurate protein quantification by SWATH/DIA-MS. However, most approaches for interpreting co-fractionation datasets to yield complex composition, abundance and rearrangements between samples depend heavily on prior evidence. We introduce PCprophet, a computational framework to identify novel protein complexes from SEC-SWATH-MS data and to characterize their changes across different experimental conditions. We demonstrate accurate prediction of protein complexes (AUC >0.99 and accuracy around 97%) via five-fold cross-validation on SEC-SWATH-MS data, show improved performance over state-of-the-art approaches on multiple annotated co-fractionation datasets, and describe a Bayesian approach to analyse altered protein-protein interactions across conditions. PCprophet is a generic computational tool consisting of modules for data pre-processing, hypothesis generation, machine-learning prediction, post-prediction processing, and differential analysis. It can be applied to any co-fractionation MS dataset, independent of separation or quantitative LC-MS workflow employed, and to support the detection and quantitative tracking of novel protein complexes and their physiological dynamics.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    45
    References
    1
    Citations
    NaN
    KQI
    []