Statistical Data Analysis and Modeling

2016 
The availability of large structured datasets has prompted the need for efficient data analysis and modeling techniques. In systems biology, data-driven modeling approaches create models of complex cellular systems without making assumptions about the underlying mechanisms. In this chapter, we will discuss eigenvalue-based approaches, which identify important characteristics (information) of big datasets through decomposition and dimensionality reduction. We intend to address singular value decomposition (SVD), principle component analysis (PCA), and partial least squares regression (PLSR) approaches for data-driven modeling. In multi-linear systems (that share characteristics such as time points, measurements, etc.), tensor decomposition becomes particularly important for understanding higher-order datasets. Therefore, we will also discuss how to scale up these methods to tensor decomposition using an example dealing with host-cell responses to viral infection.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    63
    References
    2
    Citations
    NaN
    KQI
    []