Computational Epigenetics in Lung Cancer

2019 
Abstract This chapter introduces the technology of gene expression profiles preprocessing based on the complex use of bicluster analysis and objective clustering inductive technology with the use of self-organizing SOTA and density DBSCAN clustering algorithms. Implementation of this technology allows us to increase the quality of epigenetics investigation in lung cancer based on the use of gene regulatory network. Inductive methods of complex system analysis were used as the basis to implement the objective clustering inductive technology of gene expression profiles. To estimate the clustering quality for equal power subsets (including the same quantity of pairwise similar objects) the complex multiplicative criterion was calculated as a combination of Calinski-Harabasz and WB index criteria. External clustering quality criteria were calculated as a normalized difference of internal clustering quality criteria for equal power subsets. Final decision concerning the determination of optimal parameters of clustering algorithm operation has been done based on the maximum value of Harrington desirability function that takes into account both the character of the objects and clusters distribution in various clustering and the differences between clustering, which are implemented on equal power data subsets. To estimate the effectiveness of the proposed technology, the data set of lung cancer patients were used. This data set includes the gene expression profiles of 96 patients, 10 of which were healthy, and 86 patients were divided according to the degree of disease severity into three groups (well, moderate, poor). The results of the simulation allow us to propose the hybrid model of step-by-step process of gene expression profiles, whereby grouping is based on the complex use of clustering and biclustering algorithms.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    0
    References
    0
    Citations
    NaN
    KQI
    []