Integrative analysis of mutated genes and mutational processes reveals seven colorectal cancer subtypes

2020 
Colorectal cancer (CRC) is one of the leading causes of cancer-related deaths in the world. It has been reported that ~10%-15% of individuals with colorectal cancer experience a causative mutation in the known susceptibility genes, highlighting the importance of identifying mutations for early detection in high risk individuals. Through extensive sequencing projects such as the International Cancer Genome Consortium (ICGC), a large number of somatic point mutations have been identified that can be used to identify cancer-associated genes, as well as the signature of mutational processes defined by the 3-nucleotide sequence context (motif) of mutated sites. Mutated genes and motifs have also been used to cluster patients and identify cancer subtypes. In this study, we developed a statistical pipeline based on a novel concept "gene-motif", which merges mutated gene information with 3-bp sequence motif of mutated sites, to identify cancer subtypes, in this case CRCs. Our analysis identified for the first time, 3,131 gene-motif combinations that were significantly mutated in 536 ICGC colorectal cancer samples compared to other cancer types, identifying seven CRC subtypes with distinguishable phenotypes and biomarkers. Interestingly, we identified several genes that were mutated in multiple subtypes but with unique sequence contexts. Taken together, our results highlight the importance of considering both the mutation type and mutated genes in identification of cancer subtypes.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    0
    References
    0
    Citations
    NaN
    KQI
    []