A risk score system based on DNA methylation levels and a nomogram survival model for lung squamous cell carcinoma

2020 
: Lung squamous cell carcinoma (LSCC) is one of the primary types of non‑small cell lung carcinoma, and patients with recurrent LSCC usually have a poor prognosis. The present study was conducted to build a risk score (RS) system for LSCC. Methylation data on LSCC (training set) and on head and neck squamous cell carcinoma (validation set 2) were obtained from The Cancer Genome Atlas database, and GSE39279 (validation set 1) was retrieved from the Gene Expression Omnibus database. Differentially methylated protein‑coding genes (DMGs)/long non‑coding RNAs (DM‑lncRNAs) between recurrence‑associated samples and nonrecurrence samples were screened out using the limma package, and their correlation analysis was conducted using the cor.test() function. Following identification of the optimal combinations of DMGs or DM‑lncRNAs using the penalized package in R, RS systems were built, and the system with optimal performance was selected. Using the rms package, a nomogram survival model was then constructed. For the differentially expressed genes (DEGs) between the high‑ and low‑risk groups, pathway enrichment analysis was performed by Gene Set Enrichment Analysis. There were 335 DMGs and DM‑lncRNAs in total. Following screening out of the top 10 genes (aldehyde dehydrogenase 7 family member A1, chromosome 8 open reading frame 48, cytokine‑like 1, heat shock protein 90 alpha family class A member 1, isovaleryl‑CoA dehydrogenase, phosphodiesterase 3A, PNMA family member 2, SAM domain, SH3 domain and nuclear localization signals 1, thyroid hormone receptor interactor 13 and zinc finger protein 878) and 6 top lncRNAs, RS systems were constructed. According to Kaplan‑Meier analysis, the DNA methylation level‑based RS system exhibited the best performance. In combination with independent clinical prognostic factors, a nomogram survival model was built and successfully predicted patient survival. Furthermore, 820 DEGs between the high‑ and low‑risk groups were identified, and 3 pathways were identified to be enriched in this gene set. The 10‑DMG methylation level‑based RS system and the nomogram survival model may be applied for predicting the outcomes of patients with LSCC.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    47
    References
    7
    Citations
    NaN
    KQI
    []