Multi-Approach Bioinformatics Analysis of Curated Omics Data Provides a Gene Expression Panorama for Multiple Cancer Types
2020
The number of studies describing expression patterns and biomarkers for the tumoral process grows every year. This availability of new datasets, although essential, also creates a confusing landscape in which common or critical mechanisms are obscured by the divergent and heterogeneous nature of the results. In this work, we manually curated the Gene Expression Omnibus using rigorous filtering criteria to select the most homogeneous, high-quality microarray and RNA-seq datasets from multiple types of cancer. By applying systems biology approaches combined with a machine learning analysis, we investigated molecular mechanisms that may be frequently deregulated in the tumoral process. Our multi-approach analysis of 99 curated datasets, comprising 5,406 samples, revealed 47 genes differentially expressed in all analyzed cancer types, all of which agreed with the validation using TCGA data. The results suggest that the tumoral process is more related to the overexpression of a core deregulated machinery than to the underexpression of a given gene set. Additionally, we identified previously undescribed gene expression similarities between different cancer types and performed an overall survival analysis using 20 cancer types. Finally, we were able to suggest a core regulatory mechanism that could be frequently deregulated.