Identification and Validation of Two Lung Adenocarcinoma-Development Characteristic Gene Sets for Diagnosing Lung Adenocarcinoma and Predicting Prognosis

Frontiers in Genetics (2020)

Cheng Liu Xiang Li Hua Shao Dan Li

Citation

Reference

Related Paper

Citation Trend

Abstract:

Background : Lung adenocarcinoma (LUAD) is one of the main types of lung cancer. Because of its low early diagnosis rate, poor late prognosis, and high mortality, it is of great significance to find biomarkers for diagnosis and prognosis. Methods : Five hundred and twelve LUADs from The Cancer Genome Atlas were used for differential expression analysis and short time-series expression miner (STEM) analysis to identify the LUAD-development characteristic genes. Survival analysis was used to identify the LUAD-unfavorable genes and LUAD-favorable genes. Gene set variation analysis (GSVA) was used to score individual samples against the two gene sets. Receiver operating characteristic (ROC) curve analysis and univariate and multivariate Cox regression analysis were used to explore the diagnostic and prognostic ability of the two GSVA score systems. Two independent data sets from Gene Expression Omnibus (GEO) were used for verifying the results. Functional enrichment analysis was used to explore the potential biological functions of LUAD-unfavorable genes. Results : With the development of LUAD, 185 differentially expressed genes (DEGs) were gradually upregulated, of which 84 genes were associated with LUAD survival and named as LUAD-unfavorable gene set. While 237 DEGs were gradually downregulated, of which 39 genes were associated with LUAD survival and named as LUAD-favorable gene set. ROC curve analysis and univariate/multivariate Cox proportional hazards analyses indicated both of LUAD-unfavorable GSVA score and LUAD-favorable GSVA score were a biomarker of LUAD. Moreover, both of these two GSVA score systems were an independent factor for LUAD prognosis. The LUAD-unfavorable genes were significantly involved in p53 signaling pathway, Oocyte meiosis, and Cell cycle. Conclusion : We identified and validated two LUAD-development characteristic gene sets that not only have diagnostic value but also prognostic value. It may provide new insight for further research on LUAD.

Keywords:

Univariate

Univariate analysis

Topics:

RNA modifications and cancer

Ferroptosis and cancer prognosis

10.3389/fgene.2020.565206

Cite

PDF

Comparing multivariate and univariate subject-specific reference regions for blood constituents in healthy persons.

Clinical Chemistry (1982)

Eugene K. Harris Toshio Yasaka M R Horton G Shakarji

We examined the comparative behavior of subject-specific multivariate and univariate reference regions, using both computer-generated data and serial (semi-annual) measurements of selected analytes in subjects from a large health-maintenance program. Univariate studies under both homeostatic and random-walk time-series models were helpful in defining expected results, but only the homeostatic model was used in multivariate as well as univariate forms. Analysis of the computer-generated data and the real biochemical series produced similar findings, which showed the multivariate subject-specific reference region to be much more conservative than corresponding univariate intervals. That is, a multidimensional point of p correlated observations is quite likely to lie within the individual's multivariate reference region (based on past observation vectors), even when one or more of the observations lie outside their separate reference intervals for that individual. One consequence of this high specificity against univariate false positives in a large surveillance program is a higher than expected proportion of positive multivariate vectors in which none of the values lie outside their univariate ranges. Thus, although the development of multivariate reference regions should be encouraged, they should be used in conjunction with, not instead of, univariate ranges.

Univariate

Univariate analysis

10.1093/clinchem/28.3.422

Cite

Citations (19)

MULTIVARIATE TREND TESTING OF LAKE WATER QUALITY¹

JAWRA Journal of the American Water Resources Association (1991)

Jim C. Loftis Charles H. Taylor Avis D. Newell Phillip L. Chapman

ABSTRACT: Multivariate methods of trend analysis offer the potential for higher power in detecting gradual water quality changes as compared to multiple applications of univariate tests. Simulation experiments were used to investigate the power advantages of multivariate methods for both linear model and Mann‐Kendall based approaches. The experiments focused on quarterly observations of three water quality variables with no serial correlation and with several different intervariable correlation structures. The multivariate methods were generally more powerful than the univariate methods, offering the greatest advantage in situations where water quality variables were positively correlated with trends in opposing directions. For illustration, both the univariate and multivariate versions of the Mann‐Kendall based tests were applied to case study data from several lakes in Maine and New York which have been sampled as part of EPA's long term monitoring study of acid precipitation effects.

Univariate

10.1111/j.1752-1688.1991.tb01446.x

Cite

Citations (36)

Outliers in multivariate time series

Biometrika (2000)

Ruey S. Tsay

This paper generalises four types of disturbance commonly used in univariate time series analysis to the multivariate case, highlights the differences between univariate and multivariate outliers, and investigates dynamic effects of a multivariate outlier on individual components. The effect of a multivariate outlier depends not only on its size and the underlying model, but also on the interaction between the size and the dynamic structure of the model. The latter factor does not appear in the univariate case. A multivariate outlier can introduce various types of outlier for the marginal component models. By comparing and contrasting results of univariate and multivariate outlier detections, one can gain insights into the characteristics of an outlier. We use real examples to demonstrate the proposed analysis.

Univariate

10.1093/biomet/87.4.789

Cite

Citations (168)

New multivariate orderings based on conditional distributions

Applied Stochastic Models in Business and Industry (2011)

Félix Belzunce Julio Mulero José M. Ruiz Alfonso Suárez‐Llorens

In this paper, we propose and study new multivariate extensions of the dispersive, right‐spread, decreasing mean residual life and new better than used in expectation univariate orders. These new orders are based on the comparison of univariate marginal distributions conditional on survival data for the rest of the components. Relationships among multivariate orders and applications to some multivariate random vectors are also provided. Copyright © 2011 John Wiley & Sons, Ltd.

Univariate

Univariate distribution

Conditional expectation

10.1002/asmb.924

Cite

Citations (2)

Theory and practice of multivariate arma forecasting

Journal of Forecasting (1984)

Trond Riise Dag Tjozstheim

Abstract We compare univariate and multivariate forecasts based on ARMA models. In theory we cannot do worse by using a multivariate model instead of a univariate one, but we can risk getting no improvement. Conditions for no improvements are discussed as well as cases where large improvements occur. The effect of estimated parameters is examined and found to be small granted that a good method of estimation is used. However, multivariate models could be very sensitive to structural changes. This is illustrated via an example involving monetary data, where the multivariate forecasts perform considerably worse than the univariate ones. This seems to put a limitation on the use of multivariate ARMA forecasting models.

Univariate

10.1002/for.3980030308

Cite

Citations (41)

When univariate model-free time series prediction is better than multivariate

Physics Letters A (2016)

Masayoshi Chayama Yoshito Hirata

Univariate

10.1016/j.physleta.2016.05.027

Cite

Citations (23)

Forecasting of Human Development Index of Latin American Countries Through Data Mining Techniques

IEEE Latin America Transactions (2017)

Celso Bilynkievycz dos Santos Bruno Pedroso Alaine Margarete Guimarães Déborah Ribeiro Carvalho Luiz Alberto Pìlatti

Aim: Predict the Human Development Index (HDI) of 2013 and 2014 of Latin American countries through forecast data mining techniques. Methodology: Full stages of Knowledge Discovery in Databases applied in univariate and multivariate time series. For the prediction, the predicting abilities of 90 predicting models were tested, distributed in two global multivariate, 44 specific multivariate per country and 44 univariate. The algorithm SMOReg was adopted in the development of models as it presented a better performance among the learning algorithms based on functions tested in the experiment. Results: It was observed that the predictions of the models did not present significant statistical differences from the HDI tendencies disclosed in the last report of the United Nations Development Program. Nevertheless, the global multivariate models presented better quality measures in the predictions. Conclusion: The HDI prediction models used with multivariate time series provide better learning of algorithms with the increase of different univariate historical experiences.

Univariate

Human Development Index

10.1109/tla.2017.8015082

Cite

Citations (3)

Forecasting precious metal returns with multivariate random forests

Empirical Economics (2018)

Christian Pierdzioch Marian Risse

Univariate

Precious metal

Sample (material)

10.1007/s00181-018-1558-9

Cite

Citations (44)

A multivariate approach to a specific problem of grouping maize cultivars

South African Journal of Plant and Soil (1990)

Marie F. Smith

The use of multivariate techniques in the analysis of multivariate problems is illustrated by comparing the results of univariate and multivariate techniques applied to the problem of establishing the nutritional requirements of, and the acid tolerance differences between maize cultivars. Forty-eight maize cultivars were statistically separated into three groups, tolerant, intermediate and intolerant, using a univariate approach. A principal components analysis was then carried out to study the grouping at a multivariate level. The variates included were grain yield, plant height and ten leaf chemical analyses: Al, Mg, P, Ca, K, Mn, Zn, Fe, N and Cu. A non-hierarchical classification was applied to classify cultivars into the three tolerance classes. The univariate method resulted in different groupings for each variate under study, while the multivariate approach ensured one single classification of all cultivars into the three groups.

Univariate

10.1080/02571862.1990.10634561

Cite

Citations (5)

Multivariate rank-based forecast combining techniques

Technical reports (1999)

Matthias Klapper

We analyze macroeconomic data using univariate and multivariate forecast combining techniques. We simulate forecast errors with different variance-covariance structures. The simulations are used to compare the performance of univariate and multivariate combining techniques.

Univariate

Rank (graph theory)

Source

Cite

Citations (1)