Extending R Boxplot Analysis to Big Data in Education

2015 
Big Data is the buzz word doing rounds in all areas of human existence be medical, social networks, research, it has also made inroads to education. The large size and complexity of datasets in Big Data need specialized statistical tools for analysis where R can come handy. This paper explores the analysis of Big Data in education using a contemporary statistical tool R. R provides multiple dimensions to statistical analysis of dataset, this paper however explores only the Box Plot feature to study the impact of outliers on the overall summary measure of the dataset. The feature of trimmed mean is incorporated to demonstrate its impact on outliers. The trimmed data set can be used in predictive analysis for a business intelligence prediction or educational context.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    7
    References
    4
    Citations
    NaN
    KQI
    []