A smart method for spark using neural network for big data

2021 
Apache Spark, famously known for Big data handling ability, is a distributed open-source framework that utilizes the idea of distributed memory to process Big data. As the performance of the Spark is mostly being affected by the Spark predominant configuration parameters, it is challenging to achieve the optimal result from Spark. The current practice of tuning the parameters is ineffective, as it is performed manually. Manual tuning is challenging for large space of parameters and complex interactions with and among the parameters. This paper proposes a more effective, self-tuning approach subject to a neural network called Smart method for Spark using Neural Network for Big data (SSNNB) to avoid the disadvantages of manual tuning of the parameters. The paper has selected five predominant parameters with five different sizes of data to test the approach. The proposed approach has increased the speed of around 30% compared with the default parameter configuration.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    0
    References
    0
    Citations
    NaN
    KQI
    []