Refactoring a statistical package for demanding memory loads: Adapting R for high performance telemetry data analytics

2020 
An R library application employing distinct resampling and multithreaded overlap routines demanded memory beyond the limits of a desktop workstation. Initial efforts revealed that the necessary memory allocations for the dataset output from the resampling exceeded that available on a high memory node, making code refactoring necessary. Subsequent in-depth analysis of the employed package revealed that by combining and multithreading bootstrap sampling and overlap calculations, the memory footprint was reduced to a manageable size. However, the computation time was substantially increased. Further development, decomposing the resampling step into two routines, minimized the impact of a rate-limiting step, restoring shorter runtimes while maintaining the smaller memory footprint.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    3
    References
    1
    Citations
    NaN
    KQI
    []