A like-for-like comparison of lightweight-mapping pipelines for single-cell RNA-seq data pre-processing

2021 
Recently, Booeshaghi and Pachter published a benchmark comparing the kallisto-bustools pipeline for single-cell data pre-processing to the alevin-fry pipeline. Their benchmarking adopted drastically dissimilar configurations for these two tools, and overlooked the time- and space-frugal configurations of alevin-fry previously benchmarked by Sarkar et al. In this manuscript, we provide a small set of modifications to the benchmarking scripts of Booeshaghi and Pachter that are necessary to perform a like-for-like comparison between kallisto-bustools and alevin-fry. We also address some misuses of the alevin-fry commands and include important data on the exact reference transcriptomes used for processing. Using the same benchmarking scripts of Booeshaghi and Pachter, we demonstrate that, when configured to match the computational complexity of kallisto-bustools as closely as possible, alevin-fry processes data faster ({approx}2.08 times as fast on average) and uses less peak memory ({approx}0.34 times as much on average) compared to kallisto-bustools, while producing results that are similar when assessed in the manner done by Booeshaghi and Pachter. This is a notable inversion of the performance characteristics presented in the previous benchmark.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    20
    References
    2
    Citations
    NaN
    KQI
    []