Abstract B1-12: RNA Architect: High performance tools to exhaustively detect cancer-specific structural events in RNA sequencing data

2015 
The transcriptomes of cancer cells differ structurally and numerically from normal tissue and between different types of cancer. However, a comprehensive large-scale analysis of structural transcriptomic events has been intractable so far due to computational limitations and the lack of an integrative software tool to detect a complete list of events. We developed an optimized computation pipeline, named RNA Architect, which enables the detection of such events from RNA sequencing data for thousands of cancer samples. RNA Architect detects events by discordant read pair and split-read analysis down to the resolution of individual breakpoints. Beside different kinds of fusion events (as in-frame, out of frame, fusions to intergenic regions), it reports a comprehensive overview of other transcriptional events including specific and novel splice-forms, inversions, cryptic and alternative splice sites, exon skips, exon re-usages, and early polyadenylation sites. In this poster, we describe the individual steps of the algorithm as well as our latest benchmarking results on a compilation of literature curated gene fusions: the approach shows a high sensitivity and recovers 92% of known fusion events. We also describe the post-processing of the data, which enables us to delineate relevant events from non-cancer specific ones and noise. We especially discuss optimizations implemented to run the software efficiently on a state of the art compute cluster. These optimizations enable the computation of a large amount of samples in a reasonable time frame (~ 8h on a 100 CPU system for the core split-read algorithm and ~3 hours for the discordant pair analysis on a 50 CPU system per sample). The high performance of the pipeline allows generating a comprehensive catalogue of all transcriptional events from large sets of cancer samples. RNA Architect is also fast enough to be used in time critical clinical settings as a potential diagnostic tool. Citation Format: Roland Chrisitian Arnold, Andrej Rosic, Reid JP Hayes, Adam Shlien. RNA Architect: High performance tools to exhaustively detect cancer-specific structural events in RNA sequencing data. [abstract]. In: Proceedings of the AACR Special Conference on Computational and Systems Biology of Cancer; Feb 8-11 2015; San Francisco, CA. Philadelphia (PA): AACR; Cancer Res 2015;75(22 Suppl 2):Abstract nr B1-12.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    0
    References
    0
    Citations
    NaN
    KQI
    []