TEMPORARY REMOVAL: Intronic breakpoint signatures enhance detection and characterization of clinically relevant germline structural variants

2021 
The relevance of large copy number variants (CNVs) to hereditary disorders has been long recognized, and population sequencing efforts have chronicled many common structural variants (SVs). However, limited data are available on the clinical contribution of rare germline SVs. We performed a detailed characterization of SVs identified using targeted next generation sequencing. Across 50 genes associated with hereditary cancer and cardiovascular disorders, a minimum of 828 unique SVs were reported, including 584 fully characterized SVs. Almost 40% of CNVs were <5kb, with one in three deletions impacting a single exon. Additionally, 36 mid-range deletions/duplications (50-250bp), 21 mobile element insertions, six inversions, and 27 complex rearrangements were detected. This dataset was used to model SV detection in a bioinformatics pipeline solely relying on read depth, which revealed that genome sequencing (30X) allows detection of 71%, a 500X panel only targeting coding regions 53%, and exome sequencing (100X) less than 20% of characterized SVs. SVs accounted for 14.1% of all unique pathogenic variants, supporting the importance of SVs in hereditary disorders. Robust SV detection requires an ensemble of variant calling algorithms that utilize sequencing of intronic regions. These algorithms should use distinct data features representative of each class of mutational mechanism, including recombination between two sequences sharing high similarity, co-variants inserted between CNV breakpoints, and complex rearrangements containing inverted sequences.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    65
    References
    2
    Citations
    NaN
    KQI
    []