Polymorphic SNPs, short tandem repeats and structural variants are responsible for differential gene expression across C57BL/6 and C57BL/10 substrains

2021 
Mouse substrains are an invaluable model for understanding disease. We compared C57BL/6J, the most commonly used inbred mouse strain, with several closely related substrains. We performed whole genome sequencing and RNA-sequencing analysis on 9 C57BL/6 (B6) and 5 C57BL/10 (B10) substrains. We identified 352,631 SNPs, 109,096 INDELs, 150,344 short tandem repeats (STRs), 3,425 structural variants (SVs) and 2,826 differentially expressed genes (DEGenes) among these 14 strains. Of the SNPs, 312,981 (89%) distinguished the B6 and B10 lineages. These SNPs were clustered into 28 short segments that are likely due to introgressed haplotypes rather than new mutations. Outside of these introgressed regions, we identified 53 SVs, protein-truncating SNPs and frameshifting INDELs that were associated with DEGenes. Our results can be used for both forward and reverse genetic approaches, and illustrate how introgression and mutational processes give rise to differences among substrains.