Adjustment of RNA-Seq data for the effect of highly abundant transcripts: a case study in milk production (622.4)

2014 
During lactation, profound changes occur in the mammary gland and thousands of genes undergo differential regulation. RNA-Seq data analysis reveals that a tiny minority of milk genes accounts for the vast majority of total gene expression. This striking imbalance in transcript abundances poses a significant problem for data analysis: for example, low-abundance genes may be incorrectly identified as downregulated. To tackle this problem, we developed a ‘Dilution Adjustment Model’, which more accurately classifies changes in the levels of low-abundance transcripts between transcriptomes at two developmental stages (baseline and mature lactation). Applying this model to human and bovine milk data led us to reclassify 2,155 human (971 bovine) genes as ‘upregulated’ instead of ‘not differentially expressed’ and 2,524 human (1,732 bovine) genes as ‘unregulated’ instead of ‘downregulated’. These changes in gene classification were supported by analysis of Gene Ontology and TFBS enrichment profiles. Investigation of ChIP-Seq data showed genes ...
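The dilution problem described in the abstract can be illustrated with a toy calculation. The sketch below is not the authors' Dilution Adjustment Model, whose details are not given here; it only shows, with invented gene names and invented abundances, how one dominant transcript can make an unchanged low-abundance gene look downregulated under simple library-size scaling. The `exclude` argument is a hypothetical, generic workaround (dropping the dominant transcript from the denominator), used purely for illustration.

```python
def per_million(abundance, exclude=()):
    """Scale abundances to 'per million' of the library.

    Genes listed in `exclude` (a hypothetical parameter for this sketch)
    are left out of the denominator, but are still reported.
    """
    total = sum(v for gene, v in abundance.items() if gene not in exclude)
    return {gene: 1e6 * v / total for gene, v in abundance.items()}


# Invented absolute abundances (arbitrary units) at two stages.
# Only the milk gene changes; 'low_gene' and 'other' are constant.
baseline = {"milk_gene": 1_000, "low_gene": 100, "other": 8_900}
lactation = {"milk_gene": 990_000, "low_gene": 100, "other": 8_900}

# Naive scaling: the dominant transcript inflates the denominator,
# so the unchanged gene appears strongly downregulated.
naive_fc = per_million(lactation)["low_gene"] / per_million(baseline)["low_gene"]

# Illustrative adjustment: scale against everything except the dominant gene.
dominant = ("milk_gene",)
adjusted_fc = (
    per_million(lactation, exclude=dominant)["low_gene"]
    / per_million(baseline, exclude=dominant)["low_gene"]
)

print(f"naive fold change:    {naive_fc:.4f}")
print(f"adjusted fold change: {adjusted_fc:.4f}")
```

In this toy setting the naive fold change falls far below 1 even though the gene's absolute abundance is identical at both stages, while the adjusted fold change is 1. Real data would need a principled choice of which transcripts count as dominant, which is the part a dedicated model has to supply.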