SEGtool: a specifically expressed gene detection tool and applications in human tissue and single-cell sequencing data

2018 
Different tissues and diseases have distinct transcriptional profilings with specifically expressed genes (SEGs). So, the identification of SEGs is an important issue in the studies of gene function, biological development, disease mechanism and biomarker discovery. However, few accurate and easy-to-use tools are available for RNA sequencing (RNA-seq) data to detect SEGs. Here, we presented SEGtool, a tool based on fuzzy c-means, Jaccard index and greedy annealing method for SEG detection automatically and self-adaptively ignoring data distribution. Testing result showed that our SEGtool outperforms the existing tools, which was mainly developed for microarray data. By applying SEGtool to Genotype-Tissue Expression (GTEx) human tissue data set, we detected 3181 SEGs with tissue-related functions. Regulatory networks reveal tissue-specific transcription factors regulating many SEGs, such as ETV2 in testis, HNF4A in liver and NEUROD1 in brain. Applied to a case study of single-cell sequencing (SCS) data from embryo cells, we identified many SEGs in specific stages of human embryogenesis. Notably, SEGtool is suitable for RNA-seq data and even SCS data with high specificity and accuracy. An implementation of SEGtool R package is freely available at http://bioinfo.life.hust.edu.cn/SEGtool/.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    56
    References
    17
    Citations
    NaN
    KQI
    []