Combination of several bioinformatics approaches for the identification of new putative glycosyltransferases in Arabidopsis.

2009 
Approximately 450 glycosyltransferase (GT) sequences have already been identified in the Arabidopsis genome. They are organized into 40 sequence-based families, but the vast majority of these gene products remain biochemically uncharacterized open reading frames. Given the complexity of the cell wall carbohydrate network, it can be inferred that some of the biosynthetic genes have not yet been identified by classical bioinformatics approaches. To identify new plant GT genes, we designed a bioinformatic strategy based on several remote homology detection methods that act at the 1D, 2D, and 3D levels. Together, these methods led to the identification of more than 150 candidate protein sequences. Among them, 20 are considered putative glycosyltransferases that should be investigated further, since known GT signatures were clearly identified.