We collected and completely sequenced 28,469 full-length complementary DNA clones from Oryza sativa L. ssp. japonica cv. Nipponbare. Through homology searches of publicly available sequence data, we assigned tentative protein functions to 21,596 clones (75.86%). Mapping of the cDNA clones to genomic DNA revealed that there are 19,000 to 20,500 transcription units in the rice genome. Protein informatics analysis against the InterPro database revealed the existence of proteins presented in rice but not in Arabidopsis. Sixty-four percent of our cDNAs are homologous to Arabidopsis proteins.
Several genes/QTLs governing resistance/tolerance to abiotic and biotic stresses have been reported and mapped in rice. A QTL for submergence tolerance was found to be co-located with a major QTL for broadspectrum bacterial leaf blight (bs-blb) resistance on the long arm of chromosome 5 in indica cultivars FR13A and IET8585. Using the Nipponbare (japonica) and 9311 (indica) genome sequences, we identified, in silico, candidate genes in the chromosomal region [Kottapalli et al. (2006)]. Transcriptional profiling of FR13A and IET8585 using a rice 22K oligo array validated the above findings. Based on in silico analysis and arraying we observed that both cultivars respond to the above stresses through a common signaling system involving protein kinases, adenosine mono phosphate kinase, leucine rich repeat, PDZ/DHR/GLGF, and response regulator receiver protein. The combined approaches suggest that transcription factor EREBP on long arm of chromosome 5 regulates both submergence tolerance and blb resistance. Pyruvate decarboxylase and alcohol dehydrogenase, co-located in the same region, are candidate downstream genes for submergence tolerance at the seedling stage, and t-snare for bs-blb resistance. We also detected up-regulation of novel defense/stress-related genes including those encoding fumaryl aceto acetate (FAA) hydrolase, scramblase, and galactose oxidase, in response to the imposed stresses.
Rice (Oryza sativa L.) is a model organism for the functional genomics of monocotyledonous plants since the genome size is considerably smaller than those of other monocotyledonous plants. Although highly accurate genome sequences of indica and japonica rice are available, additional resources such as full-length complementary DNA (FL-cDNA) sequences are also indispensable for comprehensive analyses of gene structure and function. We cross-referenced 28.5K individual loci in the rice genome defined by mapping of 578K FL-cDNA clones with the 56K loci predicted in the TIGR genome assembly. Based on the annotation status and the presence of corresponding cDNA clones, genes were classified into 23K annotated expressed (AE) genes, 33K annotated non-expressed (ANE) genes, and 5.5K non-annotated expressed (NAE) genes. We developed a 60mer oligo-array for analysis of gene expression from each locus. Analysis of gene structures and expression levels revealed that the general features of gene structure and expression of NAE and ANE genes were considerably different from those of AE genes. The results also suggested that the cloning efficiency of rice FL-cDNA is associated with the transcription activity of the corresponding genetic locus, although other factors may also have an effect. Comparison of the coverage of FL-cDNA among gene families suggested that FL-cDNA from genes encoding rice- or eukaryote-specific domains, and those involved in regulatory functions were difficult to produce in bacterial cells. Collectively, these results indicate that rice genes can be divided into distinct groups based on transcription activity and gene structure, and that the coverage bias of FL-cDNA clones exists due to the incompatibility of certain eukaryotic genes in bacteria.