Structural Analysis of Arabidopsis thaliana Chromosome 5. I. Sequence Features of the 1.6 Mb Regions Covered by Twenty Physically Assigned P1 Clones

1997 
A total of 20 PI clones with an average insert size of 80 kb and each containing a marker(s) specifically mapped on chromosome 5 were isolated from a PI library of the Arabidopsis thaliana genome, and their nucleotide sequences were determined according to a shotgun-based strategy and precisely located on the physical map of chromosome 5 separately constructed. The total length of the sequenced regions were summed up to 1,621,245 bp. By comparison with the sequences in protein and EST databases and analysis with computer programs for gene modeling, a total of 347 potential protein-coding genes and/or gene seg- ments with known or predicted functions were identified. The positions of exons which do not exhibit any similarity to known genes were also predicted. An average density of the genes and/or gene segments as- signed so far is 1 gene/4,672 bp. Introns were identified in approximately 78% of the potential genes, and the average number and length of the introns per gene were 3.7 and 161 bp. The transcription level of the pre- dicted genes was roughly monitored by counting the numbers of identified Arabidopsis ESTs. The sequence data and gene information are available through the World Wide Web at http://www.kazusa.or.jp/arabi/.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    18
    References
    67
    Citations
    NaN
    KQI
    []