Nobuyuki Kawagashira

Yokohama City University

Author Statistics

Papers

Citation

H-Index

i-10 index

Research Trends

Author Order

Document Type

Co-Authors

Piero Carninci

RIKEN Center for Integrative Medical Sciences

Jun Kawai

Kyoto University

Shoshi Kikuchi

Naturalis Biodiversity Center

Yoshihide Hayashizaki

RIKEN Center for Integrative Medical Sciences

Shinji Kondo

Research Organization of Information and Systems

Yasuhiro Ohtomo

Tohoku Gakuin University

Kazuo Murakami

Meijo University

Yoshihide Hayashizaki

Beth Israel Deaconess Medical Center

Kazuhiro Shibata

Gifu University

Takahiro Arakawa

Tokyo Medical and Dental University

Cooperative Institutions

RIKEN Center for Integrative Medical Sciences

RIKEN

The University of Tokyo

Institute of Agrobiological Sciences

Osaka University

Foundation for Advancement of International Science

Hitachi (Japan)

University of Queensland

Harvard University

University of Tsukuba

Author Statistics

Papers

Citation

H-Index

i-10 index

Research Field

[Orthologue, paralogue and comparative genomics for gene network analysis].

Tanpakushitsu kakusan koso. Protein nucleic acid enzyme (2004)

Nobuyuki Kawagashira

Comparative Genomics

Genome Biology

Source

Cite

Citations (0)

Multiple Zinc Finger Motifs with Comparison of Plant and Insect

Proceedings Genome Informatics Workshop/Genome informatics (2001)

Nobuyuki Kawagashira Yasuhiro Ohtomo Kazuo Murakami Kenichi Matsubara Jun Kawai

A protein is a sequence of amino acidal residue. Usually a sequence of amino acids in one protein is divided into several subsequences, which is thought to be an independent component or region. They are called motifs or domains. Zinc fingers are motifs that has a unique structure capturing a zinc ion in the core with several (usually four) amino acid residues, which are cysteines or histidines in most cases. Zinc fingers are kinds of transcription factors because they connect to the specific DNA sequence, so they are called DNA-binding proteins [2]. In this research protein data of three species are used for motif search: Oryza sativa and Arabidopsis thaliana for plant, and Drosophila melanogaster for insect. Protein data of A. thaliana and D. melanogaster is obtained from GenBank FTP site. Protein data of O. sativa is extracted as the open reading frame (ORF) from cDNA sequence, which is sequenced by Rice Full-Length cDNA Sequencing Project in National Institute of Agrobiological Sciences (NIAS), FAIS, and RIKEN. We selected 13,919 cDNA sequences and extracted 13,554 proteins as ORF’s from them [1]. These data are not yet public.

Sequence motif

PHD finger

Protein sequencing

Melanogaster

Structural motif

10.11234/gi1990.12.368

Cite

Citations (9)

Collection, Mapping, and Annotation of Over 28,000 cDNA Clones from japonica Rice

Science (2003)

Shoshi Kikuchi Kouji Satoh Toshifumi Nagata Nobuyuki Kawagashira Koji Doi

We collected and completely sequenced 28,469 full-length complementary DNA clones from Oryza sativa L. ssp. japonica cv. Nipponbare. Through homology searches of publicly available sequence data, we assigned tentative protein functions to 21,596 clones (75.86%). Mapping of the cDNA clones to genomic DNA revealed that there are 19,000 to 20,500 transcription units in the rice genome. Protein informatics analysis against the InterPro database revealed the existence of proteins presented in rice but not in Arabidopsis. Sixty-four percent of our cDNAs are homologous to Arabidopsis proteins.

Homology

genomic DNA

10.1126/science.1081288

Cite

Citations (873)

Wavelet profiles: their application in Oryza sativa DNA sequence analysis

Nobuyuki Kawagashira Yasuhiro Ohtomo K Murakami Keiko Matsubara Jun Kawai

Here we introduce our application of the wavelet analysis method to DNA sequences. In the signal processing field, Fourier transform is popular for analyzing wave data. However, although this method can process frequency information, it fails to handle locational data. In contrast, the wavelet method accommodates both locational and frequency information for wave analysis. The wavelet method is now increasing in its importance for signal processing. Fast Fourier transform is already applied to biological sequence analysis using correlations. We introduce a new method, called wavelet profile, for biological sequence analysis. Our method is based on multiresolution analysis of wavelet transform, offering data decomposition in several scaling at the same time. We applied our wavelet profile method to identifying gene loci among O. sativa genomic sequences.

Harmonic wavelet transform

Second-generation wavelet transform

Stationary wavelet transform

Multiresolution analysis

10.1109/csb.2002.1039368

Cite

Citations (7)

The Transcriptional Landscape of the Mammalian Genome

Science (2005)

Piero Carninci Takeya Kasukawa Shintaro Katayama Julian Gough Martin C. Frith

This study describes comprehensive polling of transcription start and termination sites and analysis of previously unidentified full-length complementary DNAs derived from the mouse genome. We identify the 5' and 3' boundaries of 181,047 transcripts with extensive variation in transcripts arising from alternative promoter usage, splicing, and polyadenylation. There are 16,247 new mouse protein-coding transcripts, including 5154 encoding previously unidentified proteins. Genomic mapping of the transcriptome reveals transcriptional forests, with overlapping transcription on both strands, separated by deserts in which few transcripts are observed. The data provide a comprehensive platform for the comparative analysis of mammalian transcriptional regulation in differentiation and development.

Transcription

10.1126/science.1112014

Cite

Citations (3,454)