MultiSeq: unifying sequence and structure data for evolutionary analysis

Elijah Roberts,John Eargle,Dan Wright,Zaida Ann Luthey-Schulten

MultiSeq: unifying sequence and structure data for evolutionary analysis

2006

Elijah Roberts
John Eargle
Dan Wright
Zaida Ann Luthey-Schulten

Background Since the publication of the first draft of the human genome in 2000, bioinformatic data have been accumulating at an overwhelming pace. Currently, more than 3 million sequences and 35 thousand structures of proteins and nucleic acids are available in public databases. Finding correlations in and between these data to answer critical research questions is extremely challenging. This problem needs to be approached from several directions: information science to organize and search the data; information visualization to assist in recognizing correlations; mathematics to formulate statistical inferences; and biology to analyze chemical and physical properties in terms of sequence and structure changes.

Keywords:

Phylogenetics
Multiple sequence alignment
Data model
Genetics
Structural alignment
Computational biology
Sequence alignment
Structural change
Bioinformatics
Information visualization
Information science
Biology
Pace
Mathematics
critical research
Human genome
Statistical inference
Computer graphics

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

317

Citations