Improving Start Codon Prediction Accuracy in Prokaryotic Organisms Using Naïve Bayesian Classification

Sean Landman

Improving Start Codon Prediction Accuracy in Prokaryotic Organisms Using Naïve Bayesian Classification

2010

Sean Landman

With an overwhelming amount of genetic data now becoming publicly available, there is a growing need to develop more effective gene prediction methods that produce reliable results. Although prediction of the stop codon location for genes in prokaryotic organisms is largely considered to be a solved problem, accurate prediction of the exact start codon location continues to lag behind because of the ambiguity for these start codons in the genetic code. This paper will detail a new approach to predicting more precise gene locations for both the start and stop codon in prokaryotic organisms. This approach uses gene prediction results from other prediction programs to find consistently predicted gene locations. It then uses these “consistent genes” as a training set for Naive Bayesian classification to improve accuracy in the “ambiguous genes,” those in which there is some variability or inconsistency in predicted locations between the prediction programs.

Keywords:

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations