Modeling Global and local Codon Bias with Deep Language Models

Masaki Stanley Fujimoto,Paul Bodily,Cole A. Lyman,Andrew J. Jacobsen,Quinn Snell,Mark J. Clement

Modeling Global and local Codon Bias with Deep Language Models

2017

Masaki Stanley Fujimoto
Paul Bodily
Cole A. Lyman
Andrew J. Jacobsen
Quinn Snell
Mark J. Clement

Codon bias, the usage patterns of synonymous codons for encoding a protein sequence as nucleotides, is a biological phenomenon that is not fully understood. Several methods exist to represent the codon bias of an organism: codon adaptation index (CAI) [1], individual codon usage (ICU), hidden stop codons (HSC) [2] and codon context (CC) [3]. These methods are often employed in the optimization of heterologous gene expression to increase the accuracy and rate of translation. They, however, have many shortcomings as they dont take into account the local and global context of a gene. We present a method for modeling global and local codon bias through deep language models that is more robust than current methods by providing more contextual information and long-range dependencies.

Keywords:

Bioinformatics
Computer science
Gene expression
Gene
Stop codon
Codon Adaptation Index
Codon usage bias
Phenomenon
Language model
Machine learning
a protein
biological phenomenon
Artificial intelligence
contextual information
Computational biology

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations