logo
    Mutations within the Open Reading Frame (ORF) including Ochre stop codon of the Surface Glycoprotein gene of SARS-CoV-2 virus erase potential seed location motifs of human non-coding microRNAs
    0
    Citation
    52
    Reference
    10
    Related Paper
    Abstract:
    Abstract MicroRNA are short and non-coding RNA, 18-25 nucleotides in length. They are produced at the early stage of viral infection. The roles played by cellular miRNAs and miRNA-mediated gene-silencing in the COVID-19 epidemic period is critical in order to develop novel therapeutics. We analysed SARS-CoV-2 Surface Glycoprotein (S) nucleotide sequence originating from India as well as Iran, Australia, Germany, Italy, Russia, China, Japan and Turkey and identified mutation in potential seed location of several human miRNA. Seventy single nucleotide polymorphisms (SNP) were detected in the S gene out of which, 36, 32 and 2 were cases of transitions, transversions and deletions respectively. Eleven human miRNA targets were identified on the reference S gene sequence with a score >80 in the miRDB database. Mutation A845S erased a common binding site of 7 human miRNA (miR-195-5p, miR-16-5p, miR-15b-5p, miR-15a-5p, miR-497-5p, miR-424-5p and miR-6838-5p). A synonymous mutation altered the wild type Ochre stop codon within the S gene sequence (Italy) to Opal thereby changing the seed sequence of miR-511-3p. Similar (synonymous) mutations were detected at amino acid position 659 and 1116 of the S gene where amino acids serine and threonine were retained, abolishing potential seed location for miR-219a-1-3p and miR-20b-3p respectively. The significance of this finding in reference to the strategy to use synthetic miRNA combinations as a novel therapeutic tool is discussed.
    Keywords:
    Coding region
    Silent mutation
    Stop codon
    In-frame stop codons normally signal termination during mRNA translation, but they can be read as 'sense' (readthrough) depending on their context, comprising the 6 nt preceding and following the stop codon. To identify novel contexts directing readthrough, under-represented 5′ and 3′ stop codon contexts from Saccharomyces cerevisiae were identified by genome-wide survey in silico . In contrast with the nucleotide bias 3′ of the stop codon, codon bias in the two codon positions 5′ of the termination codon showed no correlation with known effects on stop codon readthrough. However, individually, poor 5′ and 3′ context elements were equally as effective in promoting stop codon readthrough in vivo , readthrough which in both cases responded identically to changes in release factor concentration. A novel method analysing specific nucleotide combinations in the 3′ context region revealed positions +1,2,3,5 and +1,2,3,6 after the stop codon were most predictive of termination efficiency. Downstream of yeast open reading frames (ORFs), further in-frame stop codons were significantly over-represented at the +1, +2 and +3 codon positions after the ORF, acting to limit readthrough. Thus selection against stop codon readthrough is a dominant force acting on 3′, but not on 5′, nucleotides, with detectable selection on nucleotides as far downstream as +6 nucleotides. The approaches described can be employed to define potential readthrough contexts for any genome.
    Stop codon
    ORFS
    Release factor
    Citations (89)
    Summary Integrons are genetic elements able to capture anti‐biotic resistance and other genes and to promote their transcription. Here, we have investigated integron‐dependent translation of an aminoglycoside 6 ′ ‐N‐acetyltransferase gene ( aac(6 ′ )‐Ib 7 ) inserted at the attI1 site. N‐terminal sequencing revealed that translation of this gene was initiated at a GTG codon, which is not part of a plausible translation initiation region (TIR). A short open reading frame (called ORF‐11) overlapping the attI1 site was probed by site‐directed mutagenesis for its contribution to aac(6 ′ )‐Ib 7 translation. When ORF‐11 and its TIR were deleted en bloc , translational efficiency dropped by over 80%, as determined with an acetyltransferase– luciferase fusion product. Invalidation of the ATG start codon of ORF‐11 or its putative Shine–Dalgarno sequence resulted in a decrease of over 60%, whereas the decrease was much less pronounced when the amino acid sequence of the putative ORF‐11‐encoded peptide was altered or when the distance between ORF‐11 and aac(6 ′ )‐Ib 7 was doubled. This demonstrates that aac(6 ′ )‐Ib 7 translation is dependent upon the translation of ORF‐11, but almost certainly not upon the corresponding peptide. These results lead us to conclude that an intrinsic short ORF present in the 5 ′ ‐conserved segment of many class 1 integrons may substantially enhance expression at the translational level of captured TIR‐deficient anti‐biotic resistance genes.
    Stop codon
    Eukaryotic translation
    Reading frame
    ORFS
    Translational regulation
    The min 4 region of the Escherichia coli genome contains genes (lpxA and lpxB) that encode proteins involved in lipid A biosynthesis. We have determined the sequence of 1,350 base pairs of DNA upstream of the lpxB gene. This fragment of DNA contains the complete coding sequence for the 28.0-kilodalton lpxA gene product and an upstream open reading frame capable of encoding a 17-kilodalton protein (ORF17). In addition there appears to be an additional open reading frame (ORF?) immediately upstream of ORF17. The initiation codon for lpxA is a GUG codon, and the start codon for ORF17 is apparently a UUG codon. The start and stop codons overlap between ORF? and ORF17, ORF17 and lpxA, and lpxA and lpxB. This overlap is suggestive of translational coupling and argues that the genes are cotranscribed. Crowell et al. (D.N. Crowell, W.S. Reznikoff, and C.R.H. Raetz, J. Bacteriol. 169:5727-5734, 1987) and Tomasiewicz and McHenry (H.G. Tomasiewicz and C.S. McHenry, J. Bacteriol. 169:5735-5744, 1987) have demonstrated that there are three similarly overlapping coding regions downstream of lpxB including dnaE, suggesting the existence of a complex operon of at least seven genes: 5'-ORF?-ORF17-lpxA-lpxB-ORF23-dnaE-ORF37-3 '.
    Stop codon
    Coding region
    Genome segment 9 (Seg-9) of Colorado tick fever virus (CTFV) is 1884 bp long and contains a large open reading frame (ORF; 1845 nt in length overall), although a single in-frame stop codon (at nt 1052–1054) reduces the ORF coding capacity by approximately 40 %. However, analyses of highly conserved RNA sequences in the vicinity of the stop codon indicate that it belongs to a class of ‘leaky terminators’. The third nucleotide positions in codons situated both before and after the stop codon, shows the highest variability, suggesting that both regions are translated during virus replication. This also suggests that the stop signal is functionally leaky, allowing read-through translation to occur. Indeed, both the truncated ‘termination’ protein and the full-length ‘read-through’ protein (VP9 and VP9′, respectively) were detected in CTFV-infected cells, in cells transfected with a plasmid expressing only Seg-9 protein products, and in the in vitro translation products from undenatured Seg-9 ssRNA. The ratios of full-length and truncated proteins generated suggest that read-through may be down-regulated by other viral proteins. Western blot analysis of infected cells and purified CTFV showed that VP9 is a structural component of the virion, while VP9′ is a non-structural protein.
    Stop codon
    Coding region
    Citations (20)
    We have determined a sequence of 2073 bp from two recombinant plasmids carrying the whole spoIIA locus from Bacillus subtilis, the expression of which is required for spore formation. The sequence contains three long open reading frames (ORFs), each of them being preceded by a ribosome binding site. These three putative proteins (mol. wts 13100, 16300 and 22200) are likely to be expressed and are probably encoded on the same mRNA. The stop codon of ORF1 overlaps with the start codon of ORF2 suggesting that there might be translational coupling between the two ORFs. Although some known promoter sequences were found, the only one upstream from the first open reading frame is about 260 bp from it.
    ORFS
    Shine-Dalgarno sequence
    Ribosomal binding site
    Stop codon
    Citations (125)
    Set three reading frames artificially in 5′ untranslated region (5′ UTR) of bacterial and archaeal genomes and analyse the frequencies distribution of triplet AUG in the three reading frames respectively. There is an obviously AUG depletion in reading frame 1 in both bacterial and archaeal genomes. AUG depletion may be an indication that upstream AUG can probably disturb the translation initiation of bacteria and archaea. The authors found that majority of AUGs in the upstream sequences have in-frame stop codons in the upstream sequences and form upstream open reading frame (uORF), only a fraction of AUGs in the upstream have no in-frame stop codon in the upstream sequences, especially in reading frame 1. The authors call them upstream AUGs (uAUGs). In bacterial genomes, more uAUGs in reading frame 1 can be found in the genes having a Shine-Dalgarno sequence. Further analyses show synonymous codon usage bias is weaker of the sequences leading by uAUGs than the codon sequences. It may be an indication that the synonymous codon usage bias is an important determinant of translation initiation accurately in bacteria and archaea.
    Eukaryotic translation
    Stop codon
    Citations (0)