CASP assessments primarily rely on comparing predicted coordinates with experimental reference structures. However, errors in the reference structures can potentially reduce the accuracy of the assessment. This issue is particularly prominent in cryoEM-determined structures, and therefore, in the assessment of CASP15 cryoEM targets, we directly utilized density maps to evaluate the predictions. A method for ranking the quality of protein chain predictions based on rigid fitting to experimental density was found to correlate well with the CASP assessment scores. Overall, the evaluation against the density map indicated that the models are of high accuracy although local assessment of predicted side chains in a 1.52 Å resolution map showed that side-chains are sometimes poorly positioned. The top 136 predictions associated with 9 protein target reference structures were selected for refinement, in addition to the top 40 predictions for 11 RNA targets. To this end, we have developed an automated hierarchical refinement pipeline in cryoEM maps. For both proteins and RNA, the refinement of CASP15 predictions resulted in structures that are close to the reference target structure, including some regions with better fit to the density. This refinement was successful despite large conformational changes and secondary structure element movements often being required, suggesting that predictions from CASP-assessed methods could serve as a good starting point for building atomic models in cryoEM maps for both proteins and RNA. Loop modeling continued to pose a challenge for predictors with even short loops failing to be accurately modeled or refined at times. The lack of consensus amongst models suggests that modeling holds the potential for identifying more flexible regions within the structure.
Enzymes exist in ensembles of states that encode the energetics underlying their catalysis. Conformational ensembles built from 1231 structures of 17 serine proteases revealed atomic-level changes across their reaction states. By comparing the enzymatic and solution reaction, we identified molecular features that provide catalysis and quantified their energetic contributions to catalysis. Serine proteases precisely position their reactants in destabilized conformers, creating a downhill energetic gradient that selectively favors the motions required for reaction while limiting off-pathway conformational states. The same catalytic features have repeatedly evolved in proteases and additional enzymes across multiple distinct structural folds. Our ensemble-function analyses revealed previously unknown catalytic features, provided quantitative models based on simple physical and chemical principles, and identified motifs recurrent in nature that may inspire enzyme design.
RNA cryo-EM is a fast advancing field.It has solved RNA structures yet unsolved by x-ray crystallography such as the tetrahymena ribozyme and has pushed the boundaries of cryo-EM by resolving small molecules; down to 28kDa.We are now able to propose atomic models of RNA, guided by cryo-EM maps, ranging from 10Å resolution to below 3Å.However, these models should not be viewed as equal as various features of RNA, in particular the backbone, are resolved in a resolution-dependent manner.We have analyzed published and unpublished cryo-EM maps of RNA to systematically address the resolvability of various RNA features at different resolutions.Finally, we have explored RNA as a model system to examine the charge-dependent manner of electron scattering; revealing a key difference between interpreting x-ray and cryo-EM maps and the limitations and potential applications enabled by the charge-dependent nature of electron scattering.
Abstract Drug discovery campaigns against Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) are beginning to target the viral RNA genome 1, 2 . The frameshift stimulation element (FSE) of the SARS-CoV-2 genome is required for balanced expression of essential viral proteins and is highly conserved, making it a potential candidate for antiviral targeting by small molecules and oligonucleotides 3–6 . To aid global efforts focusing on SARS-CoV-2 frameshifting, we report exploratory results from frameshifting and cellular replication experiments with locked nucleic acid (LNA) antisense oligonucleotides (ASOs), which support the FSE as a therapeutic target but highlight difficulties in achieving strong inactivation. To understand current limitations, we applied cryogenic electron microscopy (cryo-EM) and the Ribosolve 7 pipeline to determine a three-dimensional structure of the SARS-CoV-2 FSE, validated through an RNA nanostructure tagging method. This is the smallest macromolecule (88 nt; 28 kDa) resolved by single-particle cryo-EM at subnanometer resolution to date. The tertiary structure model, defined to an estimated accuracy of 5.9 Å, presents a topologically complex fold in which the 5′ end threads through a ring formed inside a three-stem pseudoknot. Our results suggest an updated model for SARS-CoV-2 frameshifting as well as binding sites that may be targeted by next generation ASOs and small molecules.
The first RNA category of the CASP competition was only made possible because of the scientists who provided experimental structures to challenge the predictors.In this article, these scientists offer a unique and valuable analysis of both the successes and areas for improvement in the predicted models.All ten RNA-only targets yielded predictions topologically similar to experimentally determined structures.For one target, experimentalists were able to phase their X-ray diffraction data by molecular replacement, showing a potential application of structure predictions for RNA structural biologists.Recommended areas for improvement include: enhancing the accuracy in local interaction predictions and increased consideration of the experimental conditions such as multimerization, structure determination method, and time along folding pathways.The prediction of RNA-protein complexes remains the most significant challenge.Finally, given the intrinsic flexibility of many RNAs, we propose the consideration of ensemble models.