Abstract Background To improve efficiency in high throughput protein structure determination, we have developed a database software package, Gene Composer, which facilitates the information-rich design of protein constructs and their codon engineered synthetic gene sequences. With its modular workflow design and numerous graphical user interfaces, Gene Composer enables researchers to perform all common bio-informatics steps used in modern structure guided protein engineering and synthetic gene engineering. Results An interactive Alignment Viewer allows the researcher to simultaneously visualize sequence conservation in the context of known protein secondary structure, ligand contacts, water contacts, crystal contacts, B-factors, solvent accessible area, residue property type and several other useful property views. The Construct Design Module enables the facile design of novel protein constructs with altered N- and C-termini, internal insertions or deletions, point mutations, and desired affinity tags. The modifications can be combined and permuted into multiple protein constructs, and then virtually cloned in silico into defined expression vectors. The Gene Design Module uses a protein-to-gene algorithm that automates the back-translation of a protein amino acid sequence into a codon engineered nucleic acid gene sequence according to a selected codon usage table with minimal codon usage threshold, defined G:C% content, and desired sequence features achieved through synonymous codon selection that is optimized for the intended expression system. The gene-to-oligo algorithm of the Gene Design Module plans out all of the required overlapping oligonucleotides and mutagenic primers needed to synthesize the desired gene constructs by PCR, and for physically cloning them into selected vectors by the most popular subcloning strategies. Conclusion We present a complete description of Gene Composer functionality, and an efficient PCR-based synthetic gene assembly procedure with mis-match specific endonuclease error correction in combination with PIPE cloning. In a sister manuscript we present data on how Gene Composer designed genes and protein constructs can result in improved protein production for structural studies.
Babesiosis is a tick-borne disease caused by eukaryotic Babesia parasites which are morphologically similar to Plasmodium falciparum, the causative agent of malaria in humans. Like Plasmodium, different species of Babesia are tuned to infect different mammalian hosts, including rats, dogs, horses and cattle. Most species of Plasmodium and Babesia possess an essential bifunctional enzyme for nucleotide synthesis and folate metabolism: dihydrofolate reductase-thymidylate synthase. Although thymidylate synthase is highly conserved across organisms, the bifunctional form of this enzyme is relatively uncommon in nature. The structural characterization of dihydrofolate reductase-thymidylate synthase in Babesia bovis, the causative agent of babesiosis in livestock cattle, is reported here. The apo state is compared with structures that contain dUMP, NADP and two different antifolate inhibitors: pemetrexed and raltitrexed. The complexes reveal modes of binding similar to that seen in drug-resistant malaria strains and point to the utility of applying structural studies with proven cancer chemotherapies towards infectious disease research.
The increasing dissemination of carbapenemases in Gram-negative bacteria has threatened the clinical usefulness of the β-lactam class of antimicrobials. A program was initiated to discover a new series of serine β-lactamase inhibitors containing a boronic acid pharmacophore, with the goal of finding a potent inhibitor of serine carbapenemase enzymes that are currently compromising the utility of the carbapenem class of antibacterials. Potential lead structures were screened in silico by modeling into the active sites of key serine β-lactamases. Promising candidate molecules were synthesized and evaluated in biochemical and whole-cell assays. Inhibitors were identified with potent inhibition of serine carbapenemases, particularly the Klebsiella pneumoniae carbapenemase (KPC), with no inhibition of mammalian serine proteases. Studies in vitro and in vivo show that RPX7009 (9f) is a broad-spectrum inhibitor, notably restoring the activity of carbapenems against KPC-producing strains. Combined with a carbapenem, 9f is a promising product for the treatment of multidrug resistant Gram-negative bacteria.
Deinococcus radiodurans RNA ligase (DraRnl) is a template-directed ligase that seals nicked duplexes in which the 3'-OH strand is RNA. DraRnl is a 342 amino acid polypeptide composed of a C-terminal adenylyltransferase domain fused to a distinctive 126 amino acid N-terminal module (a putative OB-fold). An alanine scan of the C domain identified 9 amino acids essential for nick ligation, which are located within nucleotidyltransferase motifs I, Ia, III, IIIa, IV and V. Seven mutants were dysfunctional by virtue of defects in ligase adenylylation: T163A, H167A, G168A, K186A, E230A, F281A and E305A. Four of these were also defective in phosphodiester formation at a preadenylylated nick: G168A, E230A, F281A and E305A. Two nick sealing-defective mutants were active in ligase adenylylation and sealing a preadenylylated nick, thereby implicating Ser185 and Lys326 in transfer of AMP from the enzyme to the nick 5'-PO(4). Whereas deletion of the N-terminal domain suppressed overall nick ligation and ligase adenylylation, it did not compromise sealing at a preadenylylated nick. Mutational analysis of 15 residues of the N domain identified Lys26, Gln31 and Arg79 as key constituents. Structure-activity relationships at the essential residues were determined via conservative substitutions. We propose that DraRnl typifies a new clade of polynucleotide ligases. DraRnl homologs are detected in several eukaryal proteomes.
The structural genomics effort at the Seattle Structural Genomics Center for Infectious Disease (SSGCID) requires the manipulation of large numbers of amino-acid sequences and the underlying DNA sequences which are to be cloned into expression vectors. To improve efficiency in high-throughput protein structure determination, a database software package, Gene Composer, has been developed which facilitates the information-rich design of protein constructs and their underlying gene sequences. With its modular workflow design and numerous graphical user interfaces, Gene Composer enables researchers to perform all common bioinformatics steps used in modern structure-guided protein engineering and synthetic gene engineering. An example of the structure determination of H1N1 RNA-dependent RNA polymerase PB2 subunit is given.