This paper presents the utilization of progressive alignment principle for positional adjustment of a set of genomic signals with different lengths. The new method of multiple alignment of signals based on dynamic time warping is tested for the purpose of evaluating the similarity of different length genes in phylogenetic studies. Two sets of phylogenetic markers were used to demonstrate the effectiveness of the evaluation of intraspecies and interspecies genetic variability. The part of the proposed method is modification of pairwise alignment of two signals by dynamic time warping with using correlation in a sliding window. The correlation based dynamic time warping allows more accurate alignment dependent on local homologies in sequences without the need of scoring matrix or evolutionary models, because mutual similarities of residues are included in the numerical code of signals.
The origin of replication (oriC) plays an important role in the cell cycle as the place where DNA replication is initiated. In bacterial cells, a single replication origin can be found and its correct identification is necessary in the annotation process of newly sequenced genomes. Although the rearrangement of a whole genome sequence according to oriC should be a standard procedure, public databases still contain lots of genomes starting at a random place. This situation complicates the comparative analysis of whole bacterial genomes as only two genomes rearranged according to oriC can be reliably aligned. In this paper, we present a novel technique for oriC prediction based exclusively on utilization of cumulated phase signal which distinguishes our approach from current techniques combining application of genomic signal processing techniques with a standard character based comparison. Proposed technique is therefore fast and suitably complements the current pipeline for comparison of whole bacterial genomes by aligned downsampled signals.
Metallothioneins (MT) are a family of ubiquitous proteins, whose role is still discussed in numerous papers, but their affinity to some metal ions is undisputable. These cysteine-rich proteins are connected with antioxidant activity and protective effects on biomolecules against free radicals, especially reactive oxygen species. In this review, the connection between zinc(ii) ions, reactive oxygen species, heavy metal ions and metallothioneins is demonstrated with respect to effect of these proteins on cell proliferation and a possible negative role in resistance to heavy metal-based and non-heavy metal-based drugs.
Plants respond to heavy metal toxicity in a variety of different ways including synthesis of phytochelatins. The synthesis of phytochelatins is catalyzed by γ-Glu-Cys dipeptidyl transpeptidase named as phytochelatin synthase (PCS). The main aim of this study was to optimize high performance liquid chromatography coupled with electrochemical detector for determination of phytochelatin2. The optimized procedure was subsequently used for determination of the mentioned molecules with special attention aimed at the possibility to determine PC2 after activation of PCS in the tobacco BY-2 cells treated with different concentrations of cadmium(II) ions. The optimized conditions were as follows. Both the detector and the column were thermostated at 30 °C. Mobile phase consisted of A: trifluoroacetic acid (80 mM) and B: 100% methanol. Compounds were eluted by the following linear increasing gradient: 0-1 min (3 % of B), 1→12 min (20 % of B), 12→15 min (98 % of B), 15→20 min (98 % of B). Flow rate of the mobile phase was 1 ml min-1. Detection was carried out at applied potential 900 mV postponed on four electrodes. Time of one analysis was 15 minutes. The estimated detection limit for PC2 was 17 nM. In addition, the recoveries were from 93 to 98 %. Good precision was obtained with %C.V.s ranging from 5.3 to 7.5 % in the intra-assay. The inter-assay %C.V.s ranged from 8.9 to 10.2 %. Overall recoveries of were from 102 to 106 % (n = 30). Accuracy (%Bias) was about ±5 %. Further, the attention was aimed at determination of PC2 in BY-2 tobacco cells treated with cadmium(II) ions (0, 5, 10, 25, 50 and 100 µM). It clearly follow from the results obtained that the content of PC2 enhanced with increasing concentration of the substrate and with the applied concentration of cadmium(II) ions.
Comparison and classification of organisms based on molecular data is an important task of computational biology, since at least parts of DNA sequences for many organisms are available. Unfortunately, methods for comparison are computationally very demanding, suitable only for short sequences. In this paper, we focus on the redundancy of genetic information stored in DNA sequences. We proposed rules for downsampling of DNA signals of cumulated phase. According to the length of an original sequence, we are able to significantly reduce the amount of data with only slight loss of original information. Dyadic wavelet transform was chosen for fast downsampling with minimum influence on signal shape carrying the biological information. We proved the usability of such new short signals by measuring percentage deviation of pairs of original and downsampled signals while maintaining spectral power of signals. Minimal loss of biological information was proved by measuring the Robinson-Foulds distance between pairs of phylogenetic trees reconstructed from the original and downsampled signals. The preservation of inter-species and intra-species information makes these signals suitable for fast sequence identification as well as for more detailed phylogeny reconstruction.
Schlegelella thermodepolymerans is a moderately thermophilic bacterium capable of producing polyhydroxyalkanoates-biodegradable polymers representing an alternative to conventional plastics. Here, we present the first complete genome of the type strain S. thermodepolymerans DSM 15344 that was assembled by hybrid approach using both long (Oxford Nanopore) and short (Illumina) reads. The genome consists of a single 3,858,501-bp-long circular chromosome with GC content of 70.3%. Genome annotation identified 3,650 genes in total, whereas 3,598 open reading frames belonged to protein-coding genes. Functional annotation of the genome and division of genes into clusters of orthologous groups revealed a relatively high number of 1,013 genes with unknown function or unknown clusters of orthologous groups, which reflects the fact that only a little is known about thermophilic polyhydroxyalkanoates-producing bacteria on a genome level. On the other hand, 270 genes involved in energy conversion and production were detected. This group covers genes involved in catabolic processes, which suggests capability of S. thermodepolymerans DSM 15344 to utilize and biotechnologically convert various substrates such as lignocellulose-based saccharides, glycerol, or lipids. Based on the knowledge of its genome, it can be stated that S. thermodepolymerans DSM 15344 is a very interesting, metabolically versatile bacterium with great biotechnological potential.
Metagenomics became very popular after expansion of next-generation sequencing techniques that allowed simple implementation of extensive studies. With a target gene sequencing approach, an identification of organisms in a metagenome is quite effortless since only a small reference database of the particular gene is needed. Moreover, by counting the copies of individual genes, also quantitative analysis can be applied. Unfortunately, current bioinformatics tools aim mainly on the analysis of a single metagenome. A cluster analysis, a heatmap of correlation coefficients, biclustering or other statistics techniques can only show relations inside the metagenome or the relation between the metagenome composition and other parameters. On the other hand, there is a lack of tools to provide a comparative analysis of two or more metagenomes. Suitable properties for this kind of analysis can be found in a bipartite graph. Here, we present a novel workflow for finding the suitable representation of metagenomic data to provide a comparative analysis of metagenomes. The resulting graph can take into account information about the actual composition of the metagenome as well as the environment it relates to. Thus, it can provide different view of the data to the naked eye that can complement other techniques such as principal coordinate analysis.