Molecular characterization and sequecing analysis of SARS-CoV-2 genome in Minas Gerais, Brazil
Giulia Magalhães FerreiraIngra Morales ClaroVictória Riquena GroscheDarlan da Silva CândidoDiego Pandeló JoséE RochaThaís M. ColettiErika R. ManuliNelson GaburoNuno R. FariaÉster Cerdeira SabinoJaqueline Góes de JesusAna Carolina Gomes Jardim
2
Citation
48
Reference
10
Related Paper
Citation Trend
Keywords:
Pandemic
Coronavirus
Lineage (genetic)
Monophyly
Should we build our own phylogenetic trees based on gene sequence data, or can we simply use available synthesis phylogenies? This is a fundamental question that any study involving a phylogenetic framework must face at the beginning of the project. Building a phylogeny from gene sequence data (purpose-built phylogeny) requires more effort, expertise, and cost than subsetting an already available phylogeny (synthesis-based phylogeny). However, we still lack a comparison of how these two approaches to building phylogenetic trees influence common community phylogenetic analyses such as comparing community phylogenetic diversity and estimating trait phylogenetic signal. Here, we generated three purpose-built phylogenies and their corresponding synthesis-based trees (two from Phylomatic and one from the Open Tree of Life, OTL). We simulated 1,000 communities and 12,000 continuous traits along each purpose-built phylogeny. We then compared the effects of different trees on estimates of phylogenetic diversity (alpha and beta) and phylogenetic signal (Pagel's λ and Blomberg's K). Synthesis-based phylogenies generally yielded higher estimates of phylogenetic diversity when compared to purpose-built phylogenies. However, resulting measures of phylogenetic diversity from both types of phylogenies were highly correlated (Spearman's ρ > 0.8 in most cases). Mean pairwise distance (both alpha and beta) is the index that is most robust to the differences in tree construction that we tested. Measures of phylogenetic diversity based on the OTL showed the highest correlation with measures based on the purpose-built phylogenies. Trait phylogenetic signal estimated with synthesis-based phylogenies, especially from the OTL, was also highly correlated with estimates of Blomberg's K or close to Pagel's λ from purpose-built phylogenies when traits were simulated under Brownian motion. For commonly employed community phylogenetic analyses, our results justify taking advantage of recently developed and continuously improving synthesis trees, especially the Open Tree of Life.
Community
Cite
Citations (96)
The increasing availability of large genomic data sets as well as the advent of Bayesian phylogenetics facilitates the investigation of phylogenetic incongruence, which can result in the impossibility of representing phylogenetic relationships using a single tree. While sometimes considered as a nuisance, phylogenetic incongruence can also reflect meaningful biological processes as well as relevant statistical uncertainty, both of which can yield valuable insights in evolutionary studies. We introduce a new tool for investigating phylogenetic incongruence through the exploration of phylogenetic tree landscapes. Our approach, implemented in the R package treespace, combines tree metrics and multivariate analysis to provide low-dimensional representations of the topological variability in a set of trees, which can be used for identifying clusters of similar trees and group-specific consensus phylogenies. treespace also provides a user-friendly web interface for interactive data analysis and is integrated alongside existing standards for phylogenetics. It fills a gap in the current phylogenetics toolbox in R and will facilitate the investigation of phylogenetic results.
Computational phylogenetics
Tree (set theory)
Tree rearrangement
Toolbox
Phylogenetic comparative methods
Cite
Citations (168)
Horizontal gene transfer (HGT) is a common event in prokaryotic evolution.Therefore, it is very important to consider HGT in the study of molecular evolution of prokaryotes.This is true also for conducting computer simulations of their molecular phylogeny because HGT is known to be a serious disturbing factor for estimating their correct phylogeny.To the best of our knowledge, no existing computer program has generated a phylogenetic tree with HGT from an original phylogenetic tree.We developed a program called HGT-Gen that generates a phylogenetic tree with HGT on the basis of an original phylogenetic tree of a protein or gene.HGT-Gen converts an operational taxonomic unit or a clade from one place to another in a given phylogenetic tree.We have also devised an algorithm to compute the average length between any pair of branches in the tree.It defines and computes the relative evolutionary time to normalize evolutionary time for each lineage.The algorithm can generate an HGT between a pair of donor and acceptor lineages at the same evolutionary time.HGT-Gen is used with a sequence-generating program to evaluate the influence of HGT on the molecular phylogeny of prokaryotes in a computer simulation study.
Horizontal Gene Transfer
Computational phylogenetics
Tree (set theory)
Lineage (genetic)
Tree rearrangement
Cite
Citations (5)
Tree (set theory)
Phylogenomics
Cite
Citations (28)
The study of evolutionary relationships among organisms is vital in evolutionary biology. To reconstruct a reliable species phylogeny, one of the most important issues is to choose proper molecular markers and take full advantage of phylogenetic information contained in these markers. Intra-individual allele heterozygotes (IIAHs) have been commonly detected in intron phylogenetic studies. How to incorporate IIAHs into phylogenetic framework has been a focus in current studies. In this review, the conception, isolation, and analytic methods of IIAHs in phylogeny were summarized.
Phylogenetic relationship
Cite
Citations (0)
To understand the early evolution of the Metazoa, it is necessary to determine the correct phylogenetic status of diploblastic animals. Despite cladistic studies of morphological characters and recent molecular phylogenetic studies, it remains uncertain whether diploblasts are monophyletic or paraphyletic, and how the phyla of diploblasts are phylogenetically related. The heat shock protein 70 (Hsp70) sequences, because of their ubiquity and high degree of conservation, could provide a useful model for phylogenetic analysis. We have sequenced almost the entire nucleic acid sequence of cytoplasmic Hsp70 from eight diploblastic species. Our data support the monophyly of diploblastic animals. However, the phylogenetic relationships of the diploblast groups were not significantly resolved. Our phylogenetic trees also support the monophyly of Metazoa with high bootstrap values, indicating that animals form an extremely robust clade.
Monophyly
Paraphyly
Cite
Citations (107)
Abstract Should we build our own phylogenetic trees based on gene sequence data, or can we simply use available synthesis phylogenies? This is a fundamental question that any study involving a phylogenetic framework must face at the beginning of the project. Building a phylogeny from gene sequence data (purpose-built phylogeny) requires more effort, expertise, and cost than subsetting an already available phylogeny (synthesis-based phylogeny). However, we still lack a comparison of how these two approaches to building phylogenetic trees influence common community phylogenetic analyses such as comparing community phylogenetic diversity and estimating trait phylogenetic signal. Here, we generated three purpose-built phylogenies and their corresponding synthesis-based trees (two from Phylomatic and one from the Open Tree of Life [OTL]). We simulated 1,000 communities and 12,000 continuous traits along each purpose-built phylogeny. We then compared the effects of different trees on estimates of phylogenetic diversity (alpha and beta) and phylogenetic signal (Pagel’s λ and Blomberg’s K). Synthesis-based phylogenies generally yielded higher estimates of phylogenetic diversity when compared to purpose-built phylogenies. However, resulting measures of phylogenetic diversity from both types of phylogenies were highly correlated (Spearman’s ρ > 0.8 in most cases). Mean pairwise distance (both alpha and beta) is the index that is most robust to the differences in tree construction that we tested. Measures of phylogenetic diversity based on the OTL showed the highest correlation with measures based on the purpose-built phylogenies. Trait phylogenetic signal estimated with synthesis-based phylogenies, especially from the OTL, were also highly correlated with estimates of Blomberg’s K or close to Pagel’s λ from purpose-built phylogenies when traits were simulated under Brownian Motion. For commonly employed community phylogenetic analyses, our results justify taking advantage of recently developed and continuously improving synthesis trees, especially the Open Tree of Life.
Phylogenetic diversity
Trait
Phylogenetic comparative methods
Tree (set theory)
Cite
Citations (3)
The determination of the phylogenetic relationships among microorganisms has long relied primarily on gene sequence information. Given that prokaryotic organisms often lack morphological characteristics amenable to phylogenetic analysis, prokaryotic phylogenies, in particular, are often based on sequence data. In this work, we explore a new source of phylogenetic information, the distribution of protein structural domains within fully sequenced prokaryotic genomes. The evolution of the structural domains we use has been studied extensively, allowing us to base our phylogenetic methods on testable theoretical models of structural evolution. We find that the methods that produce reasonable phylogenetic relationships are indeed the methods that are most consistent with theoretical evolutionary models. This work represents, to our knowledge, the first such theoretically motivated phylogeny, as well as the first application of structural information to phylogeny on this scale. Our results have strong implications for the phylogenetic relationships among prokaryotic organisms and for the understanding of protein evolution as a whole.
Computational phylogenetics
Sequence (biology)
Cite
Citations (41)
Abstract Reconstructing the evolutionary relationships of species is a major goal in biology. Despite the increasing number of completely sequenced genomes, a large number of phylogenetic projects rely on targeted sequencing and analysis of a relatively small sample of marker genes. The selection of these phylogenetic markers should ideally be based on accurate predictions of their combined, rather than individual, potential to accurately resolve the phylogeny of interest. Here we present and validate a new phylogenomics strategy to efficiently select a minimal set of stable markers able to reconstruct the underlying species phylogeny. In contrast to previous approaches, our methodology does not only rely on the ability of individual genes to reconstruct a known phylogeny, but it also explores the combined power of sets of concatenated genes to accurately infer phylogenetic relationships of species not previously analyzed. We applied our approach to two broad sets of cyanobacterial and ascomycetous fungal species, and provide two minimal sets of six and four genes, respectively, necessary to fully resolve the target phylogenies. This approach paves the way for the informed selection of phylogenetic markers in the effort of reconstructing the tree of life.
Phylogenomics
Tree (set theory)
Cite
Citations (52)
Studies examining phylogenetic community structure have become increasingly prevalent, yet little attention has been given to the influence of the input phylogeny on metrics that describe phylogenetic patterns of co-occurrence. Here, we examine the influence of branch length, tree reconstruction method, and amount of sequence data on measures of phylogenetic community structure, as well as the phylogenetic signal (Pagel's λ) in morphological traits, using Trichoptera larval communities from Churchill, Manitoba, Canada. We find that model-based tree reconstruction methods and the use of a backbone family-level phylogeny improve estimations of phylogenetic community structure. In addition, trees built using the barcode region of cytochrome c oxidase subunit I (COI) alone accurately predict metrics of phylogenetic community structure obtained from a multi-gene phylogeny. Input tree did not alter overall conclusions drawn for phylogenetic signal, as significant phylogenetic structure was detected in two body size traits across input trees. As the discipline of community phylogenetics continues to expand, it is important to investigate the best approaches to accurately estimate patterns. Our results suggest that emerging large datasets of DNA barcode sequences provide a vast resource for studying the structure of biological communities.
DNA Barcoding
Tree (set theory)
Barcode
Cite
Citations (40)