Abstract Linking between genotype and phenotype is a fundamental goal in biology and requires robust data for both layers. The prominent increase in plant genome sequencing and comparisons of multiple related individuals, exposed the abundance of structural genomic variation and suggest that a single reference genome cannot represent the complete sequence diversity of a crop species, leading to the expansion of the pan-genome concept. For high-resolution forward genetics, this unprecedented access to genomic variation should be paralleled by availability and phenotypic characterization of genetic diversity, and effective integration between these layers. Here, we describe a multi-parental framework for trait dissection in melon, leveraging a novel pan-genome constructed for this crop. Melon ( Cucumis melo L.) is an important crop from the Cucurbitaceae family, which display extensive phenotypic variation available for breeding. A diverse core set of 25 founder lines ( MelonCore25 ) was sequenced using a combination of short and long-read technologies and their genomes were assembled de novo . The construction of a melon pan-genome exposed substantial variation in genome size and structure, including detection of ~300,000 structural variants and ~9 million SNPs. A half-diallel derived set of 300 F 2 populations representing all possible MelonCore25 parental combinations was constructed as framework for trait dissection through integration with the pan-genome. We demonstrate the potential of this unified framework for genetic analysis of various melon traits, including rind color and mottling pattern, fruit sugar content and resistance to fungal diseases. We anticipate that utilization of this integrated resource will enhance genetic dissection of important traits and accelerate melon breeding. Significance statement Pan-genomes aim to address the abundance of genome structural variation within species for improved genomic analyses. New pan-genome, constructed from de novo genome assemblies of 25 diverse melon ( Cucumis melo L.) accessions is integrated with a half-diallel derived set of 300 F2 populations representing all possible parental combinations. The potential of this unified multi-parental trait dissection framework for melon genetics and breeding is presented.
Pathogenic bacteria have evolved highly specialized systems to extract essential nutrients from their hosts. Mycobacterium tuberculosis (Mtb) scavenges lipids (cholesterol and fatty acids) to maintain infections in mammals but mechanisms and proteins responsible for the import of fatty acids in Mtb were previously unknown. Here, we identify and determine that the previously uncharacterized protein Rv3723/LucA, functions to integrate cholesterol and fatty acid uptake in Mtb. Rv3723/LucA interacts with subunits of the Mce1 and Mce4 complexes to coordinate the activities of these nutrient transporters by maintaining their stability. We also demonstrate that Mce1 functions as a fatty acid transporter in Mtb and determine that facilitating cholesterol and fatty acid import via Rv3723/LucA is required for full bacterial virulence in vivo. These data establish that fatty acid and cholesterol assimilation are inexorably linked in Mtb and reveals a key function for Rv3723/LucA in in coordinating thetransport of both these substrates.
Background: Nitrogen (N) fertilization in crop production significantly impacts ecosystems, often disrupting natural plant-microbe-soil interactions and causing environmental pollution. Our research tested the hypothesis that phylogenetically related perennial grasses might preserve rhizosphere management strategies conducive to a sustainable N economy for crops. Method: We analyzed the N cycle in the rhizospheres of 36 Andropogoneae grass species related to maize and sorghum, investigating their impacts on N availability and losses. This assay is supplemented with the collection and comparison of native habitat environment data for ecological inference as well as cross-species genomic and transcriptomic association analyses for candidate gene discovery. Result: Contrary to our hypothesis, all examined annual species, including sorghum and maize, functioned as N "Conservationists," reducing soil nitrification potential and conserving N. In contrast, some perennial species enhanced nitrification and leaching ("Leachers"). Yet a few other species exhibited similar nitrification stimulation effects but limited NO3- losses ("Nitrate Keepers"). We identified significant soil characteristics as influential factors in the eco-evolutionary dynamics of plant rhizospheres, and highlighted the crucial roles of a few transporter genes in soil N management and utilization. Conclusion: These findings serve as valuable guidelines for future breeding efforts for global sustainability.
Gff3 annotation files for 25 de-novo melon genomes discussed in publication. These are draft annotations based on lif-over from "Harukei-3" and "HS" melon genomes. Genome fasta files can be found in NCBI PRJNA726743
Abstract Pathogenic bacteria have evolved highly specialized systems to extract essential nutrients from their hosts and Mycobacterium tuberculosis (Mtb) scavenges lipids (cholesterol and fatty acids) to maintain infection in mammals. While the uptake of cholesterol by Mtb is mediated by the Mce4 transporter, the route(s) of uptake of fatty acids remain unknown. Here, we demonstrate that an uncharacterized protein LucA, integrates the assimilation of both cholesterol and fatty acids in Mtb. LucA interacts with subunits of the Mce1 and Mce4 complexes to coordinate the activities of these nutrient transporters. We also demonstrate that Mce1 functions as an important fatty acid transporter in Mtb and we determine that the integration of cholesterol and fatty acid transport by LucA is required for full bacterial virulence in vivo . These data establish that fatty acid and cholesterol assimilation are inexorably linked in Mtb and reveals a key role for LucA in coordinating both transport activities.
Mycobacterium tuberculosis (Mtb) imports and metabolizes fatty acids to maintain infection within human macrophages. Although this is a well-established paradigm, the bacterial factors required for fatty acid import are poorly understood. Previously, we found that LucA and Mce1 are required for fatty acid import in Mtb (Nazarova et al., 2017). Here, we identified additional Mtb mutants that have a reduced ability to import a fluorescent fatty acid substrate during infection within macrophages. This screen identified the novel genes as rv2799 and rv0966c as be necessary for fatty acid import and confirmed the central role for Rv3723/LucA and putative components of the Mce1 fatty acid transporter (Rv0200/OmamB, Rv0172/Mce1D, and Rv0655/MceG) in this process.
Abstract Assembled genomes and their associated annotations have transformed our study of gene function. However, each new assembly generates new gene models. Inconsistencies between annotations likely arise from biological and technical causes, including pseudogene misclassification, transposon activity, and intron retention from sequencing of unspliced transcripts. To evaluate gene model predictions, we developed reelGene, a pipeline of machine learning models focused on (1) transcription boundaries, (2) mRNA integrity, and (3) protein structure. The first two models leverage sequence characteristics and evolutionary conservation across related taxa to learn the grammar of conserved transcription boundaries and mRNA sequences, while the third uses conserved evolutionary grammar of protein sequences to predict whether a gene can produce a protein. Evaluating 1.8 million gene models in maize, reelGene found that 28% were incorrectly annotated or nonfunctional. By leveraging a large cohort of related species and through learning the conserved grammar of proteins, reelGene provides a tool for both evaluating gene model accuracy and genome biology.
SUMMARY Linking genotype with phenotype is a fundamental goal in biology and requires robust data for both. Recent advances in plant‐genome sequencing have expedited comparisons among multiple‐related individuals. The abundance of structural genomic within‐species variation that has been discovered indicates that a single reference genome cannot represent the complete sequence diversity of a species, leading to the expansion of the pan‐genome concept. For high‐resolution forward genetics, this unprecedented access to genomic variation should be paralleled and integrated with phenotypic characterization of genetic diversity. We developed a multi‐parental framework for trait dissection in melon ( Cucumis melo ), leveraging a novel pan‐genome constructed for this highly variable cucurbit crop. A core subset of 25 diverse founders ( MelonCore25 ), consisting of 24 accessions from the two widely cultivated subspecies of C. melo , encompassing 12 horticultural groups, and 1 feral accession was sequenced using a combination of short‐ and long‐read technologies, and their genomes were assembled de novo . The construction of this melon pan‐genome exposed substantial variation in genome size and structure, including detection of ~300 000 structural variants and ~9 million SNPs. A half‐diallel derived set of 300 F 2 populations, representing all possible MelonCore25 parental combinations, was constructed as a framework for trait dissection through integration with the pan‐genome. We demonstrate the potential of this unified framework for genetic analysis of various melon traits, including rind color intensity and pattern, fruit sugar content, and resistance to fungal diseases. We anticipate that utilization of this integrated resource will enhance genetic dissection of important traits and accelerate melon breeding.