Bamboo is one of the fastest-growing non-timber forest plants. Moso bamboo (Phyllostachys edulis) is the most economically valuable bamboo in Asia, especially in China. With the release of the whole-genome sequence of moso bamboo, there are increasing demands for refined annotation of bamboo genes. Recently, large amounts of bamboo transcriptome data have become available, including data on the multiple growth stages of tissues. It is now feasible for us to construct co-expression networks to improve bamboo gene annotation and reveal the relationships between gene expression and growth traits. We integrated the genome sequence of moso bamboo and 78 transcriptome data sets to build genome-wide global and conditional co-expression networks. We overlaid the gene expression results onto the network with multiple dimensions (different development stages). Through combining the co-expression network, module classification and function enrichment tools, we identified 1,896 functional modules related to bamboo development, which covered functions such as photosynthesis, hormone biosynthesis, signal transduction, and secondary cell wall biosynthesis. Furthermore, an online database (http://bioinformatics.cau.edu.cn/bamboo) was built for searching the moso bamboo co-expression network and module enrichment analysis. Our database also includes cis-element analysis, gene set enrichment analysis, and other tools. In summary, we integrated public and in-house bamboo transcriptome data sets and carried out co-expression network analysis and functional module identification. Through data mining, we have yielded some novel insights into the regulation of growth and development. Our established online database might be convenient for the bamboo research community to identify functional genes or modules with important traits.
Custom-designed nucleases, including CRISPR-Cas9 and CRISPR-Cpf1, are widely used to realize the precise genome editing.The high-coverage, low-cost and quantifiability make high-throughput sequencing (NGS) to be an effective method to assess the efficiency of custom-designed nucleases.However, contrast to standardized transcriptome protocol, the NGS data lacks a user-friendly pipeline connecting different tools that can automatically calculate mutation, evaluate editing efficiency and realize in a more comprehensive dataset that can be visualized.Here, we have developed an automatic stand-alone toolkit based on python script, namely CRISPRMatch, to process the high-throughput genome-editing data of CRISPR nuclease transformed protoplasts by integrating analysis steps like mapping reads and normalizing reads count, calculating mutation frequency (deletion and insertion), evaluating efficiency and accuracy of genome-editing, and visualizing the results (tables and figures).Both of CRISPR-Cas9 and CRISPR-Cpf1 nucleases are supported by CRISPRMatch toolkit and the integrated code has been released on GitHub (https://github.com/zhangtaolab/CRISPRMatch).
To identify liquid-liquid phase separation (LLPS)-related molecular clusters, and to develop and validate a novel index based on LLPS for predicting the prognosis of prostate cancer (PCa) patients. We download the clinical and transcriptome data of PCa from TCGA and GEO database. The LLPS-related genes (LRGs) were extracted from PhaSepDB. Consensus clustering analysis was used to develop LLPS-related molecular subtypes for PCa. The LASSO cox regression analysis was performed to establish a novel LLPS-related index for predicting biochemical recurrence (BCR)-free survival (BCRFS). Preliminary experimental verification was performed. We initially identified a total of 102 differentially expressed LRGs for PCa. Three LLPS related molecular subtypes were identified. Moreover, we established a novel LLPS related signature for predicting BCRFS of PCa patients. Compared to low-risk patients in the training cohort, testing cohort and validating cohort, high-risk populations meant a higher risk of BCR and significantly poorer BCRFS. The area under receiver operating characteristic curve were 0.728, 0.762, and 0.741 at 1 year in the training cohort, testing cohort and validating cohort. Additionally, the subgroup analysis indicated that this index was especially suitable for PCa patients with age ≤ 65, T stage III-IV, N0 stage or in cluster 1. The FUS, which was the potential biomarker related to PCa liquid-liquid phase separation, was preliminarily identified and verified. This study successfully developed three LLPS-related molecular subtypes and identified a novel LLPS related molecular signature, which performed well in predicting BCRFS of PCa.
The agriGO platform, which has been serving the scientific community for >10 years, specifically focuses on gene ontology (GO) enrichment analyses of plant and agricultural species. We continuously maintain and update the databases and accommodate the various requests of our global users. Here, we present our updated agriGO that has a largely expanded number of supporting species (394) and datatypes (865). In addition, a larger number of species have been classified into groups covering crops, vegetables, fish, birds and insects closely related to the agricultural community. We further improved the computational efficiency, including the batch analysis and P-value distribution (PVD), and the user-friendliness of the web pages. More visualization features were added to the platform, including SEACOMPARE (cross comparison of singular enrichment analysis), direct acyclic graph (DAG) and Scatter Plots, which can be merged by choosing any significant GO term. The updated platform agriGO v2.0 is now publicly accessible at http://systemsbiology.cau.edu.cn/agriGOv2/.
Background: This study aimed to explore the relationship between the fluorescence intensity of indocyanine green (ICG) in near-infrared fluorescence guided surgery (NIRFGS) and preoperative liver function indicators. Methods: A total of 12 4T1 tumor-bearing mice were used for model establishment. Intraperitoneal injection (i.p.) of 20% carbon tetrachloride (CCl4) corn oil solution (50 µL) was given to mice in the liver injury model group, 24 hours after injection, the model was established, while the control group received 0% CCl4 corn oil solution (50 µL) (n=6 for each group). Additionally, doses of 8 mg/kg and 1 mg/kg of free ICG were injected intravenously (i.v.) (n=3 in each group). Fluorescence was imaged in vivo using an NIR fluorescence imaging system at different time points (1, 2, 4, 8, 12, 24, 48, and 72 h) after injection. Results: The absolute fluorescence intensity of mice in the liver injury model group was stronger than that in the control group. Mice in the liver injury model group had the same clearance rate of ICG from the tumor as normal mice. However, the background clearance rate was slower than that of normal mice, which prolonged the optimal tumor to background ratio (TBR) time. Correlation analysis was also used to determine which preoperative liver function parameters were most correlated with hepatic ICG clearance. Conclusions: Liver injury does not significantly affect the maximum TBR, but prolongs the optimal TBR time, and at the same time, a wider and more stable surgical window will appear. This study showed that a prolonged surgical start time is feasible according to preoperative liver function testing using NIR fluorescence imaging technology.
Targeting specificity has been an essential issue for applying genome editing systems in functional genomics, precise medicine and plant breeding. Understanding the scope of off-target mutations in Cas9 or Cpf1-edited crops is critical for research and regulation. In plants, only limited studies had used whole-genome sequencing (WGS) to test off-target effects of Cas9. However, the cause of numerous discovered mutations is still controversial. Furthermore, WGS based off-target analysis of Cpf1 has not been reported in any higher organism to date. Here, we conducted a WGS analysis of 34 plants edited by Cas9 and 15 plants edited by Cpf1 in T0 and T1 generations along with 20 diverse control plants in rice, a major food crop with a genome size of ~380 Mb. The sequencing depth ranged from 45X to 105X with reads mapping rate above 96%. Our results clearly show that most mutations in edited plants were created by tissue culture process, which caused ~102 to 148 single nucleotide variations (SNVs) and ~32 to 83 insertions/deletions (indels) per plant. Among 12 Cas9 single guide RNAs (sgRNAs) and 3 Cpf1 CRISPR RNAs (crRNAs) assessed by WGS, only one Cas9 sgRNA resulted in off-target mutations in T0 lines at sites predicted by computer programs. Moreover, we cannot find evidence for bona fide off-target mutations due to continued expression of Cas9 or Cpf1 with guide RNAs in T1 generation. Taken together, our comprehensive and rigorous analysis of WGS big data across multiple sample types suggests both Cas9 and Cpf1 nucleases are very specific in generating targeted DNA modifications and off-targeting can be avoided by designing guide RNAs with high specificity .
CRISPR-Cas systems can be expressed in multiple ways, with different capabilities regarding tissue-specific expression, efficiency, and expression levels. Thus far, three expression strategies have been demonstrated in plants: mixed dual promoter systems, dual Pol II promoter systems, and single transcript unit (STU) systems. We explored a fourth strategy to express CRISPR-Cas9 in the model and crop plant, rice, where a bidirectional promoter (BiP) is used to express Cas9 and single guide RNA (sgRNA) in opposite directions. We first tested an engineered BiP system based on double-mini 35S promoter and an Arabidopsis enhancer, which resulted in 20.7% and 52.9% genome editing efficiencies at two target sites in T0 stable transgenic rice plants. We further improved the BiP system drastically by using a rice endogenous BiP, OsBiP1. The endogenous BiP expression system had higher expression strength and led to 75.9%-93.3% genome editing efficiencies in rice T0 generation, when the sgRNAs were processed by either tRNA or Csy4. We provided a proof-of-concept study of applying BiP systems for expressing two-component CRISPR-Cas9 genome editing reagents in rice. Our work could promoter future research and adoption of BiP systems for CRISPR-Cas based genome engineering in plants.
The Ethylene-responsive element binding factor-associated Amphiphilic Repression (EAR) motifs, which were initially identified in members of the Arabidopsis ethylene response factor (ERF) family, are transcriptional repression motifs in plants and are defined by the consensus sequence patterns of either LxLxL or DLNxxP. EAR motif-containing proteins can function as transcription repressors, thus interacting with co-repressors, such as TOPLESS and AtSAP18, affecting the structure of chromatin by histone modifications and thereby repressing gene transcription. EAR motif-containing proteins are highly conserved across diverse plant species and play important roles in hormone signal transduction, stress responses and development, but they have not been identified in most plants. In this study, we identified 20,542 EAR motif-containing proteins from 71 plants based on a Hidden Markov Model and orthologous gene search, and then we constructed a functional analysis platform for plant EAR motif-containing proteins (PlantEAR, http://structuralbiology.cau.edu.cn/plantEAR) by integrating a variety of functional annotations and processed data. Several tools were provided as functional support for EAR motif-containing proteins, such as browse, search, co-expression and protein-protein interaction (PPI) network analysis as well as cis-element analysis and gene set enrichment analysis (GSEA). In addition, basing on the identified EAR motif-containing proteins, we also explored their distribution in various species and found that the numbers of EAR motif-containing proteins showed an increasing trend in evolution from algae to angiosperms.