Abstract Due to computational resource limitations, in mass spectrometry based proteomics only a limited set of peptide sequences is used for the matching against measured spectra. We present an approach to represent proteins by graphs and allow not only the canonical sequences but also known isoforms and annotated amino acid variations, e.g. originating from genomic mutations, and further common protein sequence features contained in Uniprot KB or other protein databases. Our C++ and Python implementation enables a groundbreaking comprehensive characterization of the peptide search space, encompassing for the first time all available annotations in a protein database (in combination more than $10^{200}$ possibilities). Additionally, it can be used to quickly extract the relevant subset of the search space for peptide to spectrum matching, e.g. filtering by the peptide mass. We demonstrate the advantages and innovative findings of our implementation compared to previous workflows by re-analysing publicly available datasets.
(1) Background: Neuroblastomas (NBs) are the most common extracranial solid tumors of children. The amplification of the Myc-N proto-oncogene (MYCN) is a major driver of NB aggressiveness, while high expression of the neurotrophin receptor NTRK1/TrkA is associated with mild disease courses. The molecular effects of NTRK1 signaling in MYCN-amplified NB, however, are still poorly understood and require elucidation. (2) Methods: Inducible NTRK1 expression was realized in four NB cell lines with (IMR5, NGP) or without MYCN amplification (SKNAS, SH-SY5Y). Proteome and phosphoproteome dynamics upon NTRK1 activation by its ligand, NGF, were analyzed in a time-dependent manner in IMR5 cells. Target validation by immunofluorescence staining and automated image processing was performed using the three other NB cell lines. (3) Results: In total, 230 proteins and 134 single phosphorylated class I phosphosites were found to be significantly regulated upon NTRK1 activation. Among known NTRK1 targets, Stathmin and the neurosecretory protein VGF were recovered. Additionally, we observed the upregulation and phosphorylation of Lamin A/C (LMNA) that accumulated inside nuclear foci. (4) Conclusions: We provide a comprehensive picture of NTRK1-induced proteome and phosphoproteome dynamics. The phosphorylation of LMNA within nucleic aggregates was identified as a prominent feature of NTRK1 signaling independent of the MYCN status of NB cells.
The German Conference on Bioinformatics (GCB) is an annual research conference bringing together researchers and practitioners in bioinformatics in Central Europe. The poster session is and always has been an important part of GCB, as it gives an excellent overview of current bioinformatics research topics in Germany and other countries.At GCB 2015, there will be 57 posters on display, whose abstracts (together with the poster number) are collected in the present document. Poster prizes will be presented to the presenting author(s) of the best posters, as selected by the program committee.
Quantitative secretome analyses are a high-performance tool for the discovery of physiological and pathophysiological changes in cellular processes. However, serum supplements in cell culture media limit secretome analyses, but serum depletion often leads to cell starvation and consequently biased results. To overcome these limiting factors, we investigated a model of T cell activation (Jurkat cells) and performed an approach for the selective enrichment of secreted proteins from conditioned medium utilizing metabolic marking of newly synthesized glycoproteins. Marked glycoproteins were labeled via bioorthogonal click chemistry and isolated by affinity purification. We assessed two labeling compounds conjugated with either biotin or desthiobiotin and the respective secretome fractions. 356 proteins were quantified using the biotin probe and 463 using desthiobiotin. 59 proteins were found differentially abundant (adjusted p-value ≤0.05, absolute fold change ≥1.5) between inactive and activated T cells using the biotin method and 86 using the desthiobiotin approach, with 31 mutual proteins cross-verified by independent experiments. Moreover, we analyzed the cellular proteome of the same model to demonstrate the benefit of secretome analyses and provide comprehensive data sets of both. 336 proteins (61.3%) were quantified exclusively in the secretome. Data are available via ProteomeXchange with identifier PXD004280.
Over the past years, phosphoproteomics has advanced to a prime tool in signaling research. Since then, an enormous amount of information about in vivo protein phosphorylation events has been collected providing a treasure trove for gaining a better understanding of the molecular processes involved in cell signaling. Yet, we still face the problem of how to achieve correct modification site localization. Here we use alternative fragmentation and different bioinformatics approaches for the identification and confident localization of phosphorylation sites. Phosphopeptide-enriched fractions were analyzed by multistage activation, collision-induced dissociation and electron transfer dissociation (ETD), yielding complementary phosphopeptide identifications. We further found that MASCOT, OMSSA and Andromeda each identified a distinct set of phosphopeptides allowing the number of site assignments to be increased. The postsearch engine SLoMo provided confident phosphorylation site localization, whereas different versions of PTM-Score integrated in MaxQuant differed in performance. Based on high-resolution ETD and higher collisional dissociation (HCD) data sets from a large synthetic peptide and phosphopeptide reference library reported by Marx et al. [Nat. Biotechnol. 2013, 31 (6), 557-564], we show that an Andromeda/PTM-Score probability of 1 is required to provide an false localization rate (FLR) of 1% for HCD data, while 0.55 is sufficient for high-resolution ETD spectra. Additional analyses of HCD data demonstrated that for phosphotyrosine peptides and phosphopeptides containing two potential phosphorylation sites, PTM-Score probability cutoff values of <1 can be applied to ensure an FLR of 1%. Proper adjustment of localization probability cutoffs allowed us to significantly increase the number of confident sites with an FLR of <1%.Our findings underscore the need for the systematic assessment of FLRs for different score values to report confident modification site localization.
The urothelium of the urinary bladder represents the first line of defense. However, uropathogenic E. coli (UPEC) damage the urothelium and cause acute bacterial infection. Here, we demonstrate the crosstalk between macrophages and the urothelium stimulating macrophage migration into the urothelium. Using spatial proteomics by MALDI-MSI and LC-MS/MS, a novel algorithm revealed the spatial activation and migration of macrophages. Analysis of the spatial proteome unravelled the coexpression of Myo9b and F4/80 in the infected urothelium, indicating that macrophages have entered the urothelium upon infection. Immunofluorescence microscopy additionally indicated that intraurothelial macrophages phagocytosed UPEC and eliminated neutrophils. Further analysis of the spatial proteome by MALDI-MSI showed strong expression of IL-6 in the urothelium and local inhibition of this molecule reduced macrophage migration into the urothelium and aggravated the infection. After IL-6 inhibition, the expression of matrix metalloproteinases and chemokines, such as CX3CL1 was reduced in the urothelium. Accordingly, macrophage migration into the urothelium was diminished in the absence of CX3CL1 signaling in Cx3cr1gfp/gfp mice. Conclusively, this study describes the crosstalk between the infected urothelium and macrophages through IL-6-induced CX3CL1 expression. Such crosstalk facilitates the relocation of macrophages into the urothelium and reduces bacterial burden in the urinary bladder.
Background Sepsis, a life-threatening condition caused by the dysregulated host response to infection, is a major global health concern. Understanding the impact of viral or bacterial pathogens in sepsis is crucial for improving patient outcomes. This study aimed to investigate the human cytomegalovirus (HCMV) seropositivity as a risk factor for development of sepsis in patients with COVID-19. Methods A multicenter observational study enrolled 95 intensive care patients with COVID-19-induced sepsis and 80 post-surgery individuals as controls. HCMV serostatus was determined using an ELISA test. Comprehensive clinical data, including demographics, comorbidities, and 30-day mortality, were collected. Statistical analyses evaluated the association between HCMV seropositivity and COVID-19 induced sepsis. Results The prevalence of HCMV seropositivity did not significantly differ between COVID-19-induced sepsis patients (78%) and controls (71%, p = 0.382) in the entire cohort. However, among patients aged ≤60 years, HCMV seropositivity was significantly higher in COVID-19 sepsis patients compared to controls (86% vs 61%, respectively; p = 0.030). Nevertheless, HCMV serostatus did not affect 30-day survival. Discussion These findings confirm the association between HCMV seropositivity and COVID-19 sepsis in non-geriatric patients. However, the lack of an independent effect on 30-day survival can be explained by the cross-reactivity of HCMV specific CD8 + T-cells towards SARS-CoV-2 peptides, which might confer some protection to HCMV seropositive patients. The inclusion of a post-surgery control group strengthens the generalizability of the findings. Further research is needed to elucidate the underlying mechanisms of this association, explore different patient populations, and identify interventions for optimizing patient management. Conclusion This study validates the association between HCMV seropositivity and severe COVID-19-induced sepsis in non-geriatric patients, contributing to the growing body of evidence on viral pathogens in sepsis. Although HCMV serostatus did not independently influence 30-day survival, future investigations should focus on unraveling the intricate interplay between HCMV, immune responses, and COVID-19. These insights will aid in risk stratification and the development of targeted interventions for viral sepsis.
Introduction: Application of systems biology/systems medicine approaches is promising for proteomics/biomedical research, but requires selection of an adequate modeling type.Areas covered: This article reviews the existing Boolean network modeling approaches, which provide in comparison with alternative modeling techniques several advantages for the processing of proteomics data. Application of methods for inference, reduction and validation of protein co-expression networks that are derived from quantitative high-throughput proteomics measurements is presented. It's also shown how Boolean models can be used to derive system-theoretic characteristics that describe both the dynamical behavior of such networks as a whole and the properties of different cell states (e.g. healthy or diseased cell states). Furthermore, application of methods derived from control theory is proposed in order to simulate the effects of therapeutic interventions on such networks, which is a promising approach for the computer-assisted discovery of biomarkers and drug targets. Finally, the clinical application of Boolean modeling analyses is discussed.Expert commentary: Boolean modeling of proteomics data is still in its infancy. Progress in this field strongly depends on provision of a repository with public access to relevant reference models. Also required are community supported standards that facilitate input of both proteomics and patient related data (e.g. age, gender, laboratory results, etc.).