Abstract Background Metagenomic next-generation sequencing (mNGS) offers the promise of unbiased detection of emerging pathogens. However, in indexed sequencing, the sequential paradigm of data acquisition, demultiplexing, and analysis restrain read assignment in advance and real-time analysis, resulting in lengthy turnaround time for clinical metagenomic detection. Methods We described the utility of internal-index adaptors with different lengths of barcode in multiplex sequencing. The base composition for each position within these adaptors was well-balanced to ensure nucleotide diversity and optimal sequencing performance and to achieve the early assignment of reads by first sequencing the barcodes. Combined with an automated library preparation device, we delivered a rapid and real-time bioinformatics pathogen identification solution for the Illumina NextSeq platform. The diagnostic performance was evaluated by testing 153 lower respiratory tract specimens using mNGS in comparison to culture, 16S/internal transcribed spacer amplicon sequencing, and additional PCR-based tests. Results By calculating the average F1 scores of all read lengths under different threshold values, we established the optimal threshold for pathogens identification, and found that 36 bp was the optimal shortest read length for rapid mNGS analysis. Rapid detection had a negative percentage agreement and positive percentage agreement of 100% and 85.1% for bacteria and 97.4% and 80.3% for fungi, when compared to a composite standard. The rapid mNGS solution enabled accurate pathogen identification in about 9.1 to 10.1 h sample-to-answer turnaround time. Conclusions Optimized internal index adaptors combined with a real-time analysis pipeline provide a potential tool for a first-line test in critically ill patients.
RNAs play crucial and versatile roles in biological processes. Computational prediction approaches can help to understand RNA structures and their stabilizing factors, thus providing information on their functions, and facilitating the design of new RNAs. Machine learning (ML) techniques have made tremendous progress in many fields in the past few years. Although their usage in protein-related fields has a long history, the use of ML methods in predicting RNA tertiary structures is new and rare. Here, we review the recent advances of using ML methods on RNA structure predictions and discuss the advantages and limitation, the difficulties and potentials of these approaches when applied in the field.