Abstract Background Cataracts are lens opacifications that are responsible for more than half of blindness cases worldwide, and the only treatment is surgical intervention. Phacoemulsification surgery, the most frequently performed cataract surgery in developed countries, has associated risks, some of which are related to excessive phacoemulsification energy levels and times. The protocol proposed in herein will be used to evaluate the feasibility of a new experimental medical device, the Eye Scan Ultrasound System (ESUS), for the automatic classification of cataract type and severity and quantitative estimation of the optimal phacoemulsification energy. Methods The pilot study protocol will be used to evaluate the feasibility and safety of the ESUS in clinical practice. The study will be conducted in subjects with age-related cataracts and on healthy subjects as controls. The procedures include data acquisition with the experimental ESUS, classification based on the Lens Opacity Classification System III (LOCS III, comparator) using a slit lamp, contrast sensitivity test, optical coherence tomography, specular microscopy and surgical parameters. ESUS works in A-scan pulse-echo mode, with a central frequency of 20 MHz. From the collected signals, acoustic parameters will be extracted and used for automatic cataract characterization and optimal phacoemulsification energy estimation. The study includes two phases. The data collected in the first phase (40 patients, 2 eyes per patient) will be used to train the ESUS algorithms, while the data collected in the second phase (10 patients, 2 eyes per patient) will be used to assess the classification performance. System safety will be monitored during the study. Discussion The present pilot study protocol will evaluate the feasibility and safety of the ESUS for use in clinical practice, and the results will support a larger clinical study for the efficacy assessment of the ESUS as a diagnostic tool. Ultimately, the ESUS is expected to represent a valuable tool for surgical planning by reducing complications associated with excessive levels of phacoemulsification energy and surgical times, which will have a positive impact on healthcare systems and society. The study is not yet recruiting. Trial registration ClinicalTrials.gov identifier NCT04461912 , registered on July 8, 2020.
We address the problem of robustness of auditory models as front ends for speech recognition. Auditory models have been referred as superior front ends when speech is corrupted by noise or linear filtering, but there is not yet a deep understanding of its functioning. We analyze some commonly used auditory models and show that they present some interesting properties which are useful for robust speech recognition. In our view, the short-time adaptation provided by hair cell models is a key factor for this robustness. A disadvantage of auditory models is that the distributions of the obtained features are not well represented by gaussian pdfs. We discuss the problem of parameter transformation in order to use a standard recognizer based on CDHMMs with gaussian pdfs and present some digit recognition experiments.
This paper presents an approach to the Query-by-Example task of finding spoken queries on speech databases when the intended match may be non-exact or slightly complex. The built system is low-resource as it tries to solve the problem where the language of queries and searched audio is unspecified. Our method is based on a modified Dynamic Time Warping (DTW) algorithm using posterior-grams and extracting intricate paths to account for special cases of query match such as word re-ordering, lexical variations and filler content. This system was evaluated on the MediaEval 2014 task of Query by Example Search on Speech (QUESST) where the spoken data is from different languages, unknown to the participant. We combined the results of five DTW modifications computed on the output of three phoneme recognizers of different languages. The combination of all systems provided the best performance overall and improved detection of complex case queries.
The main objective of the research prototype is to demonstrate a narrow band multi-conference system over the Internet where one route is a high frequency (HF) wireless link.