The proliferation of wearable devices that collect activity and heart rate data has facilitated new ways to measure sleeping and waking durations unobtrusively and longitudinally. Most existing sleep/wake identification algorithms are based on activity only and are trained on expensive and laboriously annotated polysomnography (PSG). Heart rate can also be reflective of sleep/wake transitions, which has motivated its investigation herein in an unsupervised algorithm. Moreover, it is necessary to develop a personalized approach to deal with interindividual variance in sleep/wake patterns.We aimed to develop an unsupervised personalized sleep/wake identification algorithm using multifaceted data to explore the benefits of incorporating both heart rate and activity level in these types of algorithms and to compare this approach's output with that of an existing commercial wearable device's algorithms.In this study, a total of 14 community-dwelling older adults wore wearable devices (Fitbit Alta; Fitbit Inc) 24 hours a day and 7 days a week over period of 3 months during which their heart rate and activity data were collected. After preprocessing the data, a model was developed to distinguish sleep/wake states based on each individual's data. We proposed the use of hidden Markov models and compared different modeling schemes. With the best model selected, sleep/wake patterns were characterized by estimated parameters in hidden Markov models, and sleep/wake states were identified.When applying our proposed algorithm on a daily basis, we found there were significant differences in estimated parameters between weekday models and weekend models for some participants.Our unsupervised approach can be effectively implemented based on an individual's multifaceted sleep-related data from a commercial wearable device. A personalized model is shown to be necessary given the interindividual variability in estimated parameters.
Abstract Objective This study aims to improve the classification of the fall incident severity level by considering data imbalance issues and structured features through machine learning. Materials and Methods We present an incident report classification (IRC) framework to classify the in-hospital fall incident severity level by addressing the imbalanced class problem and incorporating structured attributes. After text preprocessing, bag-of-words features, structured text features, and structured clinical features were extracted from the reports. Next, resampling techniques were incorporated into the training process. Machine learning algorithms were used to build classification models. IRC systems were trained, validated, and tested using a repeated and randomly stratified shuffle-split cross-validation method. Finally, we evaluated the system performance using the F1-measure, precision, and recall over 15 stratified test sets. Results The experimental results demonstrated that the classification system setting considering both data imbalance issues and structured features outperformed the other system settings (with a mean macro-averaged F1-measure of 0.733). Considering the structured features and resampling techniques, this classification system setting significantly improved the mean F1-measure for the rare class by 30.88% (P value < .001) and the mean macro-averaged F1-measure by 8.26% from the baseline system setting (P value < .001). In general, the classification system employing the random forest algorithm and random oversampling method outperformed the others. Conclusions Structured features provide essential information for categorizing the fall incident severity level. Resampling methods help rebalance the class distribution of the original incident report data, which improves the performance of machine learning models. The IRC framework presented in this study effectively automates the identification of fall incident reports by the severity level.
BACKGROUND The proliferation of wearable devices that collect activity and heart rate data has facilitated new ways to measure sleeping and waking durations unobtrusively and longitudinally. Most existing sleep/wake identification algorithms are based on activity only and are trained on expensive and laboriously annotated polysomnography (PSG). Heart rate can also be reflective of sleep/wake transitions, which has motivated its investigation herein in an unsupervised algorithm. Moreover, it is necessary to develop a personalized approach to deal with interindividual variance in sleep/wake patterns. OBJECTIVE We aimed to develop an unsupervised personalized sleep/wake identification algorithm using multifaceted data to explore the benefits of incorporating both heart rate and activity level in these types of algorithms and to compare this approach’s output with that of an existing commercial wearable device’s algorithms. METHODS In this study, a total of 14 community-dwelling older adults wore wearable devices (Fitbit Alta; Fitbit Inc) 24 hours a day and 7 days a week over period of 3 months during which their heart rate and activity data were collected. After preprocessing the data, a model was developed to distinguish sleep/wake states based on each individual’s data. We proposed the use of hidden Markov models and compared different modeling schemes. With the best model selected, sleep/wake patterns were characterized by estimated parameters in hidden Markov models, and sleep/wake states were identified. RESULTS When applying our proposed algorithm on a daily basis, we found there were significant differences in estimated parameters between weekday models and weekend models for some participants. CONCLUSIONS Our unsupervised approach can be effectively implemented based on an individual’s multifaceted sleep-related data from a commercial wearable device. A personalized model is shown to be necessary given the interindividual variability in estimated parameters.
Good sleep habit is essential to maintain a quality life. Sensor-based wearable devices have been increasingly deployed to monitor activity and measure sleep duration in a non-intrusive, affordable, and portable way. Existing algorithms for detecting sleep and wake in wrist-worn devices are mainly based on activity count or accelerometer data inference. It is validated that sleep can be reflected by various vital signs, including physical activity, heart rates, and pulse oximetry. However, little attention has been paid to heart rates measurements for sleep/wake identification in commercial wearable devices together with activity information. Our study developed an unsupervised and personalized algorithm to infer sleep and wake states using heart rates and step counts based on hidden Markov models. The fusion of two HMMs successfully dealt with multi-granularity data and predicted sleep/wake states in the minimal granularity. The proposed algorithm was illustrated through a real-life case study. The agreement between our algorithm and Fitbit's scoring was 89.35%. The proposed algorithm enabled identifying more afternoon naps, earlier sleep onset compared to Fitbit's scoring. The results showed that heart rates were informative when distinguishing sleep and wake while compensating estimations driven from step counts.
Retrospective analysing of fall incident reports can uncover hidden information, identify potential risk factors, and improve healthcare quality. This study explores potential fall incident clusters using word embeddings and hierarchical clustering. Fall incident reports from 7 local hospitals in Hong Kong were catalogued into 5 potential clusters with significantly different fall severity, gender, reporting department, and keywords. This study demonstrates the feasibility of using text clustering methods on real-world fall incident reports mining.
In an age of big data and information overload, recommendation systems have evolved rapidly. Throughout the traditional design of interior spaces, the specialised nature of the work and the high rate of human involvement has led to high costs. With the continuous development of artificial intelligence technology, it provides a favourable environment for reducing the development cost of the system. This study proposes a two-stage modelling scheme based on deep learning networks for the intelligent design of display space layouts, divided into two parts: matching and layout, which greatly improves design efficiency. The research results show that through comparison tests, its prediction accuracy reaches more than 80%, which can well meet the matching requirements of household products. The training number of Epochs is between 15 and 30, its training curve tends to saturate and the best accuracy can reach 100%, while the running time of the hybrid algorithm proposed in this study is only 20.716 s, which is significantly better compared with other algorithms. The proposed hybrid algorithm has a running time of only 20.716 s, which is significantly better than other algorithms. The approach innovatively combines deep learning technology with computer-aided design (CAD), enabling designers to automatically generate display space layouts with good visibility and usability based on complex design constraints. This study presents an innovative application of the research methodology by combining quantitative and qualitative methods to analyse the data. The application of both methods provides a more comprehensive understanding of the problem under study and provides insight into the key factors that influence the results. The findings of this study can provide useful insights for policy makers and practitioners.
Background Electronic health records (EHRs) in unstructured formats are valuable sources of information for research in both the clinical and biomedical domains. However, before such records can be used for research purposes, sensitive health information (SHI) must be removed in several cases to protect patient privacy. Rule-based and machine learning–based methods have been shown to be effective in deidentification. However, very few studies investigated the combination of transformer-based language models and rules. Objective The objective of this study is to develop a hybrid deidentification pipeline for Australian EHR text notes using rules and transformers. The study also aims to investigate the impact of pretrained word embedding and transformer-based language models. Methods In this study, we present a hybrid deidentification pipeline called OpenDeID, which is developed using an Australian multicenter EHR-based corpus called OpenDeID Corpus. The OpenDeID corpus consists of 2100 pathology reports with 38,414 SHI entities from 1833 patients. The OpenDeID pipeline incorporates a hybrid approach of associative rules, supervised deep learning, and pretrained language models. Results The OpenDeID achieved a best F1-score of 0.9659 by fine-tuning the Discharge Summary BioBERT model and incorporating various preprocessing and postprocessing rules. The OpenDeID pipeline has been deployed at a large tertiary teaching hospital and has processed over 8000 unstructured EHR text notes in real time. Conclusions The OpenDeID pipeline is a hybrid deidentification pipeline to deidentify SHI entities in unstructured EHR text notes. The pipeline has been evaluated on a large multicenter corpus. External validation will be undertaken as part of our future work to evaluate the effectiveness of the OpenDeID pipeline.
BACKGROUND Telehealth is an effective means to assist existing health care systems, particularly for the current aging society. However, most extant telehealth systems employ individual data sources by offline data processing, which may not recognize health deterioration in a timely way. OBJECTIVE Our study objective was two-fold: to design and implement an integrated, personalized telehealth system on a community-based level; and to evaluate the system from the perspective of user acceptance. METHODS The system was designed to capture and record older adults’ health-related information (eg, daily activities, continuous vital signs, and gait behaviors) through multiple measuring tools. State-of-the-art data mining techniques can be integrated to detect statistically significant changes in daily records, based on which a decision support system could emit warnings to older adults, their family members, and their caregivers for appropriate interventions to prevent further health deterioration. A total of 45 older adults recruited from 3 elderly care centers in Hong Kong were instructed to use the system for 3 months. Exploratory data analysis was conducted to summarize the collected datasets. For system evaluation, we used a customized acceptance questionnaire to examine users’ attitudes, self-efficacy, perceived usefulness, perceived ease of use, and behavioral intention on the system. RESULTS A total of 179 follow-up sessions were conducted in the 3 elderly care centers. The results of exploratory data analysis showed some significant differences in the participants’ daily records and vital signs (eg, steps, body temperature, and systolic blood pressure) among the 3 centers. The participants perceived that using the system is a good idea (ie, attitude: mean 5.67, SD 1.06), comfortable (ie, self-efficacy: mean 4.92, SD 1.11), useful to improve their health (ie, perceived usefulness: mean 4.99, SD 0.91), and easy to use (ie, perceived ease of use: mean 4.99, SD 1.00). In general, the participants showed a positive intention to use the first version of our personalized telehealth system in their future health management (ie, behavioral intention: mean 4.45, SD 1.78). CONCLUSIONS The proposed health monitoring system provides an example design for monitoring older adults’ health status based on multiple data sources, which can help develop reliable and accurate predictive analytics. The results can serve as a guideline for researchers and stakeholders (eg, policymakers, elderly care centers, and health care providers) who provide care for older adults through such a telehealth system.
Background Telehealth is an effective means to assist existing health care systems, particularly for the current aging society. However, most extant telehealth systems employ individual data sources by offline data processing, which may not recognize health deterioration in a timely way. Objective Our study objective was two-fold: to design and implement an integrated, personalized telehealth system on a community-based level; and to evaluate the system from the perspective of user acceptance. Methods The system was designed to capture and record older adults’ health-related information (eg, daily activities, continuous vital signs, and gait behaviors) through multiple measuring tools. State-of-the-art data mining techniques can be integrated to detect statistically significant changes in daily records, based on which a decision support system could emit warnings to older adults, their family members, and their caregivers for appropriate interventions to prevent further health deterioration. A total of 45 older adults recruited from 3 elderly care centers in Hong Kong were instructed to use the system for 3 months. Exploratory data analysis was conducted to summarize the collected datasets. For system evaluation, we used a customized acceptance questionnaire to examine users’ attitudes, self-efficacy, perceived usefulness, perceived ease of use, and behavioral intention on the system. Results A total of 179 follow-up sessions were conducted in the 3 elderly care centers. The results of exploratory data analysis showed some significant differences in the participants’ daily records and vital signs (eg, steps, body temperature, and systolic blood pressure) among the 3 centers. The participants perceived that using the system is a good idea (ie, attitude: mean 5.67, SD 1.06), comfortable (ie, self-efficacy: mean 4.92, SD 1.11), useful to improve their health (ie, perceived usefulness: mean 4.99, SD 0.91), and easy to use (ie, perceived ease of use: mean 4.99, SD 1.00). In general, the participants showed a positive intention to use the first version of our personalized telehealth system in their future health management (ie, behavioral intention: mean 4.45, SD 1.78). Conclusions The proposed health monitoring system provides an example design for monitoring older adults’ health status based on multiple data sources, which can help develop reliable and accurate predictive analytics. The results can serve as a guideline for researchers and stakeholders (eg, policymakers, elderly care centers, and health care providers) who provide care for older adults through such a telehealth system.