    Description and Discussion on DCASE 2023 Challenge Task 2: First-Shot Unsupervised Anomalous Sound Detection for Machine Condition Monitoring
Citations: 0 · References: 0 · Related Papers: 10
    Abstract:
We present the task description of the Detection and Classification of Acoustic Scenes and Events (DCASE) 2023 Challenge Task 2: "First-shot unsupervised anomalous sound detection (ASD) for machine condition monitoring". The main goal is to enable rapid deployment of ASD systems for new kinds of machines without hyperparameter tuning. In past ASD tasks, developed methods tuned hyperparameters for each machine type, because the development and evaluation datasets contained the same machine types. In practice, however, collecting normal and anomalous data for a development dataset can be infeasible. The 2023 Task 2 therefore focuses on the first-shot problem: training a model for a completely novel machine type. Specifically, (i) each machine type has only one section (a subset of the machine type's data), and (ii) the machine types in the development and evaluation datasets are completely different. Analysis of 86 submissions from 23 teams revealed that the keys to outperforming the baselines were: 1) sampling techniques for dealing with class imbalance across domains and attributes, 2) generation of synthetic samples for robust detection, and 3) use of multiple large pre-trained models to extract meaningful embeddings for the anomaly detector.
    Keywords:
    Hyperparameter
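Of the three winning strategies the abstract lists, the embedding-based one reduces, in its simplest form, to a few lines. The sketch below is an illustration, not the challenge baseline: random vectors stand in for the embeddings a large pre-trained audio model would produce, and test clips are scored by their mean distance to the k nearest normal-clip embeddings.

```python
# Minimal sketch of embedding-based anomaly scoring (my toy setup, not
# the DCASE baseline): in a real first-shot system the embeddings would
# come from a pre-trained audio model; random vectors stand in here.
import numpy as np
from sklearn.neighbors import NearestNeighbors

rng = np.random.default_rng(0)
normal_emb = rng.normal(size=(500, 128))        # normal training clips
test_emb = rng.normal(loc=0.5, size=(10, 128))  # clips to score

# Fit on normal data only; score by mean distance to k nearest normals.
knn = NearestNeighbors(n_neighbors=5).fit(normal_emb)
dist, _ = knn.kneighbors(test_emb)
anomaly_score = dist.mean(axis=1)               # larger => more anomalous
print(anomaly_score)
```

The `fit` call mirrors the unsupervised setting the task describes: the detector only ever sees normal sounds of the new machine type.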
Related Papers:

It is typical for a machine learning system to have numerous hyperparameters that affect its learning rate and prediction quality. Finding a good combination of the hyperparameters is, however, a challenging job. This is mainly because evaluation of each combination is extremely expensive computationally; indeed, training a machine learning system on real data with just a single combination of hyperparameters usually takes hours or even days. In this paper, we address this challenge by trying to predict the performance of the machine learning system with a given combination of hyperparameters without completing the expensive learning process. Instead, we terminate the training process at an early stage, collect the model performance data and use it to predict which of the combinations of hyperparameters is most promising. Our preliminary experiments show that such a prediction improves the performance of the commonly used random search approach.
    Hyperparameter
    Hyperparameter Optimization
    Citations (0)
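A hedged sketch of the early-termination idea described in the abstract above; the dataset, the model, and the "rank by early validation score" rule are stand-ins of mine, not the paper's actual performance predictor.

```python
# Sketch: train each hyperparameter combination for only a few epochs,
# then fully train just the most promising one (my simplification of the
# early-termination idea, not the paper's predictor).
from sklearn.datasets import make_classification
from sklearn.linear_model import SGDClassifier
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=2000, random_state=0)
X_tr, X_val, y_tr, y_val = train_test_split(X, y, random_state=0)

candidates = [{"alpha": a} for a in (1e-5, 1e-4, 1e-3, 1e-2)]

def early_score(params, n_epochs=2):
    """Validation accuracy after only a few epochs: the cheap proxy."""
    clf = SGDClassifier(max_iter=n_epochs, tol=None, random_state=0, **params)
    return clf.fit(X_tr, y_tr).score(X_val, y_val)

best = max(candidates, key=early_score)   # pick the most promising combination
final = SGDClassifier(max_iter=1000, random_state=0, **best).fit(X_tr, y_tr)
print(best, final.score(X_val, y_val))
```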
The success of machine learning on a given task depends on, among other things, which learning algorithm is selected and its associated hyperparameters. Selecting an appropriate learning algorithm and setting its hyperparameters for a given data set can be a challenging task, especially for users who are not experts in machine learning. Previous work has examined using meta-features to predict which learning algorithm and hyperparameters should be used. However, choosing a set of meta-features that are predictive of algorithm performance is difficult. Here, we propose to apply collaborative filtering techniques to learning algorithm and hyperparameter selection, and find that doing so avoids determining which meta-features to use and outperforms traditional meta-learning approaches in many cases.
    Hyperparameter
    Hyperparameter Optimization
    Learning classifier system
    Citations (0)
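The collaborative-filtering view above amounts to matrix completion: datasets play the role of users, algorithm/hyperparameter configurations the role of items, and observed accuracies the role of ratings. Below is a toy rank-2 factorization on synthetic data (my construction, not the paper's method) that fills in the unobserved entries and recommends a configuration per dataset.

```python
# Toy matrix completion: rows are datasets, columns are configurations,
# entries are accuracies; only some entries are observed (my setup).
import numpy as np

rng = np.random.default_rng(0)
true = rng.random((6, 8))               # hidden dataset x config accuracies
mask = rng.random((6, 8)) < 0.6         # entries we pretend to have observed

P = rng.normal(scale=0.1, size=(6, 2))  # latent dataset factors
Q = rng.normal(scale=0.1, size=(8, 2))  # latent configuration factors
for _ in range(2000):                   # gradient steps on observed entries
    err = (true - P @ Q.T) * mask
    P, Q = P + 0.05 * err @ Q, Q + 0.05 * err.T @ P

pred = P @ Q.T                          # filled-in performance matrix
print("recommended config per dataset:", pred.argmax(axis=1))
```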
The performance of a machine learning algorithm depends largely on determining a set of hyperparameters. These hyperparameters have a significant influence on the accuracy of the algorithm. As algorithm complexity increases, there are more and more candidate hyperparameters. How to quickly and accurately select the right hyperparameters for a given problem has become a popular area of research. This paper uses a Bayesian optimization approach to assist hyperparameter selection for machine learning, validated on the task of binary classification of true and fake news. It analyses the principles of Bayesian optimization and how the approach can be applied to machine learning model parameter selection. The machine learning models used are K-Nearest Neighbour (KNN), Random Forest, and Gradient Boosted Decision Trees (GBDT): three models commonly used for binary classification problems, with different numbers and kinds of hyperparameters. The experimental results show that tuning the default hyperparameters of these models with Bayesian optimization can substantially improve classification accuracy. The research in this paper can also provide ideas for other, similar hyperparameter selection work.
    Hyperparameter
    Bayesian Optimization
    Hyperparameter Optimization
    Binary classification
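As a rough illustration of the loop this abstract describes, here is a minimal Bayesian-optimization sketch for a single KNN hyperparameter: a Gaussian process models validation accuracy, and an upper-confidence-bound rule picks the next value to try. The surrogate, acquisition rule, and search space are assumptions of mine, not the paper's exact setup.

```python
# Minimal Bayesian-optimization loop (generic sketch, not the paper's code).
import numpy as np
from sklearn.datasets import make_classification
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.model_selection import cross_val_score
from sklearn.neighbors import KNeighborsClassifier

X, y = make_classification(n_samples=500, random_state=0)

def objective(k):
    """Cross-validated accuracy of KNN with n_neighbors=k."""
    return cross_val_score(KNeighborsClassifier(n_neighbors=int(k)), X, y).mean()

ks = np.arange(1, 51)                 # candidate values of n_neighbors
tried = [5, 25, 45]                   # initial design points
scores = [objective(k) for k in tried]

for _ in range(10):
    gp = GaussianProcessRegressor().fit(np.array(tried)[:, None], scores)
    mu, sd = gp.predict(ks[:, None], return_std=True)
    k_next = int(ks[np.argmax(mu + sd)])   # upper-confidence-bound pick
    if k_next in tried:                    # nothing new looks promising
        break
    tried.append(k_next)
    scores.append(objective(k_next))

print("best n_neighbors:", tried[int(np.argmax(scores))])
```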
    Bug prediction is a technique that strives to identify where defects will appear in a software system. Bug prediction employs machine learning to predict defects in software entities based on software metrics. These machine learning models usually have adjustable parameters, called hyperparameters, that need to be tuned for the prediction problem at hand. However, most studies in the literature keep the model hyperparameters set to the default values provided by the used machine learning frameworks. In this paper we investigate whether optimizing the hyperparameters of a machine learning model improves its prediction power. We study two machine learning algorithms: k-nearest neighbours (IBK) and support vector machines (SVM). We carry out experiments on five open source Java systems. Our results show that (i) models differ in their sensitivity to their hyperparameters, (ii) tuning hyperparameters gives at least as accurate models for SVM and significantly more accurate models for IBK, and (iii) most of the default values are changed during the tuning phase. Based on these findings we recommend tuning hyperparameters as a necessary step before using a machine learning model in bug prediction.
    Hyperparameter
    Hyperparameter Optimization
    Software bug
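The tuning-versus-defaults comparison the study runs can be reproduced in miniature with scikit-learn. In the hedged snippet below, synthetic imbalanced data stands in for the Java defect data and KNeighborsClassifier for Weka's IBK; the grid is my choice, not the paper's.

```python
# Miniature tuning-vs-defaults comparison (my stand-ins, not the study's data).
from sklearn.datasets import make_classification
from sklearn.model_selection import GridSearchCV, cross_val_score
from sklearn.neighbors import KNeighborsClassifier

# Imbalanced classes, mimicking the skew typical of defect data.
X, y = make_classification(n_samples=600, weights=[0.85], random_state=0)

default = cross_val_score(KNeighborsClassifier(), X, y).mean()
search = GridSearchCV(KNeighborsClassifier(),
                      {"n_neighbors": list(range(1, 30, 2)),
                       "weights": ["uniform", "distance"]}).fit(X, y)
print(f"default accuracy {default:.3f} -> tuned {search.best_score_:.3f}",
      search.best_params_)
```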
    Hyperparameters are critical in machine learning, as different hyperparameters often result in models with significantly different performance. Hyperparameters may be deemed confidential because of their commercial value and the confidentiality of the proprietary algorithms that the learner uses to learn them. In this work, we propose attacks on stealing the hyperparameters that are learned by a learner. We call our attacks hyperparameter stealing attacks. Our attacks are applicable to a variety of popular machine learning algorithms such as ridge regression, logistic regression, support vector machine, and neural network. We evaluate the effectiveness of our attacks both theoretically and empirically. For instance, we evaluate our attacks on Amazon Machine Learning. Our results demonstrate that our attacks can accurately steal hyperparameters. We also study countermeasures. Our results highlight the need for new defenses against our hyperparameter stealing attacks for certain machine learning algorithms.
    Hyperparameter
    Hyperparameter Optimization
    Citations (35)
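For ridge regression, the attack described above has a particularly clean form: given the training data and the learned weights, the regularization strength is the value that makes the gradient of the training objective vanish. The sketch below is my reconstruction of that idea, not the paper's code.

```python
# Reconstruction of the hyperparameter-stealing idea for ridge regression:
# with training data and learned weights in hand, solve for the alpha that
# zeroes the training gradient (my sketch, not the paper's code).
import numpy as np
from sklearn.linear_model import Ridge

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 5))
y = X @ rng.normal(size=5) + 0.1 * rng.normal(size=200)

secret_alpha = 3.0                      # the hyperparameter to be stolen
w = Ridge(alpha=secret_alpha, fit_intercept=False).fit(X, y).coef_

# Stationarity of the ridge objective: X^T (X w - y) + alpha * w = 0,
# so alpha is a one-unknown least-squares solution.
stolen_alpha = -(w @ X.T @ (X @ w - y)) / (w @ w)
print(f"secret alpha {secret_alpha}, stolen alpha {stolen_alpha:.3f}")
```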
Machine learning algorithms have become increasingly widespread in applications and research. Classifiers in machine learning build a model from data that allows computers to improve future predictions. Despite their popularity, most machine learning algorithms require expertise to make decisions about the appropriate model and parameter settings for a particular problem domain, and it is very difficult to identify the classifier that is best suited to the specific characteristics of a data set. In this work, we present a method to tune the hyperparameters of the best machine learning algorithm for an overlapped software defect data set. The results show that using the algorithm and hyperparameter tuning suggested by our method improves the predictive performance of a classifier over its default settings on overlapped data sets.
    Hyperparameter
    Hyperparameter Optimization
    Citations (0)
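Since the abstract leaves the tuning method unspecified, the snippet below is only one plausible instantiation: a randomized search over SVM hyperparameters on synthetic data with deliberately overlapping classes. The overlap knob (`class_sep`), the model, and the search space are my choices, not the paper's.

```python
# One plausible instantiation of tuning on class-overlapped data
# (my choices throughout, not the paper's method).
from scipy.stats import loguniform
from sklearn.datasets import make_classification
from sklearn.model_selection import RandomizedSearchCV, cross_val_score
from sklearn.svm import SVC

# Small class_sep plus label noise produces heavily overlapping classes.
X, y = make_classification(n_samples=600, class_sep=0.5, flip_y=0.05,
                           random_state=0)

default = cross_val_score(SVC(), X, y).mean()
search = RandomizedSearchCV(
    SVC(),
    {"C": loguniform(1e-2, 1e2), "gamma": loguniform(1e-4, 1e0)},
    n_iter=20, random_state=0).fit(X, y)
print(f"default {default:.3f} -> tuned {search.best_score_:.3f}",
      search.best_params_)
```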
Novel technologies in automated machine learning ease the complexity of algorithm selection and hyperparameter optimization. Hyperparameters are important for machine learning models, as they significantly influence model performance. Many optimization techniques have achieved notable success in hyperparameter tuning and surpassed the performance of human experts. However, relying on such techniques as black-box algorithms can leave machine learning practitioners without insight into the relative importance of different hyperparameters. In this paper, we model the relationship between the performance of machine learning models and their hyperparameters to discover trends and gain insights, with empirical results based on six classifiers and 200 datasets. Our results enable users to decide whether it is worth conducting a possibly time-consuming tuning strategy, to focus on the most important hyperparameters, and to choose adequate hyperparameter spaces for tuning. The experiments show that gradient boosting and AdaBoost outperform the other classifiers across the 200 problems, but need tuning to reach their best performance. Overall, the results provide a quantitative basis for focusing effort on guided automated hyperparameter optimization and contribute toward the development of better automated machine learning frameworks.
    Hyperparameter
    Hyperparameter Optimization
    AdaBoost
    Citations (2)
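A crude way to probe the importance question this abstract studies: sweep each hyperparameter in isolation and treat the spread of validation scores as its importance. The sketch below does this for gradient boosting with scikit-learn's validation_curve; the grids and the score-spread heuristic are mine, not the paper's methodology.

```python
# One-at-a-time sensitivity sweep as a toy importance measure
# (my heuristic, not the paper's method).
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.model_selection import validation_curve

X, y = make_classification(n_samples=500, random_state=0)
grids = {"learning_rate": [0.01, 0.05, 0.1, 0.3, 1.0],
         "max_depth": [1, 2, 3, 5, 8],
         "n_estimators": [25, 50, 100, 200]}

for name, values in grids.items():
    _, val = validation_curve(GradientBoostingClassifier(random_state=0),
                              X, y, param_name=name, param_range=values, cv=3)
    spread = np.ptp(val.mean(axis=1))   # score range across the sweep
    print(f"{name}: importance ~ {spread:.3f}")
```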