Artikel ; Online: Outlier-SMOTE: A refined oversampling technique for improved detection of COVID-19.
Intelligence-based medicine
2020 Band 3, Seite(n) 100023
Abstract: Almost every dataset these days continually faces the predicament of class imbalance. It is difficult to train classifiers on these types of data as they become biased towards a set of classes, hence leading to reduction in classifier performance. This ... ...
Abstract | Almost every dataset these days continually faces the predicament of class imbalance. It is difficult to train classifiers on these types of data as they become biased towards a set of classes, hence leading to reduction in classifier performance. This setback is often tackled by the use of various over-sampling or under-sampling algorithms. But, the method which stood out of all the numerous algorithms was the Synthetic Minority Oversampling Technique (SMOTE). SMOTE generates synthetic samples of the minority class by oversampling each data-point by considering linear combinations of existing minority class neighbors. Each minority data sample generates an equal number of synthetic data. As the world is suffering from the plight of COVID-19 pandemic, the authors applied the idea to help boost the classifying performance whilst detecting this deadly virus. This paper presents a modified version of SMOTE known as Outlier-SMOTE wherein each data-point is oversampled with respect to its distance from other data-points. The data-point which is farther than the other data-points is given greater importance and is oversampled more than its counterparts. Outlier-SMOTE reduces the chances of overlapping of minority data samples which often occurs in the traditional SMOTE algorithm. This method is tested on five benchmark datasets and is eventually tested on a COVID-19 dataset. F-measure, Recall and Precision are used as principle metrics to evaluate the performance of the classifier as is the case for any class imbalanced data set. The proposed algorithm performs considerably better than the traditional SMOTE algorithm for the considered datasets. |
---|---|
Sprache | Englisch |
Erscheinungsdatum | 2020-12-03 |
Erscheinungsland | Netherlands |
Dokumenttyp | Journal Article |
ISSN | 2666-5212 |
ISSN (online) | 2666-5212 |
DOI | 10.1016/j.ibmed.2020.100023 |
Datenquelle | MEDical Literature Analysis and Retrieval System OnLINE |
Zusatzmaterialien
Kategorien
Über subito bestellen
Dieser Service ist kostenpflichtig (siehe Lieferbedingungen von subito). Bestellungen, die einen Artikel nebst Supplementary Material umfassen, werden grundsätzlich wie mehrfache Bestellungen bearbeitet. Gebühren fallen in diesen Fällen für jede einzelne Bestellung an.
Fernleihe an ZB MED
Sie können sich den gewünschten Titel als lokale Nutzerin oder lokaler Nutzer von ZB MED direkt an den Standort Köln schicken lassen.