01.07.2016 | Systems-Level Quality Improvement | Issue 7/2016

Journal of Medical Systems 7/2016

Distance Metric Based Oversampling Method for Bioinformatics and Performance Evaluation

Meng-Fong Tsai, Shyr-Shen Yu
Important notes
This article is part of the Topical Collection on Systems-Level Quality Improvement


Imbalanced classification arises when the classes in a dataset are unequally represented. On such datasets, the predictions made by most classification methods are highly accurate for the majority class but significantly less accurate for the minority class. To overcome this problem, this study took several imbalanced datasets from the well-known UCI datasets and designed and implemented an efficient algorithm that couples Top-N Reverse k-Nearest Neighbor (TRkNN) with the Synthetic Minority Oversampling TEchnique (SMOTE). The proposed algorithm was evaluated with classification methods such as logistic regression (LR), C4.5, Support Vector Machine (SVM), and Back Propagation Neural Network (BPNN). This research also adopted different distance metrics to classify the same UCI datasets. The empirical results illustrate that the Euclidean and Manhattan distances are not only more accurate but also more computationally efficient than the Chebyshev and Cosine distances. Therefore, the proposed algorithm based on TRkNN and SMOTE can be widely used to handle imbalanced datasets. Our recommendations on choosing suitable distance metrics can also serve as a reference for future studies.
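To make the role of the distance metric concrete, the following minimal Python sketch defines the four distances named in the abstract and plugs them into a plain SMOTE-style interpolation step. It is an illustration under stated assumptions, not the authors' algorithm: the TRkNN component is not described in the abstract and is omitted here, and the function names (`smote_like`, `cosine_distance`), the parameters `k` and `seed`, and the toy data are all hypothetical.

```python
import math
import random


def euclidean(a, b):
    # Straight-line (L2) distance.
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def manhattan(a, b):
    # Sum of absolute coordinate differences (L1).
    return sum(abs(x - y) for x, y in zip(a, b))


def chebyshev(a, b):
    # Largest absolute coordinate difference (L-infinity).
    return max(abs(x - y) for x, y in zip(a, b))


def cosine_distance(a, b):
    # One minus cosine similarity; undefined for zero vectors.
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("cosine distance is undefined for a zero vector")
    return 1.0 - dot / (norm_a * norm_b)


def smote_like(minority, n_new, k=2, dist=euclidean, seed=0):
    # Hypothetical helper: plain SMOTE-style interpolation between a
    # minority sample and one of its k nearest minority neighbours.
    # The TRkNN step from the paper is NOT implemented here.
    if len(minority) < 2:
        raise ValueError("need at least two minority samples")
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        i = rng.randrange(len(minority))
        base = minority[i]
        others = [p for j, p in enumerate(minority) if j != i]
        neighbours = sorted(others, key=lambda p: dist(base, p))[:k]
        neighbour = rng.choice(neighbours)
        gap = rng.random()  # position along the segment, in [0, 1)
        synthetic.append([b + gap * (n - b) for b, n in zip(base, neighbour)])
    return synthetic


if __name__ == "__main__":
    # Toy minority class (made-up values for illustration only).
    minority = [[1.0, 1.0], [1.5, 2.0], [2.0, 1.5], [3.0, 3.0]]
    for metric in (euclidean, manhattan, chebyshev, cosine_distance):
        print(metric.__name__, smote_like(minority, 2, dist=metric))
```

Swapping the `dist` argument changes which neighbours are considered nearest and therefore where synthetic minority samples are placed, which is one way the choice of metric can affect downstream classifier accuracy.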
