1 July 2016 | Systems-Level Quality Improvement | Issue 7/2016

Journal of Medical Systems 7/2016

Distance Metric Based Oversampling Method for Bioinformatics and Performance Evaluation

Journal:
Journal of Medical Systems > Issue 7/2016
Authors:
Meng-Fong Tsai, Shyr-Shen Yu
Important notes
This article is part of the Topical Collection on Systems-Level Quality Improvement

Abstract

Imbalanced classification arises when the classes in a dataset are unequally represented among its population. On such data, the predictions made by most classification methods are highly accurate for the majority class but significantly less accurate for the minority class. To overcome this problem, this study took several imbalanced datasets from the well-known UCI datasets and designed and implemented an efficient algorithm that couples Top-N Reverse k-Nearest Neighbor (TRkNN) with the Synthetic Minority Oversampling TEchnique (SMOTE). The proposed algorithm was evaluated by applying it to classification methods such as logistic regression (LR), C4.5, Support Vector Machine (SVM), and Back Propagation Neural Network (BPNN). This research also adopted different distance metrics to classify the same UCI datasets. The empirical results indicate that the Euclidean and Manhattan distances are not only more accurate but also more computationally efficient than the Chebyshev and cosine distances. Therefore, the proposed algorithm based on TRkNN and SMOTE can be widely used to handle imbalanced datasets. Our recommendations on choosing suitable distance metrics can also serve as a reference for future studies.
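
To make the role of the distance metric concrete, the following is a minimal sketch of SMOTE-style oversampling with a pluggable metric. It is not the authors' algorithm: the abstract does not describe the TRkNN step, so that step is omitted here, and the function names `pairwise_distance` and `smote_like_oversample`, the parameters `k`, `n_new`, and `seed`, and the toy data are all assumptions made for illustration.

```python
import numpy as np


def pairwise_distance(a, b, metric="euclidean"):
    """Distance between two 1-D vectors under one of the four metrics named in the abstract."""
    diff = a - b
    if metric == "euclidean":
        return float(np.sqrt(np.sum(diff ** 2)))
    if metric == "manhattan":
        return float(np.sum(np.abs(diff)))
    if metric == "chebyshev":
        return float(np.max(np.abs(diff)))
    if metric == "cosine":
        denom = np.linalg.norm(a) * np.linalg.norm(b)
        if denom == 0.0:
            # Assumption: treat a zero vector as maximally dissimilar.
            return 1.0
        return 1.0 - float(np.dot(a, b) / denom)
    raise ValueError(f"unknown metric: {metric}")


def smote_like_oversample(minority, n_new, k=3, metric="euclidean", seed=0):
    """Create n_new synthetic minority samples by interpolating between
    a randomly chosen sample and one of its k nearest minority neighbours."""
    rng = np.random.default_rng(seed)
    minority = np.asarray(minority, dtype=float)
    n = len(minority)
    if n < 2:
        raise ValueError("need at least two minority samples")
    k = min(k, n - 1)  # never select the sample itself as a neighbour
    synthetic = []
    for _ in range(n_new):
        i = int(rng.integers(n))
        dists = np.array(
            [pairwise_distance(minority[i], minority[j], metric) for j in range(n)]
        )
        dists[i] = np.inf  # exclude the sample itself
        neighbours = np.argsort(dists)[:k]
        j = int(rng.choice(neighbours))
        gap = rng.random()  # interpolation factor in [0, 1)
        synthetic.append(minority[i] + gap * (minority[j] - minority[i]))
    return np.array(synthetic)


# Toy minority class (hypothetical data, not from the UCI datasets).
points = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0]])
new_points = smote_like_oversample(points, n_new=5, k=2, metric="manhattan")
```

Changing the `metric` argument changes which neighbours are selected, and therefore where the synthetic samples fall. That choice is what the study compares across the Euclidean, Manhattan, Chebyshev, and cosine distances.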
