Introduction
Material and methods
Strain collection and cultivation
Group | Training set | Validation set | Total |
---|---|---|---|
HVR | 65 | 39 | 104 |
Non-HVR | 92 | 44 | 136 |
Total | 157 | 83 | 240 |
Protein extraction, spectra acquisition, and species confirmation
MALDI-TOF parameters
Spectra analysis
Classification using machine learning algorithms
External validation
Actual/predicted | HVR RTs | Non-HVR RTs | % Correct |
---|---|---|---|
Support vector machine (SVM) | | | |
HVR RTs | 39 (TP) | 26 (FN) | 60.0% (sensitivity) |
Non-HVR RTs | 8 (FP) | 84 (TN) | 91.3% (specificity) |
| | 83.0% (PPV) | 76.4% (NPV) | 78.3% (accuracy) |
K-nearest neighbor (KNN) | | | |
HVR RTs | 58 (TP) | 7 (FN) | 89.2% (sensitivity) |
Non-HVR RTs | 4 (FP) | 88 (TN) | 95.7% (specificity) |
| | 93.6% (PPV) | 92.6% (NPV) | 93.0% (accuracy) |
Partial least squares discriminant analysis (PLS-DA) | | | |
HVR RTs | 64 (TP) | 1 (FN) | 98.5% (sensitivity) |
Non-HVR RTs | 1 (FP) | 91 (TN) | 98.9% (specificity) |
| | 98.5% (PPV) | 98.9% (NPV) | 98.7% (accuracy) |
Random forest (RF) | | | |
HVR RTs | 64 (TP) | 1 (FN) | 98.5% (sensitivity) |
Non-HVR RTs | 0 (FP) | 92 (TN) | 100% (specificity) |
| | 100% (PPV) | 98.9% (NPV) | 99.4% (accuracy) |
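
The percentages in the table above follow from the four confusion-matrix counts for each classifier. As a minimal sketch, assuming HVR RTs are treated as the positive class (the class name `BinaryConfusion` and the helper `pct` are illustrative, not taken from the study), the metrics can be recomputed like this:

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BinaryConfusion:
    """2x2 confusion matrix; HVR RTs are assumed to be the positive class."""
    tp: int  # HVR isolates predicted as HVR
    fn: int  # HVR isolates predicted as non-HVR
    fp: int  # non-HVR isolates predicted as HVR
    tn: int  # non-HVR isolates predicted as non-HVR

    def sensitivity(self) -> float:
        return self.tp / (self.tp + self.fn)

    def specificity(self) -> float:
        return self.tn / (self.tn + self.fp)

    def ppv(self) -> float:
        return self.tp / (self.tp + self.fp)

    def npv(self) -> float:
        return self.tn / (self.tn + self.fn)

    def accuracy(self) -> float:
        total = self.tp + self.fn + self.fp + self.tn
        return (self.tp + self.tn) / total


def pct(value: float) -> float:
    """Convert a proportion to a percentage rounded to one decimal place."""
    return round(100 * value, 1)


# Counts copied from the SVM and RF rows of the table.
svm = BinaryConfusion(tp=39, fn=26, fp=8, tn=84)
rf = BinaryConfusion(tp=64, fn=1, fp=0, tn=92)

print(pct(svm.sensitivity()), pct(svm.specificity()))  # 60.0 91.3
print(pct(svm.ppv()), pct(svm.npv()), pct(svm.accuracy()))  # 83.0 76.4 78.3
print(pct(rf.npv()), pct(rf.accuracy()))  # 98.9 99.4
```

The sketch does not guard against empty denominators, such as a classifier that never predicts the positive class, so PPV would raise `ZeroDivisionError` in that case.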
Results
MALDI-TOF spectra acquisition
Discrimination between HVRTs and non-HVRTs
External validation
Actual/predicted | HVR RTs | Non-HVR RTs | % Correct |
---|---|---|---|
Partial least squares discriminant analysis (PLS-DA) | | | |
HVR RTs | 38 (TP) | 1 (FN) | 97.4% (sensitivity) |
Non-HVR RTs | 1 (FP) | 43 (TN) | 97.7% (specificity) |
| | 97.4% (PPV) | 97.7% (NPV) | 97.6% (accuracy) |
Random forest (RF) | | | |
HVR RTs | 39 (TP) | 0 (FN) | 100% (sensitivity) |
Non-HVR RTs | 1 (FP) | 43 (TN) | 97.7% (specificity) |
| | 97.5% (PPV) | 100% (NPV) | 98.8% (accuracy) |
ML-subtyping of HVRTs
10-fold cross-validation (65 HVR isolates) | | | | |
---|---|---|---|---|
Random forest (RF) and partial least squares discriminant analysis (PLS-DA) | | | | |
Actual/predicted | RT023 | RT027/176 | RT045/078/126/127 | % Correct |
RT023 | 10 | 0 | 0 | 100% |
RT027/176 | 0 | 24 | 0 | 100% |
RT045/078/126/127 | 0 | 0 | 31 | 100% |
| | | | | 100% (accuracy) |
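
To illustrate how 65 isolates in three unequal groups can be divided for 10-fold cross-validation, here is a minimal sketch of a stratified fold assignment. It assumes stratification by ribotype group and omits shuffling; the source does not state how the folds were built, and the function name `stratified_folds` is hypothetical.

```python
from collections import defaultdict


def stratified_folds(labels, k=10):
    """Assign sample indices to k folds, spreading each class round-robin.

    Assumption: folds are stratified by class label. A real pipeline would
    normally shuffle within each class first; that step is left out here so
    the result is deterministic.
    """
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)

    folds = [[] for _ in range(k)]
    offset = 0  # carry the round-robin position across classes
    for label in sorted(by_class):
        for j, idx in enumerate(by_class[label]):
            folds[(offset + j) % k].append(idx)
        offset += len(by_class[label])
    return folds


# Group sizes copied from the cross-validation table (10 + 24 + 31 = 65).
labels = (
    ["RT023"] * 10
    + ["RT027/176"] * 24
    + ["RT045/078/126/127"] * 31
)
folds = stratified_folds(labels, k=10)

for fold_id, test_idx in enumerate(folds):
    # Everything outside the held-out fold forms the training portion.
    train_idx = [i for i in range(len(labels)) if i not in set(test_idx)]
    print(fold_id, len(train_idx), len(test_idx))
```

With these group sizes, every held-out fold contains 6 or 7 isolates and exactly one RT023 isolate, so the smallest group is represented in each test fold.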
External validation (39 isolates) | | | | |
---|---|---|---|---|
Actual/predicted | RT023 | RT027/176 | RT045/078/126/127 | % Correct |
Random forest (RF) | | | | |
RT023 | 6 | 0 | 3 | 66.7% |
RT027/176 | 0 | 7 | 0 | 100% |
RT045/078/126/127 | 0 | 0 | 23 | 100% |
| | | | | 92.3% (accuracy) |
Partial least squares discriminant analysis (PLS-DA) | | | | |
RT023 | 9 | 0 | 0 | 100% |
RT027/176 | 0 | 7 | 0 | 100% |
RT045/078/126/127 | 1 | 0 | 22 | 95.7% |
| | | | | 97.4% (accuracy) |
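
The "% Correct" column and the overall accuracy in the three-class tables can be recomputed from the confusion matrices. This is a minimal sketch that assumes rows are actual ribotype groups and columns are predicted groups, as the "Actual/predicted" header indicates; the function names are illustrative.

```python
LABELS = ["RT023", "RT027/176", "RT045/078/126/127"]


def per_class_correct(matrix):
    """Percent of each actual class (row) assigned to the matching column."""
    return [round(100 * row[i] / sum(row), 1) for i, row in enumerate(matrix)]


def overall_accuracy(matrix):
    """Percent of all isolates on the diagonal of the confusion matrix."""
    correct = sum(row[i] for i, row in enumerate(matrix))
    total = sum(sum(row) for row in matrix)
    return round(100 * correct / total, 1)


# Counts copied from the external validation table (39 isolates).
rf_external = [
    [6, 0, 3],
    [0, 7, 0],
    [0, 0, 23],
]
plsda_external = [
    [9, 0, 0],
    [0, 7, 0],
    [1, 0, 22],
]

for name, matrix in [("RF", rf_external), ("PLS-DA", plsda_external)]:
    by_class = dict(zip(LABELS, per_class_correct(matrix)))
    print(name, by_class, overall_accuracy(matrix))
```

For RF this gives 66.7% for RT023 and 92.3% overall; for PLS-DA it gives 95.7% for RT045/078/126/127 and 97.4% overall, matching the table.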