Development of machine learning models for diagnosis of glaucoma

Seong Jae Kim; Kyong Jin Cho; Sejong Oh

doi:10.1371/journal.pone.0177726

Abstract

The study aimed to develop machine learning models that have strong prediction power and interpretability for diagnosis of glaucoma based on retinal nerve fiber layer (RNFL) thickness and visual field (VF). We collected various candidate features from the examination of retinal nerve fiber layer (RNFL) thickness and visual field (VF). We also developed synthesized features from original features. We then selected the best features proper for classification (diagnosis) through feature evaluation. We used 100 cases of data as a test dataset and 399 cases of data as a training and validation dataset. To develop the glaucoma prediction model, we considered four machine learning algorithms: C5.0, random forest (RF), support vector machine (SVM), and k-nearest neighbor (KNN). We repeatedly composed a learning model using the training dataset and evaluated it by using the validation dataset. Finally, we got the best learning model that produces the highest validation accuracy. We analyzed quality of the models using several measures. The random forest model shows best performance and C5.0, SVM, and KNN models show similar accuracy. In the random forest model, the classification accuracy is 0.98, sensitivity is 0.983, specificity is 0.975, and AUC is 0.979. The developed prediction models show high accuracy, sensitivity, specificity, and AUC in classifying among glaucoma and healthy eyes. It will be used for predicting glaucoma against unknown examination records. Clinicians may reference the prediction results and be able to make better decisions. We may combine multiple learning models to increase prediction accuracy. The C5.0 model includes decision rules for prediction. It can be used to explain the reasons for specific predictions.

Citation: Kim SJ, Cho KJ, Oh S (2017) Development of machine learning models for diagnosis of glaucoma. PLoS ONE 12(5): e0177726. https://doi.org/10.1371/journal.pone.0177726

Editor: Bin Liu, Harbin Institute of Technology Shenzhen Graduate School, CHINA

Received: February 1, 2017; Accepted: May 2, 2017; Published: May 23, 2017

Copyright: © 2017 Kim et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: Data are available from Dryad at: doi:10.5061/dryad.q6ft5.

Funding: This study was conducted by the research fund of Dankook University in 2015. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Competing interests: The authors have declared that no competing interests exist.

Introduction

Glaucoma is characterized by dysfunction and loss of retinal ganglion cells (RGCs), with resulting structural changes to the optic nerve head, retinal nerve fiber layer (RNFL) thickness, and ganglion cell and inner plexiform layers as well as loss of the visual field [1].

The diagnosis of glaucoma in its early stages is challenging. Misdiagnosis can lead to failure to identify individuals with the condition during its early stages until significant functional loss has occurred. Thus, early detection of glaucoma allows for early treatment to delay vision loss [2,3]. Diagnosing glaucoma is problematic, especially when it is in the earliest stage of glaucoma. Diagnosis of glaucoma in myopic eyes and patients with brain diseases such as brain tumor is known to be difficult due to those eye’s characteristic disc shape and visual field defect. A more effective glaucoma-detection machine learning model would be very helpful to clinicians.

The classification scheme in machine learning is suitable for diagnosis glaucoma. Chan et al [4] tested various classification algorithms based on the examination of visual fields. Goldbaum et al [5] also compared machine learning classifiers and suggested a mixture of of Gaussian as the best classifier. Bizios et al [6] tested the artificial neural network (ANN) and support vector machine (SVM) based on RNFL thickness parameters. Barella et al [7] investigated the diagnostic accuracy of machine learning classifiers (MLCs) and random forest (RF) using RNFL and optic nerve data. They got 0.877 of area under the ROC value using RF. Recently, Silva et al [8] tested almost all of the classifiers using Spectral Domain optical coherence tomography (OCT) and standard automated perimetry. They got 0.946 as the best aROC value using RF. Previous studies show that SVM and RF have good prediction power. The trade-off between prediction power and interpretability is one of the well-known issues in machine learning. The black box models such as SVM and deep learning algorithm show good prediction power. However, it is difficult to understand why the model gives the prediction result. Therefore, they are not entirely suitable for medical diagnosis because clinicians want to know both the prediction and the reason for the prediction. Decision tree models [9] such as C5.0 [10,11] show good interpretability and poor prediction power. Logistic Regression and Naïve Bayes are algorithms used for probabilistic classification [12]. iDHS-EL [13] and iRSpot-EL [14] are predictors developed for identifying the location of DNase I Hypersensitive Sites (DHSs) and DNA recombination spots in human genomes. The goal of this study is to develop a machine learning model that has strong prediction power for diagnosis of glaucoma. To achieve the goal, we developed good features from examination data for prediction, and we tested C5.0, RF, SVM, and k-nearest neighbor (KNN) algorithms. We describe details in the next section.

Materials and methods

Procedure

We used three kinds of examination records to develop the learning model: RNFL thickness, visual field (VF) test parameters, and general ophthalmic examination. The records contained glaucoma cases and healthy controls. We extracted as many features (data attributes) as possible from the examination record. All the features were arranged as a data table form except missing values. We performed t-tests to evaluate each feature in the data table, and select suitable features for classifying healthy controls and glaucoma.

We divided the base dataset into a test dataset (100 cases) and another dataset (399 cases). Another dataset was used for developing the learning model. We used 80% of it for model training and 20% of it for validation of the model. After finding the best learning model, we evaluated the model using the test dataset.

To develop the learning (glaucoma prediction) model, we considered four machine learning algorithms: C5.0, RF, SVM, and KNN. We repeatedly composed a learning model using training dataset and evaluated it by validation dataset, and a model which showed the best validation accuracy was chosen as the best learning model.

After building the best learning models upon four algorithms, we evaluated the models in various ways. The classification accuracy, sensitivity, specificity, and likelihood ratios were compared. Receiver operating characteristics, (ROC) curves and areas under the curve (AUC) value were also analyzed. In the case of the C5.0 model, this includes a decision tree to predict glaucoma. We analyzed the clinical meanings of the decision rules in the tree. All procedure was implemented by R (http://www.r-project.org) and its supported packages.

Participants

The medical records of patients who underwent optical coherence tomography (OCT) and VF examinations at Dankook University Hospital and Gyeongsang National University Hospital between January 2012 and November 2015 were reviewed. To conduct the study, all the patients underwent comprehensive ophthalmological examinations, which included slit-lamp biomicroscopy, best corrected visual acuity (BCVAC), refractive error examination, central corneal thickness (CCT) measurement, Goldmann applanation tonometry, dilated fundus examination, and fundus and red-free fundus photography (Canon, Tokyo, Japan). An automated VF test was conducted using the 30–2 program Swedish interactive threshold algorithm standard on a Humphrey 740 visual field analyzer (Carl Zeiss Meditec Inc., Dublin, CA). The spectral-domain OCT (SD OCT) images, obtained using the Spectralis^® (Heidelberg Engineering GmbH, Heidelberg, Germany) platform, were used to measure the peripapillary RNFL thickness. This study was approved by the Dankook University Hospital Institutional Review Board, Korea (ID# DKUH 2016-11-011). Informed consent of participants was waived by the Dankook University Hospital Institutional Review Board. This research follows the tenets of the Declaration of Helsinki.

In total, 297 cases of eyes (of patients) with glaucoma (POAG or NTG) and 202 cases of eyes (of patients) without glaucoma were included. The inclusion criteria for glaucomatous eyes were: best-corrected visual acuity of 20/40 or better; normal anterior segment on a slit-lamp examination; and diagnosis of glaucoma by the principal investigator or co-investigator. The glaucoma diagnosis was based on characteristic glaucomatous structural change to the optic disc accompanied by glaucomatous visual field defects. The criteria for a glaucomatous visual field defect were: glaucoma hemifield test [15] outside the normal limit, pattern standard deviation with a P value <5%, or a cluster of <3 points in the pattern deviation plot in a single hemifield (superior or inferior) with a P value of <5%, one of which must have a P value of <1%. Any one of the preceding criteria, if repeatable, was considered sufficient evidence of a glaucomatous visual field defect.

Exclusion criteria were as follows in addition to those who do not met the inclusion criteria: history of ocular inflammation or trauma; and the presence of concurrent retinal disease (i.e., vascular disorder or macular degeneration), optic nerve disease other than glaucoma, or a brain disorder that could influence the visual field results.

The inclusion criteria for normal eyes were a best-corrected visual acuity of 20/40, normal anterior segment on a slit-lamp examination, no RNFL defects in red-free fundus photographs, no visual field defects, and an intraocular pressure ≤21 mmHg. Table 1 summarizes the characteristics of the participants.

Download:

Table 1. Characteristics of the participants.

https://doi.org/10.1371/journal.pone.0177726.t001

Feature selection and dataset preparation

Table 2 summarizes the basic features that we extracted from the examination records for the glaucoma and healthy controls. To select good features for building the learning model, firstly we removed the features that contained missing values in over 50% of whole cases. We then performed t-tests against the rest of the features to see the class separability of the features. As a result, the features 2–8, 10–12 were selected.

Download:

Table 2. List of basic features from the examination data.

We extracted them from examination records for glaucoma and healthy controls.

https://doi.org/10.1371/journal.pone.0177726.t002

To increase the quality of the training dataset, we made a synthesized feature. The feature RNFL4.mean reflects the average value of RNFL SUP, INF, and TMP. Because the four RNFL features contain partial information about RNFL, and we need to reflect whole RNFL, we tested every combination of the four RNFL features and SUP-INF-TMP combination showed best performance.

The final step of feature selection was to sort good features from candidates’ features. We performed a classification test on every combination of the feature subset of candidates’ features using the C5.0 algorithm. Table 3 summarizes the final features from the feature selection process. Fig 1 shows a box plot for the features. All features show a large difference of median value between glaucoma and healthy controls. Fig 2 shows the PCA plot for the prepared dataset. In the plot, each point means a case in the dataset. Generally, the glaucoma cases are well separated from healthy control cases. Some cases are located in border areas or opposite areas. The goal of the learning model may be to correctly predict the cases. The right plot of Fig 2 shows the relationship between distribution of cases and features. In the glaucoma group, PSD, GHT, ocular_pressure, and age have high values whereas MD and RNFL4_mean have low values. In the case of cornea_thickness, the healthy control group has a little bit higher value than glaucoma group.

Download:

Table 3. Final features list for building the training model.

We removed the features that contained many missing values. We then performed t-tests against the rest of the features to see class separability of the features. The feature RNFL4.mean reflects mean of SUP-INF-TMP combination.

https://doi.org/10.1371/journal.pone.0177726.t003

Download:

Fig 1. Box plots for selected features (g: Glaucoma, h: Health control).

All features show a large difference of median values between glaucoma and healthy controls.

https://doi.org/10.1371/journal.pone.0177726.g001

Download:

Fig 2. PCA plot for prepared dataset.

Each point means a case in the dataset. Generally, the glaucoma cases are well separated from the healthy control cases. Some cases are located in the border area or opposite area. Right plot shows relationship between distribution of cases and features. In the glaucoma group, PSD, GHT, ocular_presure, and age have high values whereas MD and RNFL4_mean have low values.

https://doi.org/10.1371/journal.pone.0177726.g002

Learning model evaluation criteria

The accuracy, sensitivity, specificity, likelihood ratio, and ROC/AUC have been widely used as criteria for evaluating a diagnosis model [16]. The following terms are fundamental to understanding the utility of them:

True positive (TP): the patient has a disease and the prediction is positive.
False positive (FP): the patient does not have a disease but the prediction is positive.
True negative (TN): the patient does not have a disease and the prediction is negative
False negative (FN): the patient has a disease but the prediction is negative.

The accuracy of a diagnosis model refers to the ability of the model to correctly identify those patients with the disease and without the disease:

The sensitivity of a diagnosis model refers to the ability of the model to correctly identify those patients with the disease:

The specificity of a diagnosis model refers to the ability of the test to correctly identify those patients without the disease:

The likelihood ratio is defined as the ratio of expected test results in subjects with a certain disease to the subjects without the disease.¹⁰ The Likelihood ratio for positive test results (LR+) tells us how much more likely the positive test result is to occur in subjects with the disease compared to those without the disease:

The Likelihood ratio for negative test results (LR–) represents the ratio of the probability that a negative result will occur in subjects with the disease to the probability that the same result will occur in subjects without the disease:¹⁰

The receiver operating characteristic (ROC) plot expresses relationship between sensitivity and 1 –Specificity. The closer the ROC curve is located to upper-left hand corner, the better the model. The area under the curve (AUC) can have any value between 0 and 1 and it is a good indicator of the goodness of the model.

Results

Classification test

Fig 3 depicts the classification testing procedure conducted using learning models. From the classification test using the validation dataset, we recorded the statistics as shown in Table 4. As can be seen, the RF model shows the best values on all evaluation criteria. Other models show similar performance.

Download:

Fig 3. Classification test procedure using learning models.

https://doi.org/10.1371/journal.pone.0177726.g003

Download:

Table 4. Statistics of four learning models from classification tests.

The RF model shows the best values on all evaluation criteria. Other models show similar performance.

https://doi.org/10.1371/journal.pone.0177726.t004

In accuracy, the RF model has a 0.98 rate. The other three models have 0.97. All four models have enough accuracy for medical application.

The Sensitivity of RF and C5.0 is 0.983. It means that the proposed three models exactly predict against glaucoma patients and their accuracy is 0.983. It also means that they show a very small FN ratio. In the medical field, FN ratio is more important than FP ratio. Therefore, the thrree models are suitable for diagnosis of glaucoma.

The Specificity of the RF and KNN is 0.975. It shows good prediction power against healthy controls. LR+ is the best indicator for ruling in diagnosis. The higher the LR+, the more the test is indicative of a disease. Good diagnostic tests have LR+ > 10 and their positive result has a significant contribution to the diagnosis [17]. The LR+ of the RF and KNN models shows 39.33 and 38.67, respectively, and C5.0 and SVM are also larger than 19. LR—is a good indicator for ruling out the diagnosis. Good diagnostic tests have LR–< 0.1. The lower the LR–, the more significant contribution of the test is in ruling out disease [17]. The LR—of RF model shows 0.017, C5.0 and SVM shows 0.018, and KNN is 0.034.

Table 5 shows the detailed evaluation results of the RF model. There are two misclassified training samples in the table; one healthy sample is classified into the glaucoma group (FP and one glaucoma sample is classified into the healthy group (FN). In the medical situation, FN is more important than FP. The RF model shows very high accuracy (0.98) and very low FN rate (0.01).

Download:

Table 5. Classification results of RF model using the test dataset.

There are two misclassified training samples in the table, one healthy sample is classified into the glaucoma group (FP and one glaucoma sample is classified into the healthy group (FN).

https://doi.org/10.1371/journal.pone.0177726.t005

Fig 4 shows ROC curves and AUC values for all four models. AUC expresses global quality of the prediction model and the RF and C5.0 models show 0.979, SVM is over 0.967, and KNN is 0.971.

Download:

Fig 4. ROC curve and AUC for four models.

AUC expresses global quality of prediction models and RF and C5.0 models show 0.979, SVM is over 0.967, and KNN is 0.971. All models show very high values near 1.0.

https://doi.org/10.1371/journal.pone.0177726.g004

Table 6 shows the comparison of model performance between previous works and the proposed RF model. Only AUC of Bizios et al⁶ is higher than the proposed model, but the number of features is much more and sensitivity and specificity are lower than in the proposed model.

Download:

Table 6. Comparison of previous works and the proposed model.

Only AUC of Bizios⁶ is higher than the proposed model, but the number of features is higher and sensitivity and specificity are lower than the proposed model.

https://doi.org/10.1371/journal.pone.0177726.t006

From the classification test, we reached following conclusions:

The quality of our developed features is suitable to use for our glaucoma prediction. It does not depend on any specific learning model. It leads to best evaluation values on the RF model, but it also leads to good evaluation values on C5.0, SVM, and KNN models.
The values of measures in Table 4 say that RF, C5.0, and SVM prediction models have very strong and stable potential for glaucoma prediction, with its sensitivity measure being very high. Furthermore, the C5.0 model has good interpretability because it is a decision tree model.
In conclusion, RF, C5.0, and SVM, based on the proposed features, may be useful for the diagnosis of glaucoma.

Decision tree of C5.0

C5.0 is an advanced version of ID3 and C4.5 that is developed by Ross Quinlan.^10,11 C5.0 became an industrial standard for making a decision tree. We used the C50 package in R for testing the C5.0 algorithm. By using the package, we could see and manipulate the structure of decision tree. During the building process of the decision tree, C50 automatically performed the pruning tasks. From the C5.0 model, we constructed a decision tree. Fig 5 shows a whole decision tree. It contains 19 rules and training error of the model is 0.016. Table 7 summarizes usage of features on the decision tree. In C5.0 model, RNFL4.mean, ocular_pressure, MD, and PSD are mainly used for decision (prediction) rules. Most of cases that have RNFL4.mean < = 89.34 are the glaucoma group.

Download:

Fig 5. Decision tree for diagnosis of glaucoma from C5.0 algorithm.

It contains 19 rules and the training error of the model is 0.016.

https://doi.org/10.1371/journal.pone.0177726.g005

Download:

Table 7. Usage of features in the decision tree.

RNFL4.mean, ocular_pressure, MD, and PSD are mainly used for decision (prediction) rules. Most of cases that have RNFL4.mean < = 89.34 are glaucoma group.

https://doi.org/10.1371/journal.pone.0177726.t007

The rule in line 2 on Fig 5 means that if a case has RNFL4.mean < = 89.33334 and MD < = –4.02 then the case is classified into 1 (glaucoma). The number 126 refers to correctly classified cases in the training dataset by this rule. The rule in line 4 says that if a case has RNFL4.mean < = 89.33334, MD > –4.02, and age > 77, it will be classified into class 0 health control). The number 1 in line 12 means number of misclassified cases.

Discussion

Glaucoma is a serious disease that can cause complete, permanent blindness, and its early diagnosis is very difficult. In recent years, computer-aided screening and diagnosis of glaucoma has made considerable progress. The accuracy of the prediction model developed in this study was investigated. A visual field index (VFI) value of 97 or higher was defined as early glaucoma. Among the 12 cases of early glaucoma, 11 cases were diagnosed as glaucoma and 1 case was misdiagnosed as normal. Therefore, the diagnosis rate of early glaucoma was 91.7%. One case that was misdiagnosed is discussed below.

We reviewed several cases in which there are differences in the results between the clinical diagnosis and the algorithm (C5.0) in detail (Table 8). Firstly, in case 6 (Fig 6), the presence of tigroid fundus and peripapillary atrophy was observed, and there was a decrease in RNFL thickness on the peripapillary RNFL OCT scan. Both eyes were clinically diagnosed as normal by the clinical findings that comprehensively judged the appearance of the optic disc, visual field examination, and normal range of IOP. On the contrary, based on the decision tree generated by the C5.0 algorithm, the mean deviation was reduced to –1.82 dB in both eyes and finally she was diagnosed as having bilateral glaucoma. In this case, the algorithm seems to be diagnosed as glaucoma from the beginning due to the decrease in peripapillary RNFL thickness by high myopia (actually, she had myopia of 6 diopters). Likewise, it is difficult to clinically differentiate between normal and glaucoma because RNFL thickness is often reduced even without glaucoma in patients with high myopia [18–20]. Recently, reports on various OCT parameters and the optic disc morphology for distinguishing normal from glaucoma in high myopia have been published. In the future, it might be possible to improve the accuracy of the algorithm by adding the refraction of the eye and OCT indices, such as macular ganglion cell-inner plexiform layer (GCIPL) thickness, quadrant or clock-hour thickness of RNFL [21–23]. Secondly, in case 81 (Fig 7), clinically, glaucoma was diagnosed in the left eye, but the C5.0 algorithm judged it to be normal. In this patient, both MD and PSD were not significantly worse than the algorithm’s criteria because of the early glaucoma in the left eye. As in this case, we thought that the algorithm had a limitation on the diagnosis of “early glaucoma” with a lack of data in this study. However, this limitation might be improved by using a quadrant or clock-hour thickness of RNFL instead of mean RNFL thickness or by increasing the number of cases. Finally, in case 161 (Fig 8), the review of the decision tree of the C5.0 algorithm revealed that the central corneal thickness was higher than that of the others, and thus it was determined to be normal rather than glaucoma. This error would improve if the number of cases is increased and the standard of central corneal is changed and if the mean RNFL thickness is changed to another OCT index.

Download:

Fig 6. Case 6, color-fundus and red-free fundus photography (A), peripapillary RNFL thickness measured by SD-OCT (B), and automated 30–2 visual field test (C).

The presence of a tigroid fundus and peripapillary atrophy was observed, and there was a decrease in the RNFL thickness on the peripapillary RNFL thickness scan. In the visual field test, the abnormalities were judged to be of no clinical significance.

https://doi.org/10.1371/journal.pone.0177726.g006

Download:

Fig 7. Case 81, color-fundus and red-free fundus photography (A), peripapillary RNFL thickness measured by SD-OCT (B), and automated 30–2 visual field test (C).

Fundus photographs show an increased cup-to-disc ratio and RNFL defects in the both eyes. SD-OCT shows decrease in peripapillary thickness of inferotemporal quadrant for both eyes. Visual field defects are apparent in both eyes.

https://doi.org/10.1371/journal.pone.0177726.g007

Download:

Fig 8. Case 161, color-fundus and red-free fundus photography (A), peripapillary RNFL thickness measured by SD-OCT (B), and automated 30–2 visual field test (C).

Fundus photographs show an increased cup-to-disc ratio in both eyes and a RNFL defect in the left eye. SD-OCT shows a decrease in the peripapillary thickness of the infratemporal quadrant of the left eye. The visual field test demonstrates field defect in the left eye.

https://doi.org/10.1371/journal.pone.0177726.g008

Download:

Table 8. Demographic and clinical data of cases with differences between clinical diagnosis and algorithmic judgment.

https://doi.org/10.1371/journal.pone.0177726.t008

Recently, two new machine-learning genome analysis methods, Pse-Analysis (http://bioinformatics.hitsz.edu.cn/Pse-Analysis/) and Pse-in-One [24], have been introduced. These methods support sample feature extraction, optimal parameter selection, model training, cross-validation, and prediction quality evaluation. The methods are optimized for DNA/RNA and protein/peptide sequence data. The in-built support for feature extraction and optimal parameter selection can render these methods invaluable for the diagnosis of Glaucoma. Further investigation and research needs to be conducted to establish if this is a plausible solution for diagnosis of Glaucoma.

Most classification test report results demonstrate that learning models such as RF and SVM deliver a better performance than KNN. Table 4 shows that the accuracy of KNN is similar to other advanced learning models, which means that the derived dataset used in our research has the high quality required for classification but does not clearly reveal the relative performance of the learning models. This establishes that the performance of KNN is expected to deteriorate with an increase in the volume of validation data.

As we mentioned earlier, FN is a more serious error than FP in medical applications. Many learning models support control of the FN rate. In the C5.0 model, we can assign higher error cost to FN than FP. In the RF model, we can modify cutoff values for classes. For example, if we assign stricter values for decisions of healthy control, then we can reduce the number of glaucoma cases that are classified into the healthy control group.

If we want to improve classification accuracy, we can use ensemble learning [25]. It uses multiple learning algorithms to obtain better accuracy. In our cases, we can consider prediction results from the four learning models, and take the majority of results as a final decision.

In our study, we confirm that the machine learning model has many clinical applications and is useful for diagnosing glaucoma. If we gather additional clinical data, we can construct a more accurate, elaborate learning model. In our future studies, we will clarify the cases that on the border between healthy controls and glaucoma cases. For this purpose, we will analyze clinical image data and merge the data with our model. We will also develop diagnostic support software using pre-constructed learning models. With precision medicine gaining considerable attention, we plan to construct new machine-learning models for major ophthalmological diseases and their treatments using precision medicines.

Acknowledgments

This study was conducted by the research fund of Dankook University in 2015.

Author Contributions

Conceptualization: SK KJC SO.
Data curation: SO.
Formal analysis: SK KJC SO.
Funding acquisition: SO.
Investigation: SK KJC.
Methodology: SO.
Project administration: SO.
Resources: SK KJC.
Software: SO.
Supervision: SO.
Validation: SK KJC.
Visualization: SO.
Writing – original draft: SK KJC SO.
Writing – review & editing: SK KJC SO.

References

1. Weinreb RN, Aung T, Medeiros FA. The pathophysiology and treatment of glaucoma: a review. Jama 2014;311(18):1901–11. pmid:24825645
- View Article
- PubMed/NCBI
- Google Scholar
2. Tay E, Seah SK, Chan SP, Lim ATH, Chew SJ, Foster PJ. Optic disk ovality as an index of tilt and its relationship to myopia and perimetry. Am J Ophthalmol 2005;139(2):247–52. pmid:15733984
- View Article
- PubMed/NCBI
- Google Scholar
3. Özdek SC, Önol M, Gürelik G, Hasanreisoglu B. Scanning laser polarimetry in normal subjects and patients with myopia. Br J Ophthalmol 2000;84(3):264–7. pmid:10684835
- View Article
- PubMed/NCBI
- Google Scholar
4. Chan K, Lee TW, Sample PA, Goldbaum MH, Weinreb RN, Sejnowski TJ. Comparison of machine learning and traditional classifiers in glaucoma diagnosis. IEEE T Bio-med Eng 2002;49(9):963–74
- View Article
- Google Scholar
5. Goldbaum MH, Sample PA, Chan K, Williams J, Lee TW, Blumenthal E, et al. Comparing machine learning classifiers for diagnosing glaucoma from standard automated perimetry. Invest Ophth Vis Sci 2002;43(1):162–9.
- View Article
- Google Scholar
6. Bizios D, Heijl A, Hougaard JL, Bengtsson B. Machine learning classifiers for glaucoma diagnosis based on classification of retinal nerve fibre layer thickness parameters measured by Stratus OCT. Acta Ophthalmol 2010;88(1):44–52. pmid:20064122
- View Article
- PubMed/NCBI
- Google Scholar
7. Barella KA, Costa VP, Gonçalves Vidotti V, Silva FR, Dias M, Gomi ES. Glaucoma diagnostic accuracy of machine learning classifiers using retinal nerve fiber layer and optic nerve data from SD-OCT. J Ophthalmol 2013;2013.
- View Article
- Google Scholar
8. Silva FR, Vidotti VG, Cremasco F, Dias M, Gomi ES, Costa VP. Sensitivity and specificity of machine learning classifiers for glaucoma diagnosis using Spectral Domain OCT and standard automated perimetry. Arq Bras Oftalmol 2013;76(3):170–4. pmid:23929078
- View Article
- PubMed/NCBI
- Google Scholar
9. Kotsiantis SB, Zaharakis I, Pintelas P. Supervised machine learning: A review of classification techniques. Emerging Artificial Intelligence Applications in Computer Engineering, Maglogiannis I. et al. (Eds.) IOS Press 2007; 3–24.
10. Quinlan JR. Induction of decision trees. Mach. Learn 1986;1(1):81–106.
- View Article
- Google Scholar
11. Quinlan J. R. C4.5: Programs for Machine Learning. Morgan Kaufmann Publishers, 1993.
12. Caruana R. An Empirical Comparison of Supervised Learning Algorithms. Proceedings of the 23rd international conference on Machine learning. 2006 June 25–29; Pittsburgh USA; ACM; 2006. p.161-168.
13. Liu B, Long R, Chou K. iDHS-EL: identifying DNase I hypersensitive sites by fusing three different modes of pseudo nucleotide composition into an ensemble learning framework. Bioinformatics. 2016;32(16):2411–8. pmid:27153623
- View Article
- PubMed/NCBI
- Google Scholar
14. Liu B, Wang S, Long R, Chou K. iRSpot-EL: identify recombination spots with an ensemble learning approach. Bioinformatics. 2017;33(1):35–41. pmid:27531102
- View Article
- PubMed/NCBI
- Google Scholar
15. Åsman P, Heijl A. Glaucoma hemifield test: automated visual field evaluation. Arch Ophthalmol 1992;110(6):812–9. pmid:1596230
- View Article
- PubMed/NCBI
- Google Scholar
16. Lalkhen AG, McCluskey A. Clinical tests: sensitivity and specificity. Continuing Education in Anaesthesia, Critical Care & Pain 2008;8(6):221–3.
- View Article
- Google Scholar
17. Šimundić A-M. Measures of diagnostic accuracy: basic definitions. Med Biol Sci 2008;22(4):61–5.
- View Article
- Google Scholar
18. Leung CK-S, Mohamed S, Leung KS, Cheung CYL, Chan SLW, Cheng DKY, et al. Retinal nerve fiber layer measurements in myopia: an optical coherence tomography study. Invest Ophthalmol Vis Sci 2006;47(12):5171–6. pmid:17122099
- View Article
- PubMed/NCBI
- Google Scholar
19. Rauscher FM, Sekhon N, Feuer WJ, Budenz DL. Myopia affects retinal nerve fiber layer measurements as determined by optical coherence tomography. J Glaucoma 2009;18(7):501. pmid:19745664
- View Article
- PubMed/NCBI
- Google Scholar
20. Kang SH, Hong SW, Im SK, Lee SH, Ahn MD. Effect of myopia on the thickness of the retinal nerve fiber layer measured by Cirrus HD optical coherence tomography. Invest Ophthalmol Vis Sci 2010;51(8):4075–83. pmid:20237247
- View Article
- PubMed/NCBI
- Google Scholar
21. Seong M, Sung KR, Choi EH, Kang SY, Cho JW, Um TW, et al. Macular and peripapillary retinal nerve fiber layer measurements by spectral domain optical coherence tomography in normal-tension glaucoma. Invest Ophthalmol Vis Sci 2010;51(3):1446–52. pmid:19834029
- View Article
- PubMed/NCBI
- Google Scholar
22. Choi YJ, Jeoung JW, Park KH, Kim DM. Glaucoma Detection Ability of Ganglion Cell-Inner Plexiform Layer Thickness by Spectral-Domain Optical Coherence Tomography in High MyopiaGanglion Cell-Inner Plexiform Layer in High Myopia. Invest Ophthalmol Vis Sci 2013;54(3):2296–304. pmid:23462754
- View Article
- PubMed/NCBI
- Google Scholar
23. Seol BR, Jeoung JW, Park KH. Glaucoma Detection Ability of Macular Ganglion Cell-Inner Plexiform Layer Thickness in Myopic Preperimetric GlaucomaGlaucoma Detection Ability in Myopic Glaucoma. Invest Ophthalmol Vis Sci 2015;56(13):8306–13. pmid:26720484
- View Article
- PubMed/NCBI
- Google Scholar
24. Liu B,Liu F, Wang X, Chen J, Fang L, Chou KC. Pse-in-One: a web server for generating various modes of pseudo components of DNA, RNA, and protein sequences. Nucleic Acids Res 2015;43 (W1): W65–W71. pmid:25958395
- View Article
- PubMed/NCBI
- Google Scholar
25. Dietterich TG. Ensemble learning. The handbook of brain theory and neural networks. Arbib MA; 2002.

[ref1] 1. Weinreb RN, Aung T, Medeiros FA. The pathophysiology and treatment of glaucoma: a review. Jama 2014;311(18):1901–11. pmid:24825645
View Article
PubMed/NCBI
Google Scholar

[2] View Article

[3] PubMed/NCBI

[4] Google Scholar

[ref2] 2. Tay E, Seah SK, Chan SP, Lim ATH, Chew SJ, Foster PJ. Optic disk ovality as an index of tilt and its relationship to myopia and perimetry. Am J Ophthalmol 2005;139(2):247–52. pmid:15733984
View Article
PubMed/NCBI
Google Scholar

[6] View Article

[7] PubMed/NCBI

[8] Google Scholar

[ref3] 3. Özdek SC, Önol M, Gürelik G, Hasanreisoglu B. Scanning laser polarimetry in normal subjects and patients with myopia. Br J Ophthalmol 2000;84(3):264–7. pmid:10684835
View Article
PubMed/NCBI
Google Scholar

[10] View Article

[11] PubMed/NCBI

[12] Google Scholar

[ref4] 4. Chan K, Lee TW, Sample PA, Goldbaum MH, Weinreb RN, Sejnowski TJ. Comparison of machine learning and traditional classifiers in glaucoma diagnosis. IEEE T Bio-med Eng 2002;49(9):963–74
View Article
Google Scholar

[14] View Article

[15] Google Scholar

[ref5] 5. Goldbaum MH, Sample PA, Chan K, Williams J, Lee TW, Blumenthal E, et al. Comparing machine learning classifiers for diagnosing glaucoma from standard automated perimetry. Invest Ophth Vis Sci 2002;43(1):162–9.
View Article
Google Scholar

[17] View Article

[18] Google Scholar

[ref6] 6. Bizios D, Heijl A, Hougaard JL, Bengtsson B. Machine learning classifiers for glaucoma diagnosis based on classification of retinal nerve fibre layer thickness parameters measured by Stratus OCT. Acta Ophthalmol 2010;88(1):44–52. pmid:20064122
View Article
PubMed/NCBI
Google Scholar

[20] View Article

[21] PubMed/NCBI

[22] Google Scholar

[ref7] 7. Barella KA, Costa VP, Gonçalves Vidotti V, Silva FR, Dias M, Gomi ES. Glaucoma diagnostic accuracy of machine learning classifiers using retinal nerve fiber layer and optic nerve data from SD-OCT. J Ophthalmol 2013;2013.
View Article
Google Scholar

[24] View Article

[25] Google Scholar

[ref8] 8. Silva FR, Vidotti VG, Cremasco F, Dias M, Gomi ES, Costa VP. Sensitivity and specificity of machine learning classifiers for glaucoma diagnosis using Spectral Domain OCT and standard automated perimetry. Arq Bras Oftalmol 2013;76(3):170–4. pmid:23929078
View Article
PubMed/NCBI
Google Scholar

[27] View Article

[28] PubMed/NCBI

[29] Google Scholar

[ref9] 9. Kotsiantis SB, Zaharakis I, Pintelas P. Supervised machine learning: A review of classification techniques. Emerging Artificial Intelligence Applications in Computer Engineering, Maglogiannis I. et al. (Eds.) IOS Press 2007; 3–24.

[ref10] 10. Quinlan JR. Induction of decision trees. Mach. Learn 1986;1(1):81–106.
View Article
Google Scholar

[32] View Article

[33] Google Scholar

[ref11] 11. Quinlan J. R. C4.5: Programs for Machine Learning. Morgan Kaufmann Publishers, 1993.

[ref12] 12. Caruana R. An Empirical Comparison of Supervised Learning Algorithms. Proceedings of the 23rd international conference on Machine learning. 2006 June 25–29; Pittsburgh USA; ACM; 2006. p.161-168.

[ref13] 13. Liu B, Long R, Chou K. iDHS-EL: identifying DNase I hypersensitive sites by fusing three different modes of pseudo nucleotide composition into an ensemble learning framework. Bioinformatics. 2016;32(16):2411–8. pmid:27153623
View Article
PubMed/NCBI
Google Scholar

[37] View Article

[38] PubMed/NCBI

[39] Google Scholar

[ref14] 14. Liu B, Wang S, Long R, Chou K. iRSpot-EL: identify recombination spots with an ensemble learning approach. Bioinformatics. 2017;33(1):35–41. pmid:27531102
View Article
PubMed/NCBI
Google Scholar

[41] View Article

[42] PubMed/NCBI

[43] Google Scholar

[ref15] 15. Åsman P, Heijl A. Glaucoma hemifield test: automated visual field evaluation. Arch Ophthalmol 1992;110(6):812–9. pmid:1596230
View Article
PubMed/NCBI
Google Scholar

[45] View Article

[46] PubMed/NCBI

[47] Google Scholar

[ref16] 16. Lalkhen AG, McCluskey A. Clinical tests: sensitivity and specificity. Continuing Education in Anaesthesia, Critical Care & Pain 2008;8(6):221–3.
View Article
Google Scholar

[49] View Article

[50] Google Scholar

[ref17] 17. Šimundić A-M. Measures of diagnostic accuracy: basic definitions. Med Biol Sci 2008;22(4):61–5.
View Article
Google Scholar

[52] View Article

[53] Google Scholar

[ref18] 18. Leung CK-S, Mohamed S, Leung KS, Cheung CYL, Chan SLW, Cheng DKY, et al. Retinal nerve fiber layer measurements in myopia: an optical coherence tomography study. Invest Ophthalmol Vis Sci 2006;47(12):5171–6. pmid:17122099
View Article
PubMed/NCBI
Google Scholar

[55] View Article

[56] PubMed/NCBI

[57] Google Scholar

[ref19] 19. Rauscher FM, Sekhon N, Feuer WJ, Budenz DL. Myopia affects retinal nerve fiber layer measurements as determined by optical coherence tomography. J Glaucoma 2009;18(7):501. pmid:19745664
View Article
PubMed/NCBI
Google Scholar

[59] View Article

[60] PubMed/NCBI

[61] Google Scholar

[ref20] 20. Kang SH, Hong SW, Im SK, Lee SH, Ahn MD. Effect of myopia on the thickness of the retinal nerve fiber layer measured by Cirrus HD optical coherence tomography. Invest Ophthalmol Vis Sci 2010;51(8):4075–83. pmid:20237247
View Article
PubMed/NCBI
Google Scholar

[63] View Article

[64] PubMed/NCBI

[65] Google Scholar

[ref21] 21. Seong M, Sung KR, Choi EH, Kang SY, Cho JW, Um TW, et al. Macular and peripapillary retinal nerve fiber layer measurements by spectral domain optical coherence tomography in normal-tension glaucoma. Invest Ophthalmol Vis Sci 2010;51(3):1446–52. pmid:19834029
View Article
PubMed/NCBI
Google Scholar

[67] View Article

[68] PubMed/NCBI

[69] Google Scholar

[ref22] 22. Choi YJ, Jeoung JW, Park KH, Kim DM. Glaucoma Detection Ability of Ganglion Cell-Inner Plexiform Layer Thickness by Spectral-Domain Optical Coherence Tomography in High MyopiaGanglion Cell-Inner Plexiform Layer in High Myopia. Invest Ophthalmol Vis Sci 2013;54(3):2296–304. pmid:23462754
View Article
PubMed/NCBI
Google Scholar

[71] View Article

[72] PubMed/NCBI

[73] Google Scholar

[ref23] 23. Seol BR, Jeoung JW, Park KH. Glaucoma Detection Ability of Macular Ganglion Cell-Inner Plexiform Layer Thickness in Myopic Preperimetric GlaucomaGlaucoma Detection Ability in Myopic Glaucoma. Invest Ophthalmol Vis Sci 2015;56(13):8306–13. pmid:26720484
View Article
PubMed/NCBI
Google Scholar

[75] View Article

[76] PubMed/NCBI

[77] Google Scholar

[ref24] 24. Liu B,Liu F, Wang X, Chen J, Fang L, Chou KC. Pse-in-One: a web server for generating various modes of pseudo components of DNA, RNA, and protein sequences. Nucleic Acids Res 2015;43 (W1): W65–W71. pmid:25958395
View Article
PubMed/NCBI
Google Scholar

[79] View Article

[80] PubMed/NCBI

[81] Google Scholar

[ref25] 25. Dietterich TG. Ensemble learning. The handbook of brain theory and neural networks. Arbib MA; 2002.

Figures

Abstract

Introduction

Materials and methods

Procedure

Participants

Feature selection and dataset preparation

Learning model evaluation criteria

Results

Classification test

Decision tree of C5.0

Discussion

Acknowledgments

Author Contributions

References