Machine learning for improved pathological staging of prostate cancer: A performance comparison on a range of classifiers

https://doi.org/10.1016/j.artmed.2011.11.003Get rights and content

Abstract

Objectives

Prediction of prostate cancer pathological stage is an essential step in a patient's pathway. It determines the treatment that will be applied further. In current practice, urologists use the pathological stage predictions provided in Partin tables to support their decisions. However, Partin tables are based on logistic regression (LR) and built from US data. Our objective is to investigate a range of both predictive methods and of predictive variables for pathological stage prediction and assess them with respect to their predictive quality based on UK data.

Methods and material

The latest version of Partin tables was applied to a large scale British dataset in order to measure their performances by mean of concordance index (c-index). The data was collected by the British Association of Urological Surgeons (BAUS) and gathered records from over 1700 patients treated with prostatectomy in 57 centers across UK. The original methodology was replicated using the BAUS dataset and evaluated using concordance index. In addition, a selection of classifiers, including, among others, LR, artificial neural networks and Bayesian networks (BNs) was applied to the same data and compared with each other using the area under the ROC curve (AUC). Subsets of the data were created in order to observe how classifiers perform with the inclusion of extra variables. Finally a local dataset prepared by the Aberdeen Royal Infirmary was used to study the effect on predictive performance of using different variables.

Results

Partin tables have low predictive quality (c-index = 0.602) when applied on UK data for comparison on patients with organ confined and extra prostatic extension conditions, patients at the two most frequently observed pathological stages. The use of replicate lookup tables built from British data shows an improvement in the classification, but the overall predictive quality remains low (c-index = 0.610).

Comparing a range of classifiers shows that BNs generally outperform other methods. Using the four variables from Partin tables, naive Bayes is the best classifier for the prediction of each class label (AUC = 0.662 for OC). When two additional variables are added, the results of LR (0.675), artificial neural networks (0.656) and BN methods (0.679) are overall improved. BNs show higher AUCs than the other methods when the number of variables raises

Conclusion

The predictive quality of Partin tables can be described as low to moderate on UK data. This means that following the predictions generated by Partin tables, many patients would received an inappropriate treatment, generally associated with a deterioration of their quality of life. In addition to demographic differences between UK and the original US population, the methodology and in particular LR present limitations. BN represents a promising alternative to LR from which prostate cancer staging can benefit. Heuristic search for structure learning and the inclusion of more variables are elements that further improve BN models quality.

Introduction

Cancer is a widely spread disease responsible for many deaths all over the world. In 2008, the World Health Organization estimated the number of new cancer cases in the world to be over 7.5 million [1]. Among all types of cancer, prostate cancer is the most frequent in men. In 2008, around 900 000 new cases of prostate cancer were diagnosed, and approximately 260 000 men died from it over the same period [1]. In Britain, the same source shows that 37 000 men were affected with new occurrence of prostate cancer, accounting for nearly a quarter of all male cancer diagnosed annually. It is also the second commonest cause of cancer death in men in the UK after lung cancer [1].

This paper considers the use of different machine learning techniques in order to improve the prediction of pathological stage in prostate cancer. A UK-wide dataset collected by the British Association of Urological Surgeons (BAUS) is used in order to build and assess predictive models. Results are compared with each other, but also against those of tools and methods currently in use clinically. Machine learning gathers a wide range of methods, particularly for classification purposes. Performance studies of such methods on different applications is an essential step towards the optimization of predictive tools. In [2], classifiers are compared with each other with respect to their performance on predicting different outcomes related to pancreatic cancer, including cancer staging. Similarly, in [3], classifiers were applied on breast cancer patient data to improve survivability prediction. In [2], models built using Bayesian techniques and logistic regression presented the best prediction for the different outcomes while decision trees performed best in [3]. This highlights the importance of comparing methods on different domain as the most adapted technique can vary across them.

Partin tables [4] are the most commonly used tool for prostate cancer staging. They were originally created using logistic regression (LR) [5] on a database gathering records of patients that were treated with radical prostatectomy in a single US institution [4]. Since then, the tables have been updated using different up-to-date datasets [6], [7], [8]. The revision takes into account changes in population demographics, advances in health technology and improved health care systems, but the tables are still based on the same fundamental LR-based methodology.

Partin tables are a well-established and most widely used pathological staging tool in the urological community worldwide. However, concerns have been raised regarding their validity on non-US populations [9], [10], [11], [12], [13], [14], [15], [16], [17], [18], [19]. In some instances, Partin tables were considered to be unsuitable for the target population because of limitations with respect to their predictive quality [9], [11], [16], [17], [18], [19]. The appropriateness of the methodology behind Partin tables, especially in regard to the choice of predictive variables and classifier, was not addressed in those studies. In addition, it is widely recognized that prostate cancer staging is associated with a high level of uncertainty. All these considerations are compelling clinicians to explore alternative means of generating predictive tools, especially those which apply machine-learning techniques which have the potential of improving the quality and accuracy of predictive performance [20].

In this paper, we assess the Partin tables on a British population. Replicate lookup tables, based on British data are built and analyzed following the original approach given in [8]. Additionally, we propose and compare alternative classification techniques, including Bayesian networks (BNs), which by simplifying the probability distribution, are recognized as a reference method to reason under uncertainty [21]. In addition, we consider more variables to include in the model. The paper is divided as follows. Section 2 presents important details related to the understanding of prostate cancer and its treatments. Section 3 describes the methodology of the study. It presents the different objectives and the data, provides technical background knowledge on classification and describes the experiments. Results are presented in Section 4 and discussed in Section 5. Finally, we describe our conclusions and introduce ideas for further work in Section 6.

Section snippets

Medical background

Cancer is a disease where malignant cells are developed and alter the function of their hosting organs or tissues. Typically, malignant cells reproduce and group together to form a tumor. Untreated tumors grow and affect surrounding healthy cells, leading to a spread of the cancer. Metastasis happens when the cancer reaches surrounding organs or tissues. The presence of cancer results in the deterioration of some body functions and can lead to death when vital organs are touched.

Whilst in the

Objectives

This work introduces three main objectives that present interests for both medicine and machine learning communities.

First, we aim to critically assess the methodology which was used to construct Partin tables. This involves externally validating the version currently being used by practitioners, that is, studying how well it performs on a population that presents different characteristics. Here, the tool is evaluated on a large British cohort and results are compared to those of its internal

External validation of Partin tables

Similarly to the internal validation of the Partin tables, c-index was calculated for the three non-OC pathological stage vs. OC. Results are presented in Table 6 and illustrate how good the different LR models are at distinguishing between patients with each combination of stages. Such values can be understood relative to the scale given in [49]. The scale defines three levels of predictive quality for a model according to its c-index. A model has low, moderate or high prognostic accuracy if

External validation of Partin tables

As shown in Table 6, original Partin tables achieve lower c-indices when applied to the BAUS data. This implies that when used on a UK population, Partin tables have a lower predictive quality than on the native data from which they were derived. With a c-index below 0.70 for OC vs. EPE, Partin tables can be considered as having poor predictive quality for patients falling in these two categories. We recall that patients with OC and EPE pathological stages are the most frequent cases in the

Conclusion and further work

In this paper, we have assessed one of the major tools used clinically for prostate cancer staging. The predictive quality of Partin tables is much lower when it is applied on a British population than it was when originally measured on US data. In addition, a replicate Partin study shows that new lookup tables also exhibit a low to moderate predictive quality. A range of alternative classifiers were selected and three datasets prepared in order to assess different aspects of the methodology,

Acknowledgments

This work was jointly funded by the Northern Research Partnership and NHS Grampian.

References (61)

  • GaoX. et al.

    The newer the better? Comparison of the 1997 and 2001 Partin tables for pathologic stage prediction of prostate cancer in China

    Urology

    (2008)
  • BhojaniN. et al.

    Partin tables cannot accurately predict the pathological stage at radical prostatectomy

    European Journal of Surgical Oncology

    (2009)
  • BhojaniN. et al.

    External validation of the updated partin tables in a cohort of French and Italian men

    International Journal of Radiation Oncology, Biology, Physics

    (2009)
  • BorqueA. et al.

    The use of neural networks and logistic regression analysis for predicting pathological stage in men undergoing radical prostatectomy: a population based study

    The Journal of Urology

    (2001)
  • AnagnostouT. et al.

    Artificial neural networks for decision-making in urologic oncology

    European Urology

    (2003)
  • KawakamiS. et al.

    Development, validation, and head-to-head comparison of logistic regression-based nomograms and artificial neural network models predicting prostate cancer on initial extended biopsy

    European Urology

    (2008)
  • MetzC.

    Basic principles of ROC analysis

    Seminars in Nuclear Medicine

    (1978)
  • AnastasiadisA. et al.

    MRI-guided biopsy of the prostate increases diagnostic performance in men with elevated or increasing PSA levels after previous negative TRUS biopsies

    European Urology

    (2006)
  • AntalP. et al.

    Using literature and data to learn Bayesian networks as clinical models of ovarian tumors

    Artificial Intelligence in Medicine

    (2004)
  • International Agency for Research on Cancer—GLOBOCAN. http://globocan.iarc.fr/; 2008 [accessed:...
  • HosmerD. et al.

    Applied logistic regression

    (2000)
  • PartinA. et al.

    Combination of prostate-specific antigen, clinical stage, and Gleason score to predict pathological stage of localized prostate cancer: a multi-institutional update

    Journal of the American Medical Association

    (1997)
  • KattanM. et al.

    Evaluation of a nomogram used to predict the pathologic stage of clinically localized prostate carcinoma

    Cancer

    (1997)
  • KarakiewiczP. et al.

    Validation of 1997 Partin tables lymph node invasion predictions in men treated with radical prostatectomy in Montreal Quebec

    The Canadian Journal of Urology

    (2005)
  • SongC. et al.

    Nomograms for the prediction of pathologic stage of clinically localized prostate cancer in Korean men

    Journal of Korean Medical Science

    (2005)
  • CruzJ. et al.

    Applications of machine learning in cancer prediction and prognosis

    Cancer Informatics

    (2006)
  • JensenF. et al.

    Bayesian networks and decision graphs

    (2007)
  • SobinL.

    TNM classification of malignant tumours

    (2009)
  • KirbyR.

    The prostate: small gland, big problem

    (2002)
  • ShaidaN. et al.

    Controversial topics in surgery. open versus laparoscopic radical prostatectomy: the case for open radical prostatectomy

    Annals of the Royal College of Surgeons of England

    (2007)
  • Cited by (0)

    View full text