Skip to main content
Erschienen in: Journal of Digital Imaging 5/2019

18.10.2018

Comprehensive Word-Level Classification of Screening Mammography Reports Using a Neural Network Sequence Labeling Approach

verfasst von: Ryan G. Short, John Bralich, Dave Bogaty, Nicholas T. Befera

Erschienen in: Journal of Imaging Informatics in Medicine | Ausgabe 5/2019

Einloggen, um Zugang zu erhalten

Abstract

Radiology reports contain a large amount of potentially valuable unstructured data. Recently, neural networks have been employed to perform classification of radiology reports over a few classes at the document level. The success of neural networks in sequence-labeling problems such as named entity recognition and part of speech tagging suggests that they could be used to classify radiology report text with greater granularity. We employed a neural network architecture to comprehensively classify mammography report text at the word level using a sequence labeling approach. Two radiologists devised a comprehensive classification system for screening mammography reports. Each word in each report was manually categorized by a radiologist into one of 33 categories according to the classification system. Tagged words referencing the same finding were grouped into unique sets. We pre-labeled reports with a rule-based algorithm and then manually edited these annotations for 6705 screening mammography reports (25.1%, 66.8%, and 8.1% BI-RADS 0, 1, and 2, respectively). A combined convolutional and recurrent neural network model was used to label words in each sentence of the individual reports. A siamese recurrent neural network was then used to group findings into sets. Performance of the neural network-based method was compared to a rule-based algorithm and a conditional random field (CRF) model. Global accuracy (percentage of documents where all word tags were predicted correctly) and keyword accuracy (percentage of all words that were labeled correctly, excluding words tagged as unimportant) were calculated on an unseen 519 report test set. Two-tailed t tests were used to assess differences between algorithm performance, and p < 0.05 was used to determine statistical significance. The neural network-based approach showed significantly higher global accuracy compared to both the rule-based algorithm (88.3 vs 57.0%, p < 0.001) and the CRF model (88.3% vs. 75.8%, p < 0.001). The neural network also showed significantly higher keyword level accuracy compared to the rule-based algorithm (95.5% vs. 80.9% p < 0.001) and CRF model (95.5% vs. 76.9%, p < 0.001). We demonstrate the potential of neural networks to accurately perform word-level multilabel classification of free text radiology reports across 33 classes, thus showing the utility of a sequence labeling approach to NLP of radiology reports. We found that a neural network classifier outperforms a rule-based algorithm and a CRF classifier for comprehensive multilabel classification of free text screening mammography reports at the word level. By approaching radiology report classification as a sequence-labeling problem, we demonstrate the ability of neural networks to extract data from free text radiology reports at a level of granularity not previously reported.
Literatur
1.
2.
Zurück zum Zitat Bruno MA, Petscavage-Thomas JM, Mohr MJ, Bell SK, Brown SD: The “Open Letter”: Radiologists’ Reports in the Era of Patient Web Portals. J Am Coll Radiol 11:863–867, 2014CrossRefPubMed Bruno MA, Petscavage-Thomas JM, Mohr MJ, Bell SK, Brown SD: The “Open Letter”: Radiologists’ Reports in the Era of Patient Web Portals. J Am Coll Radiol 11:863–867, 2014CrossRefPubMed
3.
Zurück zum Zitat Cai T, Giannopoulos AA, Yu S, Kelil T, Ripley B, Kumamaru KK, Rybicki FJ, Mitsouras D: Natural Language Processing Technologies in Radiology Research and Clinical Applications. Radiographics 36:176–191, 2016CrossRefPubMedPubMedCentral Cai T, Giannopoulos AA, Yu S, Kelil T, Ripley B, Kumamaru KK, Rybicki FJ, Mitsouras D: Natural Language Processing Technologies in Radiology Research and Clinical Applications. Radiographics 36:176–191, 2016CrossRefPubMedPubMedCentral
4.
Zurück zum Zitat Pons E, Braun LMM, Hunink MGM, Kors JA: Natural Language Processing in Radiology: A Systematic Review. Radiology 279:329–343, 2016CrossRefPubMed Pons E, Braun LMM, Hunink MGM, Kors JA: Natural Language Processing in Radiology: A Systematic Review. Radiology 279:329–343, 2016CrossRefPubMed
5.
Zurück zum Zitat Hassanpour S, Bay G, Langlotz CP: Characterization of Change and Significance for Clinical Findings in Radiology Reports Through Natural Language Processing. J Digit Imaging 30:314–322, 2017CrossRefPubMedPubMedCentral Hassanpour S, Bay G, Langlotz CP: Characterization of Change and Significance for Clinical Findings in Radiology Reports Through Natural Language Processing. J Digit Imaging 30:314–322, 2017CrossRefPubMedPubMedCentral
6.
Zurück zum Zitat Masino AJ, Grundmeier RW, Pennington JW, Germiller JA, Crenshaw, 3rd. EB: Temporal bone radiology report classification using open source machine learning and natural langue processing libraries. BMC Med Inform Decis Mak 16:65, 2016CrossRefPubMedPubMedCentral Masino AJ, Grundmeier RW, Pennington JW, Germiller JA, Crenshaw, 3rd. EB: Temporal bone radiology report classification using open source machine learning and natural langue processing libraries. BMC Med Inform Decis Mak 16:65, 2016CrossRefPubMedPubMedCentral
8.
Zurück zum Zitat Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J: Distributed representations of words and phrases and their compositionality. Adv Neural Inf Process Syst:3111–3119, 2013 Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J: Distributed representations of words and phrases and their compositionality. Adv Neural Inf Process Syst:3111–3119, 2013
10.
Zurück zum Zitat Chokshi F, Shin B, Lee T, Lemmon A, Necessary S, Choi J. Natural Langeuage Processing for Classification of Acute, Communicable Findings on Unstructured Head CT Reports: Comparison of Neural Network and Non-Neural Machine Learning Techniques. bioRxiv 2017:173310. https://doi.org/10.1101/173310 . Chokshi F, Shin B, Lee T, Lemmon A, Necessary S, Choi J. Natural Langeuage Processing for Classification of Acute, Communicable Findings on Unstructured Head CT Reports: Comparison of Neural Network and Non-Neural Machine Learning Techniques. bioRxiv 2017:173310. https://​doi.​org/​10.​1101/​173310 .​
11.
Zurück zum Zitat Chen MC, Ball RL, Yang L, Moradzadeh N, Chapman BE, Larson DB et al.: Deep Learning to Classify Radiology Free-Text Reports. Radiology 171115, 2017 Chen MC, Ball RL, Yang L, Moradzadeh N, Chapman BE, Larson DB et al.: Deep Learning to Classify Radiology Free-Text Reports. Radiology 171115, 2017
12.
Zurück zum Zitat Banerjee I, Chen MC, Lungren MP, Rubin DL: Radiology report annotation using intelligent word embeddings: Applied to multi-institutional chest CT cohort. J Biomed Inform 77:11–20, 2018CrossRefPubMed Banerjee I, Chen MC, Lungren MP, Rubin DL: Radiology report annotation using intelligent word embeddings: Applied to multi-institutional chest CT cohort. J Biomed Inform 77:11–20, 2018CrossRefPubMed
13.
Zurück zum Zitat Wang P, Qian Y, Soong FK, He L, Zhao H. Part-of-Speech Tagging with Bidirectional Long Short-Term Memory Recurrent Neural Network. arXiv [csCL] 2015. Wang P, Qian Y, Soong FK, He L, Zhao H. Part-of-Speech Tagging with Bidirectional Long Short-Term Memory Recurrent Neural Network. arXiv [csCL] 2015.
14.
Zurück zum Zitat Dernoncourt F, Lee JY, Szolovits P. NeuroNER: an easy-to-use program for named-entity recognition based on neural networks. Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, 2017. https://doi.org/10.18653/v1/d17-2017 . Dernoncourt F, Lee JY, Szolovits P. NeuroNER: an easy-to-use program for named-entity recognition based on neural networks. Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, 2017. https://​doi.​org/​10.​18653/​v1/​d17-2017 .​
15.
Zurück zum Zitat Cornegruta S, Bakewell R, Withey S, Montana G. Modelling Radiological Language with Bidirectional Long Short-Term Memory Networks. arXiv [csCL] 2016. Cornegruta S, Bakewell R, Withey S, Montana G. Modelling Radiological Language with Bidirectional Long Short-Term Memory Networks. arXiv [csCL] 2016.
16.
Zurück zum Zitat Short RG, Middleton D, Befera NT, Gondalia R, Tailor TD: Patient-Centered Radiology Reporting: Using Online Crowdsourcing to Assess the Effectiveness of a Web-Based Interactive Radiology Report. J Am Coll Radiol 14:1489–1497, 2017CrossRefPubMed Short RG, Middleton D, Befera NT, Gondalia R, Tailor TD: Patient-Centered Radiology Reporting: Using Online Crowdsourcing to Assess the Effectiveness of a Web-Based Interactive Radiology Report. J Am Coll Radiol 14:1489–1497, 2017CrossRefPubMed
17.
Zurück zum Zitat Gale W, Oakden-Rayner L, Carneiro G, Bradley AP, Palmer LJ. Detecting hip fractures with radiologist-level performance using deep neural networks. arXiv [csCV] 2017. Gale W, Oakden-Rayner L, Carneiro G, Bradley AP, Palmer LJ. Detecting hip fractures with radiologist-level performance using deep neural networks. arXiv [csCV] 2017.
18.
Zurück zum Zitat Bird S, Klein E, Loper E. Natural Language Processing with Python. “O’Reilly Media, Inc.”; 2009. Bird S, Klein E, Loper E. Natural Language Processing with Python. “O’Reilly Media, Inc.”; 2009.
21.
Zurück zum Zitat American College of Radiology. ACR BI-RADS atlas: Breast Imaging Reporting and Data System ; Mammography, Ultrasound, Magnetic Resonance Imaging, Follow-up and Outcome Monitoring, Data Dictionary. 2013. American College of Radiology. ACR BI-RADS atlas: Breast Imaging Reporting and Data System ; Mammography, Ultrasound, Magnetic Resonance Imaging, Follow-up and Outcome Monitoring, Data Dictionary. 2013.
22.
Zurück zum Zitat Lafferty JD, Mc Callum A, Pereira FCN. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data. Proceedings of the Eighteenth International Conference on Machine Learning, Morgan Kaufmann Publishers Inc.; 2001, p. 282–9. Lafferty JD, Mc Callum A, Pereira FCN. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data. Proceedings of the Eighteenth International Conference on Machine Learning, Morgan Kaufmann Publishers Inc.; 2001, p. 282–9.
23.
Zurück zum Zitat Finkel JR, Grenager T, Manning C: Incorporating Non-local Information into Information Extraction Systems by Gibbs Sampling. In: Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics. Stroudsburg: Association for Computational Linguistics, 2005, pp. 363–370 Finkel JR, Grenager T, Manning C: Incorporating Non-local Information into Information Extraction Systems by Gibbs Sampling. In: Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics. Stroudsburg: Association for Computational Linguistics, 2005, pp. 363–370
24.
Zurück zum Zitat Silfverberg M, Ruokolainen T, Linden K, Kurimo M: Others. Part-of-speech tagging using conditional random fields: Exploiting sub-label dependencies for improved accuracy, 2014 Silfverberg M, Ruokolainen T, Linden K, Kurimo M: Others. Part-of-speech tagging using conditional random fields: Exploiting sub-label dependencies for improved accuracy, 2014
27.
Zurück zum Zitat Huang Z, Xu W, Yu K. Bidirectional LSTM-CRF Models for Sequence Tagging. arXiv [csCL] 2015. Huang Z, Xu W, Yu K. Bidirectional LSTM-CRF Models for Sequence Tagging. arXiv [csCL] 2015.
28.
Zurück zum Zitat Řehůřek R, Sojka P: Software Framework for Topic Modelling with Large Corpora. Proceedings of the LREC 2010 Workshop on New Challenges for NLP Frameworks. Valletta, ELRA, 2010, pp. 45–50 Řehůřek R, Sojka P: Software Framework for Topic Modelling with Large Corpora. Proceedings of the LREC 2010 Workshop on New Challenges for NLP Frameworks. Valletta, ELRA, 2010, pp. 45–50
30.
Zurück zum Zitat Cho K, van Merrienboer B, Bahdanau D, Bengio Y. On the Properties of Neural Machine Translation: Encoder-Decoder Approaches. arXiv [csCL] 2014. Cho K, van Merrienboer B, Bahdanau D, Bengio Y. On the Properties of Neural Machine Translation: Encoder-Decoder Approaches. arXiv [csCL] 2014.
31.
Zurück zum Zitat Mueller J, Thyagarajan A. Siamese Recurrent Architectures for Learning Sentence Similarity. AAAI 2016. Mueller J, Thyagarajan A. Siamese Recurrent Architectures for Learning Sentence Similarity. AAAI 2016.
32.
Zurück zum Zitat Kingma DP, Ba J. Adam: A Method for Stochastic Optimization. arXiv [csLG] 2014. Kingma DP, Ba J. Adam: A Method for Stochastic Optimization. arXiv [csLG] 2014.
33.
Zurück zum Zitat Liu F, Baldwin T, Cohn T. Capturing Long-range Contextual Dependencies with Memory-enhanced Conditional Random Fields. arXiv [csCL] 2017. Liu F, Baldwin T, Cohn T. Capturing Long-range Contextual Dependencies with Memory-enhanced Conditional Random Fields. arXiv [csCL] 2017.
Metadaten
Titel
Comprehensive Word-Level Classification of Screening Mammography Reports Using a Neural Network Sequence Labeling Approach
verfasst von
Ryan G. Short
John Bralich
Dave Bogaty
Nicholas T. Befera
Publikationsdatum
18.10.2018
Verlag
Springer International Publishing
Erschienen in
Journal of Imaging Informatics in Medicine / Ausgabe 5/2019
Print ISSN: 2948-2925
Elektronische ISSN: 2948-2933
DOI
https://doi.org/10.1007/s10278-018-0141-4

Weitere Artikel der Ausgabe 5/2019

Journal of Digital Imaging 5/2019 Zur Ausgabe

Update Radiologie

Bestellen Sie unseren Fach-Newsletter und bleiben Sie gut informiert.