Skip to main content
Erschienen in: Journal of Digital Imaging 3/2020

10.12.2019 | Original Paper

Automated Misspelling Detection and Correction in Persian Clinical Text

verfasst von: Azita Yazdani, Marjan Ghazisaeedi, Nasrin Ahmadinejad, Masoumeh Giti, Habibe Amjadi, Azin Nahvijou

Erschienen in: Journal of Imaging Informatics in Medicine | Ausgabe 3/2020

Einloggen, um Zugang zu erhalten

Abstract

Accurate electronic health records are important for clinical care, research, and patient safety assurance. Correction of misspelled words is required to ensure the correct interpretation of medical records. In the Persian language, the lack of automated misspelling detection and correction system is evident in the medicine and health care. In this article, we describe the development of an automated misspelling detection and correction system for radiology and ultrasound’s free texts in the Persian language. To achieve our goal, we used n-gram language model and three different types of free texts related to abdominal and pelvic ultrasound, head and neck ultrasound, and breast ultrasound reports. Our system achieved the detection performance of up to 90.29% for radiology and ultrasound’s free texts with the correction accuracy of 88.56%. Results indicated that high-quality spelling correction is possible in clinical reports. The system also achieved significant savings during the documentation process and final approval of the reports in the imaging department.
Literatur
1.
Zurück zum Zitat Holzinger, A., et al., Biomedical text mining: state-of-the-art, open problems and future challenges, in Interactive knowledge discovery and data mining in biomedical informatics. 2014, Springer. p. 271–300. Holzinger, A., et al., Biomedical text mining: state-of-the-art, open problems and future challenges, in Interactive knowledge discovery and data mining in biomedical informatics. 2014, Springer. p. 271–300.
2.
Zurück zum Zitat Wong W, Glance D: Statistical semantic and clinician confidence analysis for correcting abbreviations and spelling errors in clinical progress notes. Artificial intelligence in medicine 53(3):171–180, 2011CrossRef Wong W, Glance D: Statistical semantic and clinician confidence analysis for correcting abbreviations and spelling errors in clinical progress notes. Artificial intelligence in medicine 53(3):171–180, 2011CrossRef
3.
Zurück zum Zitat Zhou L et al.: Analysis of errors in dictated clinical documents assisted by speech recognition software and professional transcriptionists. JAMA Network Open 1(3):e180530–e180530, 2018CrossRef Zhou L et al.: Analysis of errors in dictated clinical documents assisted by speech recognition software and professional transcriptionists. JAMA Network Open 1(3):e180530–e180530, 2018CrossRef
4.
Zurück zum Zitat Turchin, A., et al. Identification of misspelled words without a comprehensive dictionary using prevalence analysis. in AMIA Annual Symposium Proceedings. 2007. American Medical Informatics Association. Turchin, A., et al. Identification of misspelled words without a comprehensive dictionary using prevalence analysis. in AMIA Annual Symposium Proceedings. 2007. American Medical Informatics Association.
5.
Zurück zum Zitat Dalianis, H., Clinical Text Mining: Secondary Use of Electronic Patient Records. 2018: Springer. Dalianis, H., Clinical Text Mining: Secondary Use of Electronic Patient Records. 2018: Springer.
6.
Zurück zum Zitat Dalianis, H., Clinical text retrieval-an overview of basic building blocks and applications, in Professional Search in the Modern World. 2014, Springer. p. 147–165. Dalianis, H., Clinical text retrieval-an overview of basic building blocks and applications, in Professional Search in the Modern World. 2014, Springer. p. 147–165.
7.
Zurück zum Zitat Ringler MD, Goss BC, Bartholmai BJ: Syntactic and semantic errors in radiology reports associated with speech recognition software. Health informatics journal 23(1):3–13, 2017CrossRef Ringler MD, Goss BC, Bartholmai BJ: Syntactic and semantic errors in radiology reports associated with speech recognition software. Health informatics journal 23(1):3–13, 2017CrossRef
8.
Zurück zum Zitat Zech, J., et al., Detecting insertion, substitution, and deletion errors in radiology reports using neural sequence-to-sequence models. Annals of Translational Medicine, 2018. Zech, J., et al., Detecting insertion, substitution, and deletion errors in radiology reports using neural sequence-to-sequence models. Annals of Translational Medicine, 2018.
9.
Zurück zum Zitat Zhang, Y. Contextualizing consumer health information searching: an analysis of questions in a social Q&A community. in Proceedings of the 1st ACM International Health Informatics Symposium. 2010. ACM. Zhang, Y. Contextualizing consumer health information searching: an analysis of questions in a social Q&A community. in Proceedings of the 1st ACM International Health Informatics Symposium. 2010. ACM.
10.
Zurück zum Zitat Golkar, A., et al. Improve word sense disambiguation by proposing a pruning method for optimizing conceptual density's contexts. in Artificial Intelligence and Signal Processing (AISP), 2015 International Symposium on. 2015. IEEE. Golkar, A., et al. Improve word sense disambiguation by proposing a pruning method for optimizing conceptual density's contexts. in Artificial Intelligence and Signal Processing (AISP), 2015 International Symposium on. 2015. IEEE.
11.
Zurück zum Zitat Sarker, A. and G. Gonzalez-Hernandez, An unsupervised and customizable misspelling generator for mining noisy health-related text sources. arXiv preprint arXiv:1806.00910, 2018. Sarker, A. and G. Gonzalez-Hernandez, An unsupervised and customizable misspelling generator for mining noisy health-related text sources. arXiv preprint arXiv:1806.00910, 2018.
12.
Zurück zum Zitat Nizamuddin, U. and H. Dalianis. Detection of spelling errors in Swedish clinical text. in 1st Nordic workshop on evaluation of spellchecking and proofing tools (NorWEST2014), SLTC 2014, Uppsala. 2014. Nizamuddin, U. and H. Dalianis. Detection of spelling errors in Swedish clinical text. in 1st Nordic workshop on evaluation of spellchecking and proofing tools (NorWEST2014), SLTC 2014, Uppsala. 2014.
13.
Zurück zum Zitat Dalianis, H., Characteristics of Patient Records and Clinical Corpora, in Clinical Text Mining. 2018, Springer. p. 21–34. Dalianis, H., Characteristics of Patient Records and Clinical Corpora, in Clinical Text Mining. 2018, Springer. p. 21–34.
14.
Zurück zum Zitat Hussain, F. and U. Qamar. Identification and Correction of Misspelled Drugs Names in Electronic Medical Records (EMR). in ICEIS (2). 2016. Hussain, F. and U. Qamar. Identification and Correction of Misspelled Drugs Names in Electronic Medical Records (EMR). in ICEIS (2). 2016.
15.
Zurück zum Zitat Kilicoglu H et al.: An ensemble method for spelling correction in consumer health questions. in AMIA Annual Symposium Proceedings. 2015. American Medical Informatics Association. Kilicoglu H et al.: An ensemble method for spelling correction in consumer health questions. in AMIA Annual Symposium Proceedings. 2015. American Medical Informatics Association.
16.
Zurück zum Zitat Zhou, X., et al., Context-sensitive spelling correction of consumer-generated content on health care. JMIR medical informatics, 2015. 3(3). Zhou, X., et al., Context-sensitive spelling correction of consumer-generated content on health care. JMIR medical informatics, 2015. 3(3).
17.
Zurück zum Zitat Ruch P, Baud R, Geissbühler A: Using lexical disambiguation and named-entity recognition to improve spelling correction in the electronic patient record. Artificial intelligence in medicine 29(1–2):169–184, 2003CrossRef Ruch P, Baud R, Geissbühler A: Using lexical disambiguation and named-entity recognition to improve spelling correction in the electronic patient record. Artificial intelligence in medicine 29(1–2):169–184, 2003CrossRef
18.
Zurück zum Zitat Siklósi, B., A. Novák, and G. Prószéky. Context-aware correction of spelling errors in Hungarian medical documents. in International Conference on Statistical Language and Speech Processing. 2013. Springer. Siklósi, B., A. Novák, and G. Prószéky. Context-aware correction of spelling errors in Hungarian medical documents. in International Conference on Statistical Language and Speech Processing. 2013. Springer.
19.
Zurück zum Zitat Grigonyté, G., et al. Improving readability of Swedish electronic health records through lexical simplification: First results. in European Chapter of ACL (EACL), 26-30 April, 2014, Gothenburg, Sweden. 2014. Association for Computational Linguistics. Grigonyté, G., et al. Improving readability of Swedish electronic health records through lexical simplification: First results. in European Chapter of ACL (EACL), 26-30 April, 2014, Gothenburg, Sweden. 2014. Association for Computational Linguistics.
20.
Zurück zum Zitat Tolentino HD et al.: A UMLS-based spell checker for natural language processing in vaccine safety. BMC medical informatics and decision making 7(1):3, 2007CrossRef Tolentino HD et al.: A UMLS-based spell checker for natural language processing in vaccine safety. BMC medical informatics and decision making 7(1):3, 2007CrossRef
21.
Zurück zum Zitat Doan S, Bastarache L, Klimkowski S, Denny JC, Xu H: Integrating existing natural language processing tools for medication extraction from discharge summaries. Journal of the American Medical Informatics Association 17(5):528–531, 2010CrossRef Doan S, Bastarache L, Klimkowski S, Denny JC, Xu H: Integrating existing natural language processing tools for medication extraction from discharge summaries. Journal of the American Medical Informatics Association 17(5):528–531, 2010CrossRef
22.
Zurück zum Zitat Lai KH, Topaz M, Goss FR, Zhou L: Automated misspelling detection and correction in clinical free-text records. Journal of biomedical informatics 55:188–195, 2015CrossRef Lai KH, Topaz M, Goss FR, Zhou L: Automated misspelling detection and correction in clinical free-text records. Journal of biomedical informatics 55:188–195, 2015CrossRef
23.
Zurück zum Zitat Fivez, P., S. Šuster, and W. Daelemans, Unsupervised Context-Sensitive Spelling Correction of English and Dutch Clinical Free-Text with Word and Character N-Gram Embeddings. arXiv preprint arXiv:1710.07045, 2017. Fivez, P., S. Šuster, and W. Daelemans, Unsupervised Context-Sensitive Spelling Correction of English and Dutch Clinical Free-Text with Word and Character N-Gram Embeddings. arXiv preprint arXiv:1710.07045, 2017.
24.
Zurück zum Zitat Pérez A, Atutxa A, Casillas A, Gojenola K, Sellart Á: Inferred joint multigram models for medical term normalization according to ICD. International journal of medical informatics 110:111–117, 2018CrossRef Pérez A, Atutxa A, Casillas A, Gojenola K, Sellart Á: Inferred joint multigram models for medical term normalization according to ICD. International journal of medical informatics 110:111–117, 2018CrossRef
25.
Zurück zum Zitat D’hondt, E., C. Grouin, and B. Grau. Low-resource OCR error detection and correction in French Clinical Texts. in Proceedings of the Seventh International Workshop on Health Text Mining and Information Analysis. 2016. D’hondt, E., C. Grouin, and B. Grau. Low-resource OCR error detection and correction in French Clinical Texts. in Proceedings of the Seventh International Workshop on Health Text Mining and Information Analysis. 2016.
26.
Zurück zum Zitat Faili H et al.: Vafa spell-checker for detecting spelling, grammatical, and real-word errors of Persian language. Literary and Linguistic Computing 31(1):95–117, 2014CrossRef Faili H et al.: Vafa spell-checker for detecting spelling, grammatical, and real-word errors of Persian language. Literary and Linguistic Computing 31(1):95–117, 2014CrossRef
27.
Zurück zum Zitat Dowsett, D., Radiological sciences dictionary: keywords, names and definitions. 2009: CRC Press. Dowsett, D., Radiological sciences dictionary: keywords, names and definitions. 2009: CRC Press.
28.
Zurück zum Zitat Damerau FJ: A technique for computer detection and correction of spelling errors. Communications of the ACM 7(3):171–176, 1964CrossRef Damerau FJ: A technique for computer detection and correction of spelling errors. Communications of the ACM 7(3):171–176, 1964CrossRef
29.
Zurück zum Zitat Yazdani A, Safdari R, Golkar A, R Niakan Kalhori S: Words prediction based on N-gram model for free-text entry in electronic health records. Health information science and systems 7(1):6, 2019CrossRef Yazdani A, Safdari R, Golkar A, R Niakan Kalhori S: Words prediction based on N-gram model for free-text entry in electronic health records. Health information science and systems 7(1):6, 2019CrossRef
30.
Zurück zum Zitat Brown PF et al.: Class-based n-gram models of natural language. Computational linguistics 18(4):467–479, 1992 Brown PF et al.: Class-based n-gram models of natural language. Computational linguistics 18(4):467–479, 1992
31.
Zurück zum Zitat Minn MJ, Zandieh AR, Filice RW: Improving radiology report quality by rapidly notifying radiologist of report errors. Journal of digital imaging 28(4):492–498, 2015CrossRef Minn MJ, Zandieh AR, Filice RW: Improving radiology report quality by rapidly notifying radiologist of report errors. Journal of digital imaging 28(4):492–498, 2015CrossRef
32.
Zurück zum Zitat Dashti SM: Real-word error correction with trigrams: correcting multiple errors in a sentence. Language Resources and Evaluation. 52(2):485–502, 2018 Jun 1CrossRef Dashti SM: Real-word error correction with trigrams: correcting multiple errors in a sentence. Language Resources and Evaluation. 52(2):485–502, 2018 Jun 1CrossRef
33.
Zurück zum Zitat Kruskal JB, Reedy A, Pascal L, Rosen MP, Boiselle PM: Quality initiatives: lean approach to improving performance and efficiency in a radiology department. Radiographics. 32(2):573–587, 2012 Mar 5CrossRef Kruskal JB, Reedy A, Pascal L, Rosen MP, Boiselle PM: Quality initiatives: lean approach to improving performance and efficiency in a radiology department. Radiographics. 32(2):573–587, 2012 Mar 5CrossRef
Metadaten
Titel
Automated Misspelling Detection and Correction in Persian Clinical Text
verfasst von
Azita Yazdani
Marjan Ghazisaeedi
Nasrin Ahmadinejad
Masoumeh Giti
Habibe Amjadi
Azin Nahvijou
Publikationsdatum
10.12.2019
Verlag
Springer International Publishing
Erschienen in
Journal of Imaging Informatics in Medicine / Ausgabe 3/2020
Print ISSN: 2948-2925
Elektronische ISSN: 2948-2933
DOI
https://doi.org/10.1007/s10278-019-00296-y

Weitere Artikel der Ausgabe 3/2020

Journal of Digital Imaging 3/2020 Zur Ausgabe

Update Radiologie

Bestellen Sie unseren Fach-Newsletter und bleiben Sie gut informiert.