Skip to main content

01.03.2018 | Research | Sonderheft 1/2018 Open Access

BMC Medical Informatics and Decision Making 1/2018

Leveraging text skeleton for de-identification of electronic medical records

BMC Medical Informatics and Decision Making > Sonderheft 1/2018
Yue-Shu Zhao, Kun-Li Zhang, Hong-Chao Ma, Kun Li



De-identification is the first step to use these records for data processing or further medical investigations in electronic medical records. Consequently, a reliable automated de-identification system would be of high value.


In this paper, a method of combining text skeleton and recurrent neural network is proposed to solve the problem of de-identification. Text skeleton is the general structure of a medical record, which can help neural networks to learn better.


We evaluated our method on three datasets involving two English datasets from i2b2 de-identification challenge and a Chinese dataset we annotated. Empirical results show that the text skeleton based method we proposed can help the network to recognize protected health information.


The comparison between our method and state-of-the-art frameworks indicates that our method achieves high performance on the problem of medical record de-identification.
Über diesen Artikel

Weitere Artikel der Sonderheft 1/2018

BMC Medical Informatics and Decision Making 1/2018 Zur Ausgabe