Skip to main content
Erschienen in: Drug Safety 7/2017

01.07.2017 | Original Research Article

Using Probabilistic Record Linkage of Structured and Unstructured Data to Identify Duplicate Cases in Spontaneous Adverse Event Reporting Systems

verfasst von: Kory Kreimeyer, David Menschik, Scott Winiecki, Wendy Paul, Faith Barash, Emily Jane Woo, Meghna Alimchandani, Deepa Arya, Craig Zinderman, Richard Forshee, Taxiarchis Botsis

Erschienen in: Drug Safety | Ausgabe 7/2017

Einloggen, um Zugang zu erhalten

Abstract

Introduction

Duplicate case reports in spontaneous adverse event reporting systems pose a challenge for medical reviewers to efficiently perform individual and aggregate safety analyses. Duplicate cases can bias data mining by generating spurious signals of disproportional reporting of product-adverse event pairs.

Objective

We have developed a probabilistic record linkage algorithm for identifying duplicate cases in the US Vaccine Adverse Event Reporting System (VAERS) and the US Food and Drug Administration Adverse Event Reporting System (FAERS).

Methods

In addition to using structured field data, the algorithm incorporates the non-structured narrative text of adverse event reports by examining clinical and temporal information extracted by the Event-based Text-mining of Health Electronic Records system, a natural language processing tool. The final component of the algorithm is a novel duplicate confidence value that is calculated by a rule-based empirical approach that looks for similarities in a number of criteria between two case reports.

Results

For VAERS, the algorithm identified 77% of known duplicate pairs with a precision (or positive predictive value) of 95%. For FAERS, it identified 13% of known duplicate pairs with a precision of 100%. The textual information did not improve the algorithm’s automated classification for VAERS or FAERS. The empirical duplicate confidence value increased performance on both VAERS and FAERS, mainly by reducing the occurrence of false-positives.

Conclusions

The algorithm was shown to be effective at identifying pre-linked duplicate VAERS reports. The narrative text was not shown to be a key component in the automated detection evaluation; however, it is essential for supporting the semi-automated approach that is likely to be deployed at the Food and Drug Administration, where medical reviewers will perform some manual review of the most highly ranked reports identified by the algorithm.
Anhänge
Nur mit Berechtigung zugänglich
Literatur
1.
Zurück zum Zitat World Health Organization. A guide to detecting and reporting adverse drug reactions. Geneva: World Health Organization; 2002: Contract No. WHO/EDM/QSM/2002.2. World Health Organization. A guide to detecting and reporting adverse drug reactions. Geneva: World Health Organization; 2002: Contract No. WHO/EDM/QSM/2002.2.
3.
Zurück zum Zitat Moore TJ, Furber CD, Mattison DR, et al. A critique of a key drug safety reporting system. QuarterWatch. Horsham, PA: Institute for Safe Medication Practices; 2015. Moore TJ, Furber CD, Mattison DR, et al. A critique of a key drug safety reporting system. QuarterWatch. Horsham, PA: Institute for Safe Medication Practices; 2015.
4.
Zurück zum Zitat Varricchio F, Iskander J, Destefano F, et al. Understanding vaccine safety information from the Vaccine Adverse Event Reporting System. Pediatr Infect Dis J. 2004;23(4):287–94.CrossRefPubMed Varricchio F, Iskander J, Destefano F, et al. Understanding vaccine safety information from the Vaccine Adverse Event Reporting System. Pediatr Infect Dis J. 2004;23(4):287–94.CrossRefPubMed
6.
Zurück zum Zitat Guidance for industry: postmarketing safety reporting for human drug and biological products including vaccines. Rockville, MD: Food and Drug Administration; 2001. Guidance for industry: postmarketing safety reporting for human drug and biological products including vaccines. Rockville, MD: Food and Drug Administration; 2001.
8.
Zurück zum Zitat Poluzzi E, Raschi E, Piccinni C, De F. Data mining techniques in pharmacovigilance: analysis of the publicly accessible FDA Adverse Event Reporting System (AERS). In: Karahoca A, editor. Data mining applications in engineering and medicine. Rijeka, Croatia: InTech; 2012. Poluzzi E, Raschi E, Piccinni C, De F. Data mining techniques in pharmacovigilance: analysis of the publicly accessible FDA Adverse Event Reporting System (AERS). In: Karahoca A, editor. Data mining applications in engineering and medicine. Rijeka, Croatia: InTech; 2012.
9.
Zurück zum Zitat Committee for medicinal products for human use guideline on detection and management of duplicate individual cases and individual case safety reports (ICSRs). London: European Medicines Agency; 2012. Committee for medicinal products for human use guideline on detection and management of duplicate individual cases and individual case safety reports (ICSRs). London: European Medicines Agency; 2012.
11.
Zurück zum Zitat Baldwin E, Johnson K, Berthoud H, Dublin S. Linking mothers and infants within electronic health records: a comparison of deterministic and probabilistic algorithms. Pharmacoepidemiol Drug Saf. 2015;24(1):45–51. doi:10.1002/pds.3728.CrossRefPubMed Baldwin E, Johnson K, Berthoud H, Dublin S. Linking mothers and infants within electronic health records: a comparison of deterministic and probabilistic algorithms. Pharmacoepidemiol Drug Saf. 2015;24(1):45–51. doi:10.​1002/​pds.​3728.CrossRefPubMed
15.
Zurück zum Zitat DuVall SL, Fraser AM, Rowe K, et al. Evaluation of record linkage between a large healthcare provider and the Utah Population Database. J Am Med Inform Assoc. 2012;19(e1):e54.CrossRefPubMed DuVall SL, Fraser AM, Rowe K, et al. Evaluation of record linkage between a large healthcare provider and the Utah Population Database. J Am Med Inform Assoc. 2012;19(e1):e54.CrossRefPubMed
17.
Zurück zum Zitat Méray N, Reitsma JB, Ravelli ACJ, Bonsel GJ. Probabilistic record linkage is a valid and transparent tool to combine databases without a patient identification number. J Clin Epidemiol. 2007;60(9):883.e1–11. doi:10.1016/j.jclinepi.2006.11.021. Méray N, Reitsma JB, Ravelli ACJ, Bonsel GJ. Probabilistic record linkage is a valid and transparent tool to combine databases without a patient identification number. J Clin Epidemiol. 2007;60(9):883.e1–11. doi:10.​1016/​j.​jclinepi.​2006.​11.​021.
18.
Zurück zum Zitat Grannis SJ, Overhage JM, Hui S, McDonald CJ. Analysis of a probabilistic record linkage technique without human review. AMIA Annu Symp Proc. 2003;2003:259–63.PubMedCentral Grannis SJ, Overhage JM, Hui S, McDonald CJ. Analysis of a probabilistic record linkage technique without human review. AMIA Annu Symp Proc. 2003;2003:259–63.PubMedCentral
20.
Zurück zum Zitat Wang W, Kreimeyer K, Woo EJ, et al. A new algorithmic approach for the extraction of temporal associations from clinical narratives with an application to medical product safety surveillance reports. J Biomed Inform. 2016;62:78–89. doi:10.1016/j.jbi.2016.06.006.CrossRefPubMed Wang W, Kreimeyer K, Woo EJ, et al. A new algorithmic approach for the extraction of temporal associations from clinical narratives with an application to medical product safety surveillance reports. J Biomed Inform. 2016;62:78–89. doi:10.​1016/​j.​jbi.​2016.​06.​006.CrossRefPubMed
24.
Zurück zum Zitat van Rijsbergen CJ. Information retrieval. 2nd ed. Newton, MA: Butterworth-Heinemann; 1979. van Rijsbergen CJ. Information retrieval. 2nd ed. Newton, MA: Butterworth-Heinemann; 1979.
25.
Zurück zum Zitat Bilenko M, Mooney RJ. Adaptive duplicate detection using learnable string similarity measures. Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge discovery and data mining, Washington, DC, 2003, 956759 ACM, p. 39–48. Bilenko M, Mooney RJ. Adaptive duplicate detection using learnable string similarity measures. Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge discovery and data mining, Washington, DC, 2003, 956759 ACM, p. 39–48.
Metadaten
Titel
Using Probabilistic Record Linkage of Structured and Unstructured Data to Identify Duplicate Cases in Spontaneous Adverse Event Reporting Systems
verfasst von
Kory Kreimeyer
David Menschik
Scott Winiecki
Wendy Paul
Faith Barash
Emily Jane Woo
Meghna Alimchandani
Deepa Arya
Craig Zinderman
Richard Forshee
Taxiarchis Botsis
Publikationsdatum
01.07.2017
Verlag
Springer International Publishing
Erschienen in
Drug Safety / Ausgabe 7/2017
Print ISSN: 0114-5916
Elektronische ISSN: 1179-1942
DOI
https://doi.org/10.1007/s40264-017-0523-4

Weitere Artikel der Ausgabe 7/2017

Drug Safety 7/2017 Zur Ausgabe