Abstract
Automatic translation of clinical researcher data requests to executable database queries is instrumental to an effective interface between clinical researchers and “Big Clinical Data”. A necessary step towards this goal is to parse ample temporal expressions in free-text researcher requests. This paper reports a novel algorithm called TEXer. It uses heuristic rule and pattern learning for extracting and normalizing temporal expressions in researcher requests. Based on 400 real clinical queries with human annotations, we compared our method with four baseline methods. TEXer achieved a precision of 0.945 and a recall of 0.858, outperforming all the baseline methods. We conclude that TEXer is an effective method for temporal expression extraction from free-text clinical data requests.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Lonsdale, D.W., Tustison, C., Parker, C.G., Embley, D.W.: Assessing Clinical Trial Eligibility with Logic Expression Queries. Data & Knowledge Engineering 66(1), 3–17 (2008)
Hruby, G.W., Boland, M.R., et al.: Characterization of the Biomedical Query Mediation Process. In: Proc. of AMIA 2013 Clinical Research Informatics Summit, San Francisco, CA, March 18-22, pp. 89–93 (2013)
Strötgen, J., Gertz, M.: Heideltime: High Quality Rule-based Extraction and Normalization of Temporal Expressions. In: Proc. of the Workshop on Semantic Evaluation, pp. 321–324. ACL (2010)
Pustejovsky, J., Verhagen, M.: Semeval-2010 Task 13: Evaluating Events, Time Expressions, and Temporal Relations (tempeval-2). In: Proc. of the Workshop on Semantic Evaluations, pp. 112–116. ACL (2009)
Verhagen, M., Sauri, R., Caselli, T., Pustejovsky, J.: Semeval-2010 Task 13: Tempeval-2. In: Proc. of the Workshop on Semantic Evaluation, pp. 57–62. ACL (2010)
Pustejovsky, P., Castaño, J., et al.: Timeml: Robust Specification of Event and Temporal Expressions in Text. In: Proc. of the IWCS-5 Fifth International (2003)
Sohn, S., Wagholikar, K., Li, D., et al.: Comprehensive Temporal Information Detection from Clinical Text: Medical Events, Time, and Tlink Identification. J. Am. Med. Inform. Assoc. (2013), doi:10.1136/amiajnl-2013-00162
Tang, B., Wu., Y., et al.: A Hybrid System for Temporal Information Extraction from Clinical Text. J. Am. Med. Inform. Assoc. (2013), doi:10.1136/amiajnl-2013-001635
Tao, C., He, Y., Poland, G., Chute, C., Yang, H.: Ontology-based Time Information Representation of Vaccine Adverse Events in Vaers for Temporal Analysis. Journal of Biomedical Semantics 3(13) (2012)
Li, M., Patrick, J.: Extracting Temporal Information from Electronic Patient Records. In: Proc. of AMIA Annu. Symp. Proc., pp. 542–551 (2012)
Galescu, L., Blaylock, N.: A Corpus of Clinical Narratives Annotated with Temporal Information. In: Proc. of International Health Informatics Symposium, pp. 715–720 (2012)
Luo, Z., Johnson, S., Lai, A., Weng, C.: Extracting Temporal Constraints from Clinical Research Eligibility Criteria Using Conditional Random Fields. In: Proc. of AMIA Annual Symposium, pp. 843–852 (2011)
Mani, I., Wilson, G.: Automating Temporal Annotation with Tarsqi. In: Proc. of 38th Annual Meeting of the ACL, pp. 69–76 (2000)
Zhao, R., Do, Q., Roth, D.: A Robust Shallow Temporal Reasoning System. In: Proc. of NAACL-HLT Demo. (2012)
Bird, S.: Nltk: the Natural Languagetoolkit. In: Proc. of the COLING/ACL 2006 Interactive Presentation Sessions, pp. 69–72 (2006)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hao, T., Rusanov, A., Weng, C. (2013). Extracting and Normalizing Temporal Expressions in Clinical Data Requests from Researchers. In: Zeng, D., et al. Smart Health. ICSH 2013. Lecture Notes in Computer Science, vol 8040. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-39844-5_7
Download citation
DOI: https://doi.org/10.1007/978-3-642-39844-5_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-39843-8
Online ISBN: 978-3-642-39844-5
eBook Packages: Computer ScienceComputer Science (R0)