Automatic recognition of multi-word terms: the C-value/NC-value method

  • Natural language processing for digital libraries
International Journal on Digital Libraries

Abstract.

Technical terms (henceforth called terms) are important elements for digital libraries. In this paper we present a domain-independent method for the automatic extraction of multi-word terms from machine-readable special language corpora. The method (C-value/NC-value) combines linguistic and statistical information. The first part, C-value, enhances the common statistical measure of frequency of occurrence for term extraction, making it sensitive to a particular type of multi-word term, the nested term. The second part, NC-value, gives: 1) a method for the extraction of term context words (words that tend to appear with terms); 2) the incorporation of information from term context words into the extraction of terms.
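The abstract does not state the C-value formula itself. As a rough illustration of how frequency can be made sensitive to nested terms, the sketch below assumes the commonly cited form of the measure: a candidate's frequency is reduced by the average frequency of the longer candidates that contain it, and the result is scaled by the base-2 logarithm of the candidate's length in words. The function names `contains` and `c_value` and the frequency counts are hypothetical, chosen only for this example; the linguistic filtering that produces the candidate list is not shown.

```python
import math


def contains(longer, shorter):
    """Return True if `shorter` occurs as a contiguous word sequence
    strictly inside the longer candidate `longer` (both are tuples of words)."""
    n, m = len(longer), len(shorter)
    return m < n and any(longer[i:i + m] == shorter for i in range(n - m + 1))


def c_value(freq):
    """Score candidate terms under the assumed C-value definition.

    freq maps each candidate (a tuple of two or more words) to its total
    corpus frequency, including occurrences nested in longer candidates.
    """
    scores = {}
    for a, f_a in freq.items():
        # Frequencies of the longer candidates in which `a` is nested.
        nesting = [f_b for b, f_b in freq.items() if contains(b, a)]
        if nesting:
            # Nested candidate: subtract the average frequency of its containers.
            adjusted = f_a - sum(nesting) / len(nesting)
        else:
            # Not nested: plain frequency.
            adjusted = f_a
        scores[a] = math.log2(len(a)) * adjusted
    return scores


# Invented counts, for illustration only.
freq = {
    ("soft", "contact", "lens"): 5,
    ("hard", "contact", "lens"): 1,
    ("contact", "lens"): 8,
}
scores = c_value(freq)
for term, score in sorted(scores.items(), key=lambda kv: -kv[1]):
    print(" ".join(term), round(score, 3))
```

With these counts, "contact lens" keeps a positive score because it occurs more often than the average of the longer candidates containing it. This is the sense in which the measure is sensitive to nested terms: a nested string that rarely appears outside its containers is penalised. The NC-value step, which reweights these scores using term context words, is not covered by this sketch.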

Additional information

Received: 17 December 1998 / Revised: 19 May 1999

About this article

Cite this article

Frantzi, K., Ananiadou, S. & Mima, H. Automatic recognition of multi-word terms: the C-value/NC-value method. Int J Digit Libr 3, 115–130 (2000). https://doi.org/10.1007/s007999900023

  • DOI: https://doi.org/10.1007/s007999900023
