Conditional random fields for term extraction

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Citation (Scopus)

Abstract

In this paper, we describe how to construct a machine learning framework that utilizes syntactic information in extraction of biomedical terms. Conditional random fields (CRF), is used as the basis of this framework. We make an effort to find the appropriate use for syntactic information, including parent nodes, syntactic paths and term ratios under the machine learning framework. The experiment results show that syntactic paths and term ratios can improve precision of term extraction, including old terms and novel terms. However, the recall rate of novel terms still needs to be increased. This research serves as an example for constructing machine learning based term extraction systems that utilizes linguistic information.

Original languageEnglish
Title of host publicationKDIR 2010 - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval
Pages414-417
Number of pages4
Publication statusPublished - 2010
EventInternational Conference on Knowledge Discovery and Information Retrieval, KDIR 2010 - Valencia, Spain
Duration: 25 Oct 201028 Oct 2010

Publication series

NameKDIR 2010 - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval

Conference

ConferenceInternational Conference on Knowledge Discovery and Information Retrieval, KDIR 2010
Country/TerritorySpain
CityValencia
Period25/10/1028/10/10

Keywords

  • Conditional random fields
  • Syntactic function
  • Term extraction
  • Term ratio

Fingerprint

Dive into the research topics of 'Conditional random fields for term extraction'. Together they form a unique fingerprint.

Cite this