A study on automatic extraction of new terms

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This research explores to automatically predict new terms based on linguistic features and statistical behaviors of noun phrases during a special period. It integrates both syntactic function value and TF-IDF value into an automatic term extraction system to weight new term candidates. Research questions include: what are the linguistic and statistic properties of new terms during a special period? Will linguistic features contribute to prediction of new terms? And will statistic features like, TFIDF Value contribute to prediction of new terms? Correspondingly, a series of experiments are conducted on medical corpus to examine a group of new terms' distribution properties and syntactic features across two years in comparison. The results show there does exist significant difference between two groups of values. Regardless of this limitation, this research is meaningful as it attempts to realize automation of selection process of new medical terms, which will greatly avoid subjective decisions and reduce experts' workloads.

Original languageEnglish
Title of host publicationProceedings - 2011 4th International Symposium on Knowledge Acquisition and Modeling, KAM 2011
Pages599-602
Number of pages4
DOIs
Publication statusPublished - 2011
Event2011 4th International Symposium on Knowledge Acquisition and Modeling, KAM 2011 - Sanya, China
Duration: 8 Oct 20119 Oct 2011

Publication series

NameProceedings - 2011 4th International Symposium on Knowledge Acquisition and Modeling, KAM 2011

Conference

Conference2011 4th International Symposium on Knowledge Acquisition and Modeling, KAM 2011
Country/TerritoryChina
CitySanya
Period8/10/119/10/11

Keywords

  • New term
  • SF-Value
  • TFIDF
  • old term
  • term extraction

Fingerprint

Dive into the research topics of 'A study on automatic extraction of new terms'. Together they form a unique fingerprint.

Cite this