Bio-medical term extraction on simple rule language

Takashi Sinnou, Koichi Takeuchi, Nigel Collier

Research output: Contribution to conferencePaperpeer-review

Abstract

For disease surveillance system, bio-medical term extraction is a key technology for a surveillance system of epidemic disease news from the Web1. In the previous work we applied statistical learning model to extract terms from the Web site. The previous approach is good at extracting terms with high precision rates; however it is weak at extracting new terms that do not exist in the training data. Since we usually have new disease names a new term extraction approach with high coverage for unknown or low-frequent terms is needed. Recently, Simple rule Language (SRL), a rule-based word extraction language, is freely available2. The SRL also has an developing environment called SRL editor. Thus we are constructing rules of bio-medical terms on the several language (such as English, Japanese, Thai and Vietnam) for the multilingual disease surveillance system. In this manuscript we confirm how we construct rules to extract Japanese bio-medical terms from Japanese news articles.

Original languageEnglish
Pages132-134
Number of pages3
Publication statusPublished - 2009
Event3rd International Symposium on Languages in Biology and Medicine, LBM 2009 - Jeju Island, Korea, Republic of
Duration: Nov 8 2009Nov 10 2009

Conference

Conference3rd International Symposium on Languages in Biology and Medicine, LBM 2009
Country/TerritoryKorea, Republic of
CityJeju Island
Period11/8/0911/10/09

ASJC Scopus subject areas

  • Computer Science Applications
  • Health Informatics

Fingerprint

Dive into the research topics of 'Bio-medical term extraction on simple rule language'. Together they form a unique fingerprint.

Cite this