Abstract
For disease surveillance system, bio-medical term extraction is a key technology for a surveillance system of epidemic disease news from the Web1. In the previous work we applied statistical learning model to extract terms from the Web site. The previous approach is good at extracting terms with high precision rates; however it is weak at extracting new terms that do not exist in the training data. Since we usually have new disease names a new term extraction approach with high coverage for unknown or low-frequent terms is needed. Recently, Simple rule Language (SRL), a rule-based word extraction language, is freely available2. The SRL also has an developing environment called SRL editor. Thus we are constructing rules of bio-medical terms on the several language (such as English, Japanese, Thai and Vietnam) for the multilingual disease surveillance system. In this manuscript we confirm how we construct rules to extract Japanese bio-medical terms from Japanese news articles.
Original language | English |
---|---|
Pages | 132-134 |
Number of pages | 3 |
Publication status | Published - 2009 |
Event | 3rd International Symposium on Languages in Biology and Medicine, LBM 2009 - Jeju Island, Korea, Republic of Duration: Nov 8 2009 → Nov 10 2009 |
Conference
Conference | 3rd International Symposium on Languages in Biology and Medicine, LBM 2009 |
---|---|
Country/Territory | Korea, Republic of |
City | Jeju Island |
Period | 11/8/09 → 11/10/09 |
ASJC Scopus subject areas
- Computer Science Applications
- Health Informatics