A multilingual ontology for infectious disease surveillance: Rationale, design and challenges

Nigel Collier, Ai Kawazoe, Lihua Jin, Mika Shigematsu, Dinh Dien, Roberto A. Barrero, Koichi Takeuchi, Asanee Kawtrakul

Research output: Contribution to journalArticle

38 Citations (Scopus)

Abstract

A lack of surveillance system infrastructure in the Asia-Pacific region is seen as hindering the global control of rapidly spreading infectious diseases such as the recent avian H5N1 epidemic. As part of improving surveillance in the region, the BioCaster project aims to develop a system based on text mining for automatically monitoring Internet news and other online sources in several regional languages. At the heart of the system is an application ontology which serves the dual purpose of enabling advanced searches on the mined facts and of allowing the system to make intelligent inferences for assessing the priority of events. However, it became clear early on in the project that existing classification schemes did not have the necessary language coverage or semantic specificity for our needs. In this article we present an overview of our needs and explore in detail the rationale and methods for developing a new conceptual structure and multilingual terminological resource that focusses on priority pathogens and the diseases they cause. The ontology is made freely available as an online database and downloadable OWL file.

Original languageEnglish
Pages (from-to)405-413
Number of pages9
JournalLanguage Resources and Evaluation
Volume40
Issue number3-4
DOIs
Publication statusPublished - Dec 2006

    Fingerprint

Keywords

  • Infectious disease surveillance
  • Multilingual ontology
  • Text mining

ASJC Scopus subject areas

  • Linguistics and Language

Cite this

Collier, N., Kawazoe, A., Jin, L., Shigematsu, M., Dien, D., Barrero, R. A., Takeuchi, K., & Kawtrakul, A. (2006). A multilingual ontology for infectious disease surveillance: Rationale, design and challenges. Language Resources and Evaluation, 40(3-4), 405-413. https://doi.org/10.1007/s10579-007-9019-7