Twitter catches the flu: Detecting influenza epidemics using Twitter

Eiji Aramaki, Sachiko Maskawa, Mizuki Morita

Research output: Chapter in Book/Report/Conference proceedingConference contribution

321 Citations (Scopus)

Abstract

With the recent rise in popularity and scale of social media, a growing need exists for systems that can extract useful information from huge amounts of data. We address the issue of detecting influenza epidemics. First, the proposed system extracts influenza related tweets using Twitter API. Then, only tweets that mention actual influenza patients are extracted by the support vector machine (SVM) based classifier. The experiment results demonstrate the feasibility of the proposed approach (0.89 correlation to the gold standard). Especially at the outbreak and early spread (early epidemic stage), the proposed method shows high correlation (0.97 correlation), which outperforms the state-of-the-art methods. This paper describes that Twitter texts reflect the real world, and that NLP techniques can be applied to extract only tweets that contain useful information.

Original languageEnglish
Title of host publicationEMNLP 2011 - Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference
Pages1568-1576
Number of pages9
Publication statusPublished - 2011
Externally publishedYes
EventConference on Empirical Methods in Natural Language Processing, EMNLP 2011 - Edinburgh, United Kingdom
Duration: Jul 27 2011Jul 31 2011

Other

OtherConference on Empirical Methods in Natural Language Processing, EMNLP 2011
CountryUnited Kingdom
CityEdinburgh
Period7/27/117/31/11

Fingerprint

Application programming interfaces (API)
Support vector machines
Classifiers
Experiments

ASJC Scopus subject areas

  • Computational Theory and Mathematics
  • Computer Science Applications
  • Information Systems

Cite this

Aramaki, E., Maskawa, S., & Morita, M. (2011). Twitter catches the flu: Detecting influenza epidemics using Twitter. In EMNLP 2011 - Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference (pp. 1568-1576)

Twitter catches the flu : Detecting influenza epidemics using Twitter. / Aramaki, Eiji; Maskawa, Sachiko; Morita, Mizuki.

EMNLP 2011 - Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference. 2011. p. 1568-1576.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Aramaki, E, Maskawa, S & Morita, M 2011, Twitter catches the flu: Detecting influenza epidemics using Twitter. in EMNLP 2011 - Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference. pp. 1568-1576, Conference on Empirical Methods in Natural Language Processing, EMNLP 2011, Edinburgh, United Kingdom, 7/27/11.
Aramaki E, Maskawa S, Morita M. Twitter catches the flu: Detecting influenza epidemics using Twitter. In EMNLP 2011 - Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference. 2011. p. 1568-1576
Aramaki, Eiji ; Maskawa, Sachiko ; Morita, Mizuki. / Twitter catches the flu : Detecting influenza epidemics using Twitter. EMNLP 2011 - Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference. 2011. pp. 1568-1576
@inproceedings{65629a4197c4472ea8adb2de5e9189f7,
title = "Twitter catches the flu: Detecting influenza epidemics using Twitter",
abstract = "With the recent rise in popularity and scale of social media, a growing need exists for systems that can extract useful information from huge amounts of data. We address the issue of detecting influenza epidemics. First, the proposed system extracts influenza related tweets using Twitter API. Then, only tweets that mention actual influenza patients are extracted by the support vector machine (SVM) based classifier. The experiment results demonstrate the feasibility of the proposed approach (0.89 correlation to the gold standard). Especially at the outbreak and early spread (early epidemic stage), the proposed method shows high correlation (0.97 correlation), which outperforms the state-of-the-art methods. This paper describes that Twitter texts reflect the real world, and that NLP techniques can be applied to extract only tweets that contain useful information.",
author = "Eiji Aramaki and Sachiko Maskawa and Mizuki Morita",
year = "2011",
language = "English",
isbn = "1937284115",
pages = "1568--1576",
booktitle = "EMNLP 2011 - Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference",

}

TY - GEN

T1 - Twitter catches the flu

T2 - Detecting influenza epidemics using Twitter

AU - Aramaki, Eiji

AU - Maskawa, Sachiko

AU - Morita, Mizuki

PY - 2011

Y1 - 2011

N2 - With the recent rise in popularity and scale of social media, a growing need exists for systems that can extract useful information from huge amounts of data. We address the issue of detecting influenza epidemics. First, the proposed system extracts influenza related tweets using Twitter API. Then, only tweets that mention actual influenza patients are extracted by the support vector machine (SVM) based classifier. The experiment results demonstrate the feasibility of the proposed approach (0.89 correlation to the gold standard). Especially at the outbreak and early spread (early epidemic stage), the proposed method shows high correlation (0.97 correlation), which outperforms the state-of-the-art methods. This paper describes that Twitter texts reflect the real world, and that NLP techniques can be applied to extract only tweets that contain useful information.

AB - With the recent rise in popularity and scale of social media, a growing need exists for systems that can extract useful information from huge amounts of data. We address the issue of detecting influenza epidemics. First, the proposed system extracts influenza related tweets using Twitter API. Then, only tweets that mention actual influenza patients are extracted by the support vector machine (SVM) based classifier. The experiment results demonstrate the feasibility of the proposed approach (0.89 correlation to the gold standard). Especially at the outbreak and early spread (early epidemic stage), the proposed method shows high correlation (0.97 correlation), which outperforms the state-of-the-art methods. This paper describes that Twitter texts reflect the real world, and that NLP techniques can be applied to extract only tweets that contain useful information.

UR - http://www.scopus.com/inward/record.url?scp=80053290412&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=80053290412&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:80053290412

SN - 1937284115

SN - 9781937284114

SP - 1568

EP - 1576

BT - EMNLP 2011 - Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference

ER -