Early detection of heterogeneous disaster events using social media | Zendy

Pekar Viktor | Zendy; Binner Jane | Zendy; Najafi Hossein | Zendy; Hale Chris | Zendy; Schmidt Vincent | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Premium

Early detection of heterogeneous disaster events using social media

Author(s) -

Pekar Viktor,

Binner Jane,

Najafi Hossein,

Hale Chris,

Schmidt Vincent

Publication year - 2020

Publication title -

journal of the association for information science and technology

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.903

H-Index - 145

eISSN - 2330-1643

pISSN - 2330-1635

DOI - 10.1002/asi.24208

Subject(s) - computer science , boosting (machine learning) , situation awareness , social media , machine learning , ensemble learning , adaboost , artificial intelligence , data set , benchmark (surveying) , homogeneous , set (abstract data type) , context (archaeology) , data science , support vector machine , data mining , world wide web , paleontology , physics , geodesy , biology , geography , engineering , thermodynamics , programming language , aerospace engineering

This article addresses the problem of detecting crisis‐related messages on social media, in order to improve the situational awareness of emergency services. Previous work focused on developing machine‐learning classifiers restricted to specific disasters, such as storms or wildfires. We investigate for the first time methods to detect such messages where the type of the crisis is not known in advance, that is, the data are highly heterogeneous. Data heterogeneity causes significant difficulties for learning algorithms to generalize and accurately label incoming data. Our main contributions are as follows. First, we evaluate the extent of this problem in the context of disaster management, finding that the performance of traditional learners drops by up to 40% when trained and tested on heterogeneous data vis‐á‐vis homogeneous data. Then, in order to overcome data heterogeneity, we propose a new ensemble learning method, and found this to perform on a par with the Gradient Boosting and AdaBoost ensemble learners. The methods are studied on a benchmark data set comprising 26 disaster events and four classification problems: detection of relevant messages, informative messages, eyewitness reports, and topical classification of messages. Finally, in a case study, we evaluate the proposed methods on a real‐world data set to assess its practical value.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here

Accelerating Research