ECIS 2021 Research Papers

Information Overload in Crisis Management: Bilingual Evaluation of Embedding Models for Clustering Social Media Posts in Emergencies

Markus Bayer, Technical University of Darmstadt
Marc-André Kaufhold, Technical University of Darmstadt
Christian Reuter, Technical University of Darmstadt

Paper Number

1338

Abstract

Past studies in the domain of information systems have analysed the potentials and barriers of social media in emergencies. While information disseminated on social media can yield valuable insights, emergency services and researchers face the challenge of information overload, as the volume of data quickly exceeds what can be managed. We propose an embedding-based clustering approach and a method for the automated labelling of clusters. Given that clustering quality depends heavily on the embeddings, we evaluate 19 embedding models with respect to time, internal cluster quality, and language invariance. The results show that it may be sensible to use embedding models that were already trained on other crisis datasets. However, one must ensure that the training data generalizes well enough for the clustering to adapt to new situations. Confirming this, we found that some embeddings did not perform as well on a German dataset as on an English dataset.
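To make the notion of "internal cluster quality" concrete, the following is a minimal sketch that compares two embedding models by the mean silhouette coefficient of the clusters they induce. The abstract does not name the metric used in the paper, so the silhouette coefficient is an assumption here, and the model names, 2-D vectors, and cluster labels are hypothetical toy data rather than results from the study.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def silhouette_score(points, labels):
    """Mean silhouette coefficient over all points (range -1 to 1, higher is better)."""
    if len(set(labels)) < 2:
        raise ValueError("silhouette needs at least two clusters")
    scores = []
    for i, (p, own) in enumerate(zip(points, labels)):
        # Collect distances from p to every other point, grouped by cluster.
        dists = {}
        for j, (q, other) in enumerate(zip(points, labels)):
            if i != j:
                dists.setdefault(other, []).append(euclidean(p, q))
        if own not in dists:
            scores.append(0.0)  # singleton cluster: silhouette is defined as 0
            continue
        a = sum(dists[own]) / len(dists[own])  # mean intra-cluster distance
        b = min(sum(d) / len(d) for lab, d in dists.items() if lab != own)  # nearest other cluster
        denom = max(a, b)
        scores.append((b - a) / denom if denom > 0 else 0.0)
    return sum(scores) / len(scores)


# Hypothetical toy data: the same four posts embedded by two made-up models,
# with the same cluster assignment applied to both.
labels = [0, 0, 1, 1]
embeddings = {
    "model_a": [(0.0, 0.0), (0.0, 1.0), (5.0, 5.0), (5.0, 6.0)],  # clusters well separated
    "model_b": [(0.0, 0.0), (5.0, 5.0), (0.0, 1.0), (5.0, 6.0)],  # clusters interleaved
}

quality = {name: silhouette_score(vecs, labels) for name, vecs in embeddings.items()}
best_model = max(quality, key=quality.get)
```

Here `best_model` is the embedding whose vectors give the most compact and best-separated clusters under this one metric. Time and language invariance, the other two criteria named in the abstract, would need to be measured separately.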

Recommended Citation

Bayer, Markus; Kaufhold, Marc-André; and Reuter, Christian, "Information Overload in Crisis Management: Bilingual Evaluation of Embedding Models for Clustering Social Media Posts in Emergencies" (2021). ECIS 2021 Research Papers. 64.
https://aisel.aisnet.org/ecis2021_rp/64
