Disaster Tweets Classification Method based on Pretrained BERT Model

Journal of Graphics, 2022

Recommended citation: Lin, J.R.*, Cheng, Z.G., Han, Y., Yin, Y.P. (2022). Disaster Tweets Classification Method based on Pretrained BERT Model. Journal of Graphics, 43(3), 530-536. doi: 10.11996/JG.j.2095-302X.2022030530. http://www.txxb.com.cn/EN/10.11996/JG.j.2095-302X.2022030530


Social media has become an important medium for releasing and disseminating disaster information, and identifying and using that information effectively is of great significance to disaster emergency management. To address the shortcomings of traditional text classification models, this study proposes a disaster tweet classification method based on the pre-trained bidirectional encoder representations from transformers (BERT) model. After data cleaning and preprocessing, the study constructed, through comparative analysis, a BERT-based text classification model built on a long short-term memory-convolutional neural network (LSTM-CNN). Experiments on the tweet datasets of the Kaggle competition platform showed that the proposed classification model outperforms both the traditional Naive Bayes classification model and the common fine-tuning model, with a recognition rate of up to 85%. This study could help improve the identification accuracy of real disaster information and the efficiency of disaster emergency response.
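
The abstract mentions data cleaning and preprocessing without listing the individual steps. As a rough illustration only, the sketch below shows one plausible way to normalize a raw tweet before tokenization. The function name clean_tweet and every rule in it (dropping links and @mentions, keeping hashtag words, lowercasing, removing punctuation) are assumptions made for this example and are not taken from the paper.

```python
import html
import re

# Assumed cleaning rules for illustration; the paper's actual steps are not
# described in the abstract and may differ.
URL_RE = re.compile(r"https?://\S+|www\.\S+")
MENTION_RE = re.compile(r"@\w+")
NON_TEXT_RE = re.compile(r"[^a-z0-9\s']")
SPACE_RE = re.compile(r"\s+")


def clean_tweet(text: str) -> str:
    """Normalize one raw tweet into lowercase, single-space-separated text."""
    text = html.unescape(text)         # decode entities such as "&amp;"
    text = URL_RE.sub(" ", text)       # drop links
    text = MENTION_RE.sub(" ", text)   # drop @user mentions
    text = text.replace("#", " ")      # keep the hashtag word, drop the symbol
    text = text.lower()
    text = NON_TEXT_RE.sub(" ", text)  # drop punctuation, emoji, other symbols
    return SPACE_RE.sub(" ", text).strip()
```

Whether lowercasing and punctuation removal help depends on the tokenizer used downstream; a cased BERT vocabulary, for instance, would call for skipping the lowercasing step.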

Download paper here

Download preprint here

The authors are grateful for the financial support received from the National Natural Science Foundation of China (No. 72091512, No. 51908323).
