The ubiquitousness of smartphones enables people to announce an emergency they’re observing in real-time. Because of this, more agencies are interested in programatically monitoring Twitter (i.e. disaster relief organisations and news agencies). But, it’s not always clear whether a person’s words are actually announcing a disaster. Aim of this project is to build a machine learning model that predicts which Tweets are about real disasters and which one’s aren’t.
Each sample in the data has the following information:
The development of a predictive model has involved the implementation of the following procedural steps:
XGBoost evaluation
Document-Term Matrix for not a disaster
Document-Term Matrix for a disaster