Precision Prediction for Dengue Fever in Singapore: A Machine Learning Approach Incorporating Meteorological Data

Trop Med Infect Dis. 2024 Mar 29;9(4):72. doi: 10.3390/tropicalmed9040072.

Abstract

Objective: This study aimed to improve dengue fever predictions in Singapore using a machine learning model that incorporates meteorological data, addressing the current methodological limitations by examining the intricate relationships between weather changes and dengue transmission.

Method: Using weekly dengue case and meteorological data from 2012 to 2022, the data was preprocessed and analyzed using various machine learning algorithms, including General Linear Model (GLM), Support Vector Machine (SVM), Gradient Boosting Machine (GBM), Decision Tree (DT), Random Forest (RF), and eXtreme Gradient Boosting (XGBoost) algorithms. Performance metrics such as Mean Absolute Error (MAE), Root Mean Square Error (RMSE), and R-squared (R2) were employed.

Results: From 2012 to 2022, there was a total of 164,333 cases of dengue fever. Singapore witnessed a fluctuating number of dengue cases, peaking notably in 2020 and revealing a strong seasonality between March and July. An analysis of meteorological data points highlighted connections between certain climate variables and dengue fever outbreaks. The correlation analyses suggested significant associations between dengue cases and specific weather factors such as solar radiation, solar energy, and UV index. For disease predictions, the XGBoost model showed the best performance with an MAE = 89.12, RMSE = 156.07, and R2 = 0.83, identifying time as the primary factor, while 19 key predictors showed non-linear associations with dengue transmission. This underscores the significant role of environmental conditions, including cloud cover and rainfall, in dengue propagation.

Conclusion: In the last decade, meteorological factors have significantly influenced dengue transmission in Singapore. This research, using the XGBoost model, highlights the key predictors like time and cloud cover in understanding dengue's complex dynamics. By employing advanced algorithms, our study offers insights into dengue predictive models and the importance of careful model selection. These results can inform public health strategies, aiming to improve dengue control in Singapore and comparable regions.

Keywords: dengue fever; machine learning; meteorological data; prediction.