Abstract
In this paper, we analyze multi-modal data including environmental, geographic, infrastructure, demographic, and socioeconomic data from New York City (NYC), to examine their impact on flood risk. We applied XGBoost machine learning model combined with SHapley Additive exPlanations (SHAP) function to analyze and interpret the contributions of these factors to flood risks. We tested the model with a window of 1 to 7 days to determine optimum time scales to capture the essential information in the time series and factors' dependencies over various time periods. Our findings reveal that although environmental and geographic factors have a strong correlation with flood risk, socioeconomic factors, such as median age, median house value, population density, and race also play an important role in such risks. Low-income household areas have higher flood risks, which happen to also be minority communities, making them more vulnerable to flooding than higher-income communities. Furthermore, population density and level of education may also have impact on flood risks. These insights can be used to facilitate planning and preparing future flood prevention and mitigation strategies.
| Original language | English (US) |
|---|---|
| Title of host publication | 2025 IEEE Conference on Technologies for Sustainability, SusTech 2025 |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Edition | 2025 |
| ISBN (Electronic) | 9798331504311 |
| DOIs | |
| State | Published - 2025 |
| Event | 12th IEEE Conference on Technologies for Sustainability, SusTech 2025 - Los Angeles, United States Duration: Apr 20 2025 → Apr 23 2025 |
Conference
| Conference | 12th IEEE Conference on Technologies for Sustainability, SusTech 2025 |
|---|---|
| Country/Territory | United States |
| City | Los Angeles |
| Period | 4/20/25 → 4/23/25 |
All Science Journal Classification (ASJC) codes
- Energy Engineering and Power Technology
- Renewable Energy, Sustainability and the Environment
- Control and Optimization
Keywords
- feature importance
- Flash flood
- machine learning
- risk