TY - JOUR
T1 - Revealing the sources of arsenic in private well water using Random Forest Classification and Regression
AU - Giri, Subhasis
AU - Kang, Yang
AU - MacDonald, Kristi
AU - Tippett, Mara
AU - Qiu, Zeyuan
AU - Lathrop, Richard G.
AU - Obropta, Christopher C.
N1 - Publisher Copyright: © 2022 Elsevier B.V.
PY - 2023/1/20
Y1 - 2023/1/20
AB - Exposure to arsenic through private drinking water wells poses serious human health risks worldwide. Water testing data indicate arsenic contamination in private drinking water wells across New Jersey. To reduce the adverse health risk due to exposure to arsenic in drinking water, it is necessary to identify arsenic sources affecting private wells. Private wells are not regulated by any federal or state agency under the Safe Drinking Water Act, so information is often lacking. To this end, we have developed machine learning algorithms, including Random Forest Classification and Regression, to decipher the factors contributing to higher arsenic concentration in private drinking water wells in west-central New Jersey. Arsenic concentration in private drinking water wells served as the response variable, while the explanatory variables were geological bedrock type, soil type, drainage class, land use/cover, and the presence of orchards, contaminated sites, and abandoned mines within a 152.4-meter (500 ft) radius of each well. Random Forest Classification and Regression achieved 66 % and 55 % prediction accuracies for arsenic concentration in private drinking water wells, respectively. Overall, both models identify bedrock, soil, land use/cover, and drainage type (in descending order) as the most important variables contributing to higher arsenic concentration in well water. The models further identify bedrock subgroups at a finer scale, including the Passaic, Lockatong, and Stockton Formations, as contributing significantly to arsenic concentration in well water. Identifying sources of arsenic contamination in private drinking water wells at such a fine scale facilitates the development of more targeted outreach and mitigation strategies to improve water quality and safeguard human health.
KW - Arsenic
KW - Bed rock
KW - Human health
KW - Private well water
KW - Random Forest Classification
KW - Random Forest Regression
UR - http://www.scopus.com/inward/record.url?scp=85139830388&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85139830388&partnerID=8YFLogxK
U2 - 10.1016/j.scitotenv.2022.159360
DO - 10.1016/j.scitotenv.2022.159360
M3 - Article
C2 - 36240940
AN - SCOPUS:85139830388
SN - 0048-9697
VL - 857
JO - Science of the Total Environment
JF - Science of the Total Environment
M1 - 159360
ER -