Skip to main navigation Skip to search Skip to main content

LLM-Based Retrieval-Augmented Generation (RAG) Pipeline with Multi-Modal RGB-D Sensing Fusion and Deep Learning Computer Vision for Automated Monitoring, Scene Understanding, and Mitigation of Worker-Machinery Proximity Safety Issues on Construction Jobsites

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Worker-machinery proximity is a key contributor to safety incidents in the construction industry. This study proposes a non-intrusive and cost-effective retrieval-augmentation generation (RAG) pipeline for proximity work safety monitoring, enhanced by automated unsafe scene understanding and mitigation solution recommendation. The proposed pipeline aims to automatically (1) perform worker and machinery detection and proximity estimation using deep learning computer vision and RGB-D sensing fusion, (2) conduct query on the imagery data for scene information generation through vision-to-text mechanism, and (3) identify the potential safety hazards with recommended solutions through large language models (LLMs)-based RAG. Experimental results indicated that the proposed pipeline could accurately detect workers and machinery in real time, effectively estimate proximity information, and reliably generate scene understanding information with identified safety hazards and mitigation actions/solutions. This study offers a new RAG solution to advancing proximity-related construction safety management practices towards a more cost-effective, less complex, and more informative manner.

Original languageEnglish (US)
Title of host publicationComputing in Civil Engineering 2025
Subtitle of host publicationResilient, Robotic, and Educational Systems - Selected Papers from the ASCE International Conference on Computing in Civil Engineering 2025
EditorsAmirhosein Jafari, Yimin Zhu
PublisherAmerican Society of Civil Engineers (ASCE)
Pages764-774
Number of pages11
ISBN (Electronic)9780784486443
DOIs
StatePublished - 2025
EventASCE International Conference on Computing in Civil Engineering, i3CE 2025 - New Orleans, United States
Duration: May 11 2025May 14 2025

Publication series

NameComputing in Civil Engineering 2025: Resilient, Robotic, and Educational Systems - Selected Papers from the ASCE International Conference on Computing in Civil Engineering 2025

Conference

ConferenceASCE International Conference on Computing in Civil Engineering, i3CE 2025
Country/TerritoryUnited States
CityNew Orleans
Period5/11/255/14/25

All Science Journal Classification (ASJC) codes

  • Civil and Structural Engineering
  • Electrical and Electronic Engineering
  • Artificial Intelligence
  • Computer Science Applications

Fingerprint

Dive into the research topics of 'LLM-Based Retrieval-Augmented Generation (RAG) Pipeline with Multi-Modal RGB-D Sensing Fusion and Deep Learning Computer Vision for Automated Monitoring, Scene Understanding, and Mitigation of Worker-Machinery Proximity Safety Issues on Construction Jobsites'. Together they form a unique fingerprint.

Cite this