Skip to main navigation Skip to search Skip to main content

Integrating Large Language Models (LLMs) with Autonomous Aerial Drone Robotics and Computer Vision for Contextual Adaptive Construction Site Safety Management and Risk Assessment

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Integrating large language models (LLMs) and robotics offers transformative potential for enhancing construction site safety monitoring, real-time risk assessment, and situational response. This research proposes an intelligent drone-based system that leverages real-time object detection and contextual image analysis with advanced LLM reasoning capabilities for construction site supervision. A fine-tuned YOLOv11n model (using transfer learning) was developed for detecting 10 different construction site safety-related classes. In critical safety violations (e.g., "No Hardhat"), the corresponding image frame is sent to CLIP (Contrastive Language-Image Pretraining) for generating image-based descriptions. These data are processed by a fine-tuned LLM to generate construction-specific textual prompts, which are converted to audio and broadcast via a drone-mounted speaker. The system operates autonomously using a D* planning algorithm. Detection, response generation, and navigation capabilities were evaluated in a simulated environment using Webots, and the pipeline from object segmentation to audio generation was ported to a real-world drone.

Original languageEnglish (US)
Title of host publicationComputing in Civil Engineering 2025
Subtitle of host publicationResilient, Robotic, and Educational Systems - Selected Papers from the ASCE International Conference on Computing in Civil Engineering 2025
EditorsAmirhosein Jafari, Yimin Zhu
PublisherAmerican Society of Civil Engineers (ASCE)
Pages509-518
Number of pages10
ISBN (Electronic)9780784486443
DOIs
StatePublished - 2025
EventASCE International Conference on Computing in Civil Engineering, i3CE 2025 - New Orleans, United States
Duration: May 11 2025May 14 2025

Publication series

NameComputing in Civil Engineering 2025: Resilient, Robotic, and Educational Systems - Selected Papers from the ASCE International Conference on Computing in Civil Engineering 2025

Conference

ConferenceASCE International Conference on Computing in Civil Engineering, i3CE 2025
Country/TerritoryUnited States
CityNew Orleans
Period5/11/255/14/25

All Science Journal Classification (ASJC) codes

  • Civil and Structural Engineering
  • Electrical and Electronic Engineering
  • Artificial Intelligence
  • Computer Science Applications

Fingerprint

Dive into the research topics of 'Integrating Large Language Models (LLMs) with Autonomous Aerial Drone Robotics and Computer Vision for Contextual Adaptive Construction Site Safety Management and Risk Assessment'. Together they form a unique fingerprint.

Cite this