
Personalized Conversational Audio Descriptions in 360° Virtual Reality for Blind and Low-Vision Users

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Abstract

On-demand, conversational audio descriptions in 360° VR empower blind and low-vision users to actively explore immersive visual content. We present a Meta Quest demo that integrates head-pose-based view snapshots, real-time speech recognition, and GPT-4o-powered chunked text-to-speech streaming directly on-device to support multi-turn Q&A with personalized voice profiles. Our pipeline leverages chunked transfer encoding to play AI-generated audio as it's produced, minimizing perceived delay. Unlike prior VR accessibility demos reliant on static or author-crafted descriptions, our multimodal system delivers dynamic, user-driven narration for inclusive and interactive VR experiences.
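
The streaming step mentioned in the abstract can be illustrated with a minimal sketch of how a client might consume an HTTP/1.1 chunked body one chunk at a time, so that playback can begin before the full response has arrived. This is not the authors' implementation: the record does not describe their code, and the function name `iter_chunked`, the sample bytes, and the `played` list are hypothetical stand-ins chosen for illustration.

```python
import io
from typing import BinaryIO, Iterator


def iter_chunked(stream: BinaryIO) -> Iterator[bytes]:
    """Yield each chunk payload from an HTTP/1.1 chunked-encoded body."""
    while True:
        size_line = stream.readline()
        if not size_line:
            raise ValueError("stream ended before the terminating chunk")
        # The size is hexadecimal; anything after ';' is a chunk extension.
        size = int(size_line.split(b";", 1)[0].strip(), 16)
        if size == 0:
            # Last chunk: skip optional trailer fields up to the blank line.
            while stream.readline() not in (b"\r\n", b"\n", b""):
                pass
            return
        payload = stream.read(size)
        if len(payload) != size:
            raise ValueError("truncated chunk")
        stream.read(2)  # consume the CRLF that follows the payload
        yield payload


# Hypothetical bytes standing in for synthesized speech audio.
body = b"4\r\nRIFF\r\n6\r\naudio1\r\n0\r\n\r\n"
played = []
for chunk in iter_chunked(io.BytesIO(body)):
    played.append(chunk)  # a real player would enqueue this for playback
```

Because the generator yields each payload as soon as it has been read, a caller can hand audio to the output device while later chunks are still in transit, which is the source of the reduced perceived delay.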

Original language: English (US)
Title of host publication: Proceedings - 2025 IEEE International Symposium on Mixed and Augmented Reality Adjunct, ISMAR-Adjunct 2025
Editors: Ulrich Eck, Gun Lee, Alexander Plopski, Missie Smith, Qi Sun, Markus Tatzgern
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 961-962
Number of pages: 2
ISBN (Electronic): 9798331593476
State: Published - 2025
Event: 2025 IEEE International Symposium on Mixed and Augmented Reality Adjunct, ISMAR-Adjunct 2025 - Daejeon, Korea, Republic of
Duration: Oct 8 2025 to Oct 12 2025

Publication series

Name: Proceedings - 2025 IEEE International Symposium on Mixed and Augmented Reality Adjunct, ISMAR-Adjunct 2025

Conference

Conference: 2025 IEEE International Symposium on Mixed and Augmented Reality Adjunct, ISMAR-Adjunct 2025
Country/Territory: Korea, Republic of
City: Daejeon
Period: 10/8/25 to 10/12/25

All Science Journal Classification (ASJC) codes

  • Artificial Intelligence
  • Electrical and Electronic Engineering

Keywords

  • accessibility
  • audio description
  • Conversational AI
  • virtual reality
