TY - GEN
T1 - Interactive Scene Graph Analysis for Future Intelligent Teleconferencing Systems
AU - Wu, Mingyuan
AU - Lu, Yuhan
AU - Trivedi, Shiv
AU - Chen, Bo
AU - Zhou, Qian
AU - Wang, Lingdong
AU - Singh, Simran
AU - Zink, Michael
AU - Sitaraman, Ramesh
AU - Chakareski, Jacob
AU - Nahrstedt, Klara
N1 - Publisher Copyright: © 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - In a real-life meeting environment, individuals often demonstrate a remarkable ability to selectively focus their attention on specific visual information. This ability allows them to naturally concentrate on a specific region of interest while tuning out others. Understanding and exploiting such selective attention remains unexplored in a user-centric teleconferencing system, where there is a potential to customize video streaming and foveated rendering based on the viewer's attention. This paper proposes a novel user-centric scene analysis module that fully leverages the power of selective attention for online meeting scenarios and recognizes the unequal importance of individual pixels in the videos. The module determines the user's selective attention through the meeting contexts. The contextual representation of the meeting is modeled as a combination of two primary components: proactive user interaction within the system and passive real-time analysis of high-level visual semantics from the scenes. As the meeting progresses, the interactive scene analysis module dynamically updates its contextual representation, offering a dual advantage: (a) Videos can be selectively and adaptively streamed within a user's attention, resulting in bandwidth savings of up to 78 percent. (b) The module enhances the overall quality of the user experience by facilitating higher user interactivity, particularly in meeting-related tasks such as screen sharing, privacy-preserving user blocking, background removal, automatic user attention shift detection, etc. Our interactive scene analysis module makes significant progress toward enabling an efficient, immersive, and intelligent teleconferencing system.
KW - Interactive Scene Analysis
KW - Teleconferencing System
KW - User-centric System
UR - http://www.scopus.com/inward/record.url?scp=85190309639&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85190309639&partnerID=8YFLogxK
U2 - 10.1109/ISM59092.2023.00048
DO - 10.1109/ISM59092.2023.00048
M3 - Conference contribution
AN - SCOPUS:85190309639
T3 - Proceedings - 2023 IEEE International Symposium on Multimedia, ISM 2023
SP - 251
EP - 255
BT - Proceedings - 2023 IEEE International Symposium on Multimedia, ISM 2023
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2023 IEEE International Symposium on Multimedia, ISM 2023
Y2 - 11 December 2023 through 13 December 2023
ER -