TY - GEN
T1 - VRCopilot
T2 - 37th Annual ACM Symposium on User Interface Software and Technology, UIST 2024
AU - Zhang, Lei
AU - Pan, Jin
AU - Gettig, Jacob
AU - Oney, Steve
AU - Guo, Anhong
N1 - Publisher Copyright:
© 2024 ACM.
PY - 2024/10/13
Y1 - 2024/10/13
N2 - Immersive authoring provides an intuitive medium for users to create 3D scenes via direct manipulation in Virtual Reality (VR). Recent advances in generative AI have enabled the automatic creation of realistic 3D layouts. However, it is unclear how capabilities of generative AI can be used in immersive authoring to support fluid interactions, user agency, and creativity. We introduce VRCopilot, a mixed-initiative system that integrates pre-trained generative AI models into immersive authoring to facilitate human-AI co-creation in VR. VRCopilot presents multimodal interactions to support rapid prototyping and iterations with AI, and intermediate representations such as wireframes to augment user controllability over the created content. Through a series of user studies, we evaluated the potential and challenges in manual, scaffolded, and automatic creation in immersive authoring. We found that scaffolded creation using wireframes enhanced the user agency compared to automatic creation. We also found that manual creation via multimodal specification offers the highest sense of creativity and agency.
AB - Immersive authoring provides an intuitive medium for users to create 3D scenes via direct manipulation in Virtual Reality (VR). Recent advances in generative AI have enabled the automatic creation of realistic 3D layouts. However, it is unclear how capabilities of generative AI can be used in immersive authoring to support fluid interactions, user agency, and creativity. We introduce VRCopilot, a mixed-initiative system that integrates pre-trained generative AI models into immersive authoring to facilitate human-AI co-creation in VR. VRCopilot presents multimodal interactions to support rapid prototyping and iterations with AI, and intermediate representations such as wireframes to augment user controllability over the created content. Through a series of user studies, we evaluated the potential and challenges in manual, scaffolded, and automatic creation in immersive authoring. We found that scaffolded creation using wireframes enhanced the user agency compared to automatic creation. We also found that manual creation via multimodal specification offers the highest sense of creativity and agency.
KW - Generative AI
KW - Human-AI Co-creation
KW - Virtual Reality
UR - https://www.scopus.com/pages/publications/85215072893
UR - https://www.scopus.com/pages/publications/85215072893#tab=citedBy
U2 - 10.1145/3654777.3676451
DO - 10.1145/3654777.3676451
M3 - Conference contribution
AN - SCOPUS:85215072893
T3 - UIST 2024 - Proceedings of the 37th Annual ACM Symposium on User Interface Software and Technology
BT - UIST 2024 - Proceedings of the 37th Annual ACM Symposium on User Interface Software and Technology
PB - Association for Computing Machinery, Inc
Y2 - 13 October 2024 through 16 October 2024
ER -