TY - GEN
T1 - Speech privacy attack via vibrations from room objects leveraging a phased-MIMO radar
AU - Shi, Cong
AU - Zhang, Tianfang
AU - Xu, Zhaoyi
AU - Li, Shuping
AU - Yuan, Yichao
AU - Petropulu, Athina
AU - Wu, Chung Tse Michael
AU - Chen, Yingying
N1 - Publisher Copyright:
© 2022 Owner/Author.
PY - 2022/6/27
Y1 - 2022/6/27
N2 - Speech privacy leakage has long been a public concern. Through speech eavesdropping, an adversary may steal a user's private information or an enterprise's financial/intellectual properties, leading to catastrophic consequences. Existing non-microphone-based eavesdropping attacks rely on physical contact or line-of-sight between the sensor (e.g., a motion sensor or a radar) and the victim sound source. In this poster, we discover a new form of speech eavesdropping attack that senses minor speech-induced vibrations upon common room objects using mmWave. By integrating phasedarray and multiple-input and multiple-output (MIMO) on a single mmWave transceiver, our attack can capture and fuse micrometerlevel vibrations upon the surfaces of multiple objects to reveal speech content in a remote and non-line-of-sight fashion. We successfully demonstrate such an attack by developing a deep speech recognition scheme grounded on unsupervised domain adaptation. Without prior training on the victim's data, our attack can achieve a high success rate of over 90% in recognizing simple speech content.
AB - Speech privacy leakage has long been a public concern. Through speech eavesdropping, an adversary may steal a user's private information or an enterprise's financial/intellectual properties, leading to catastrophic consequences. Existing non-microphone-based eavesdropping attacks rely on physical contact or line-of-sight between the sensor (e.g., a motion sensor or a radar) and the victim sound source. In this poster, we discover a new form of speech eavesdropping attack that senses minor speech-induced vibrations upon common room objects using mmWave. By integrating phasedarray and multiple-input and multiple-output (MIMO) on a single mmWave transceiver, our attack can capture and fuse micrometerlevel vibrations upon the surfaces of multiple objects to reveal speech content in a remote and non-line-of-sight fashion. We successfully demonstrate such an attack by developing a deep speech recognition scheme grounded on unsupervised domain adaptation. Without prior training on the victim's data, our attack can achieve a high success rate of over 90% in recognizing simple speech content.
KW - mmWave sensing
KW - phased-MIMO
KW - speech privacy attack
UR - http://www.scopus.com/inward/record.url?scp=85134046524&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85134046524&partnerID=8YFLogxK
U2 - 10.1145/3498361.3538790
DO - 10.1145/3498361.3538790
M3 - Conference contribution
AN - SCOPUS:85134046524
T3 - MobiSys 2022 - Proceedings of the 2022 20th Annual International Conference on Mobile Systems, Applications and Services
SP - 573
EP - 574
BT - MobiSys 2022 - Proceedings of the 2022 20th Annual International Conference on Mobile Systems, Applications and Services
PB - Association for Computing Machinery, Inc
T2 - 20th ACM International Conference on Mobile Systems, Applications and Services, MobiSys 2022
Y2 - 27 June 2022 through 1 July 2022
ER -