TY - GEN
T1 - Privacy-Aware Federated Learning for Page Recommendation
AU - Zhao, Shuai
AU - Bharati, Roshani
AU - Borcea, Cristian
AU - Chen, Yi
N1 - Publisher Copyright:
© 2020 IEEE.
PY - 2020/12/10
Y1 - 2020/12/10
N2 - Traditional page recommendation models are endangered by stricter privacy regulations, such as the General Data Protection Regulation (GDPR). The performance of these models suffer when only a part of the users share their personal data, such as cookies, with web servers, while the rest of the users choose to opt-out from sharing these data. Furthermore, these models are not designed to provide recommendations for users who do not share their data. This paper addresses the question of how to provide good page recommendations to all users, independent of their privacy attitudes. We propose Fed4Rec, a privacy-preserving framework for page recommendation based on federated learning (FL) and model-agnostic meta-learning (MAML), which allows machine learning models to train on data collected from both public users, who share data with the server, and private users, who do not share data with the server. Fed4Rec enables recommendations for both public users, computed at the server, and private users, computed at their local devices. Private users' data are stored only on user devices and never shared with the server. FL is used to train on local data, and Fed4Rec shares with the server only partial model parameters, computed on local devices. MAML is used to jointly train on the public data and the model parameters from the private users. We compare Fed4Rec against several baseline frameworks, using a publicly available dataset from a large news portal. The results show that Fed4Rec outperforms the baselines in terms of recommendation accuracy. We also conduct one ablation study to examine the impact of varying the ratio between the number of public and private users. Fed4Rec performs better than the baselines for all ratios, but it is especially beneficial w hen t he p ercentage of public users is low.
AB - Traditional page recommendation models are endangered by stricter privacy regulations, such as the General Data Protection Regulation (GDPR). The performance of these models suffer when only a part of the users share their personal data, such as cookies, with web servers, while the rest of the users choose to opt-out from sharing these data. Furthermore, these models are not designed to provide recommendations for users who do not share their data. This paper addresses the question of how to provide good page recommendations to all users, independent of their privacy attitudes. We propose Fed4Rec, a privacy-preserving framework for page recommendation based on federated learning (FL) and model-agnostic meta-learning (MAML), which allows machine learning models to train on data collected from both public users, who share data with the server, and private users, who do not share data with the server. Fed4Rec enables recommendations for both public users, computed at the server, and private users, computed at their local devices. Private users' data are stored only on user devices and never shared with the server. FL is used to train on local data, and Fed4Rec shares with the server only partial model parameters, computed on local devices. MAML is used to jointly train on the public data and the model parameters from the private users. We compare Fed4Rec against several baseline frameworks, using a publicly available dataset from a large news portal. The results show that Fed4Rec outperforms the baselines in terms of recommendation accuracy. We also conduct one ablation study to examine the impact of varying the ratio between the number of public and private users. Fed4Rec performs better than the baselines for all ratios, but it is especially beneficial w hen t he p ercentage of public users is low.
KW - deep learning
KW - federated learning
KW - metalearning
KW - page recommendation
KW - privacy regulation
UR - http://www.scopus.com/inward/record.url?scp=85103839277&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85103839277&partnerID=8YFLogxK
U2 - 10.1109/BigData50022.2020.9377942
DO - 10.1109/BigData50022.2020.9377942
M3 - Conference contribution
AN - SCOPUS:85103839277
T3 - Proceedings - 2020 IEEE International Conference on Big Data, Big Data 2020
SP - 1071
EP - 1080
BT - Proceedings - 2020 IEEE International Conference on Big Data, Big Data 2020
A2 - Wu, Xintao
A2 - Jermaine, Chris
A2 - Xiong, Li
A2 - Hu, Xiaohua Tony
A2 - Kotevska, Olivera
A2 - Lu, Siyuan
A2 - Xu, Weijia
A2 - Aluru, Srinivas
A2 - Zhai, Chengxiang
A2 - Al-Masri, Eyhab
A2 - Chen, Zhiyuan
A2 - Saltz, Jeff
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 8th IEEE International Conference on Big Data, Big Data 2020
Y2 - 10 December 2020 through 13 December 2020
ER -