Affiliations: [a] School of Computer and Artificial Intelligence, Wuhan University of Technology, China. E-mails: [email protected], [email protected], [email protected] | [b] Qingdao Dagang Customs House, People’s Republic of China. E-mail: [email protected] | [c] Department of Computer, Information and Media Science, Department of System Innovation, Faculty of Science and Engineering, IWATE University, Japan. E-mail: [email protected] | [d] School of Science, University of Southern Queensland, Australia. E-mail: [email protected]
Abstract: Learning-based credit prediction has attracted great interest from both academia and industry. Each institution holds only a limited amount of credit data, covering a limited set of users, with which to build a model, so institutions need data from one another to improve model performance. However, privacy protection requirements and legal restrictions make data exchange difficult, which limits the performance of credit prediction. To solve this problem, this paper proposes a federated learning based semi-supervised credit prediction approach enhanced by multi-layer label mean, which aggregates the parameters of each institution via joint training while protecting the data privacy of each institution. Moreover, in practice there is usually more unlabeled credit data than labeled data, and the distribution of the feature space presents multiple data-dense divisions. To deal with these issues, a local meanNet model is proposed, built on a multi-layer label mean based semi-supervised deep learning network. In addition, this paper introduces a cost-sensitive loss function in the supervised part of the local mean model. Experiments conducted on two public credit datasets show that the proposed federated learning based approach achieves promising credit prediction performance in terms of Accuracy and F1 measures. At the same time, the framework design, which separates data aggregation from key management, can improve the security of data privacy and enhance the flexibility of model training.
Keywords: Federated learning, credit prediction, label mean, semi-supervised deep learning