Affiliations: School of Engineering & Science, Victoria University, PO Box 14428, VIC 8001, Australia, E-mail: {Guandong.Xu, Yanchun.Zhang, Xun.Yi}@vu.edu.au | School of Computer Science & Technology, Wuhan University of Technology, China, E-mail: [email protected] | Institute of Industrial Science, University of Tokyo, Japan, E-mail: [email protected]
Note: [] Corresponding author.
Abstract: Web users exhibit a variety of navigational interests through clicking a sequence of Web pages. Analyses of Web usage data will lead to discovering Web user access patterns, and in turn, facilitating users to locate more preferable Web contents via collaborative recommendation techniques. In the context of Web usage mining, Latent Semantic Analysis (LSA) based on probability inference provides a promising approach to capture not only user hidden navigational patterns, but also the associations between users, pages and hidden navigational patterns. The discovered user access patterns could be used as a usage reference base for identifying the new user's access preferences and making usage-based collaborative Web recommendations. In this paper, we propose a novel usage-based Web recommendation framework, in which Latent Dirichlet Allocation (LDA) is employed to learn the underlying task space from the training Web log data and infer the task distribution for a target user via task inference. The main advantages of the adapted LDA model are its capabilities of efficiently learning the semantic usage information from the Web log data and effectively inferring the access preference of the target user even with a few Web clicks that might be unseen in the training data. In this paper, we also investigate the determination of an optimizing task number, which is another important problem commonly encountered in latent semantic analysis. Experiments conducted on a real Web log dataset show that this approach can achieve better recommendation performance in comparison to other existing techniques. And the discovered task-simplex expression can also provide a better interpretation for Web designers or users to better understand the user navigational preference.
Keywords: User modeling, Web recommendation, hidden navigational task mining, latent Dirichlet allocation