Recommender system using deep learning and matrix factorization

Hanafi (2021) Recommender system using deep learning and matrix factorization. Doctoral thesis, Universiti Teknikal Malaysia Melaka.

Abstract

A recommender system is a machine learning application responsible for providing product recommendations to consumers in e-commerce. Such systems have been adopted by almost every e-commerce company in the world, including Amazon, Alibaba, iTunes, Google, and Netflix. Collaborative filtering (CF) is the most popular recommender system method. It calculates similarities in user behaviour expressed as ratings. In practice, however, ratings are difficult to obtain, and only a small fraction is available (0.3% to 4%). When the Netflix competition was held in 2006, many experts turned to the matrix factorization (latent factor) model, which proved to perform better than earlier work based on statistical methods. However, the performance of matrix factorization degrades when it is applied to extremely sparse data. Adding information in the form of product documents using Latent Dirichlet Allocation (LDA) is one way to solve this problem. However, LDA struggles to capture the contextual meaning of documents, because most work uses it with a categorical bag-of-words (BOW) representation. Capturing contextual meaning in Natural Language Processing (NLP) applications requires considering subtle words and word order. Another problem is that most researchers use only one kind of side information to improve matrix factorization, either user information or product information. To address the contextual problem of documents, this research proposes the use of Long Short-Term Memory (LSTM) and word embedding (WE) based on the Global Vectors for Word Representation (GloVe) method. According to the testing and evaluation, LSTM with GloVe succeeded in capturing the contextual meaning of documents in qualitative tests. To address the decrease in matrix factorization performance, this research proposes integrating LSTM with Probabilistic Matrix Factorization (PMF). According to the experiments, LSTM-PMF outperforms the best existing approach, which combines a Convolutional Neural Network (CNN) with PMF, by 1.4% on average. To combine user and item information representations, this study integrates a Stacked Denoising Autoencoder (SDAE) and LSTM into PMF. The experiments show that this model outperforms previous work by 0.92% on average. In future work, to improve how well the meaning of product documents is captured, bidirectional word vector representations and other deep learning variants should be considered.
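
As an illustration of the matrix factorization baseline the abstract refers to, the following is a minimal sketch of plain PMF-style factorization trained with stochastic gradient descent. It is not the thesis's LSTM-PMF or SDAE model. The toy ratings, the function names `train_pmf` and `rmse`, and all hyperparameters are assumptions chosen for illustration. The L2 penalty stands in for the Gaussian priors that PMF places on the user and item factors.

```python
import numpy as np

def train_pmf(ratings, n_users, n_items, n_factors=2,
              lr=0.05, reg=0.02, epochs=500, seed=0):
    """Fit latent factors on observed (user, item, rating) triples.

    Minimises squared error plus an L2 penalty on the factors
    (hypothetical hyperparameters, not taken from the thesis).
    """
    rng = np.random.default_rng(seed)
    U = 0.1 * rng.standard_normal((n_users, n_factors))  # user factors
    V = 0.1 * rng.standard_normal((n_items, n_factors))  # item factors
    for _ in range(epochs):
        for u, i, r in ratings:
            err = r - U[u] @ V[i]          # prediction error on one rating
            u_old = U[u].copy()            # keep old value for the item update
            U[u] += lr * (err * V[i] - reg * U[u])
            V[i] += lr * (err * u_old - reg * V[i])
    return U, V

def rmse(ratings, U, V):
    """Root mean squared error over the given rating triples."""
    errs = [r - U[u] @ V[i] for u, i, r in ratings]
    return float(np.sqrt(np.mean(np.square(errs))))

# Toy data (assumed): 4 users x 5 items, only 8 of 20 cells observed.
ratings = [(0, 0, 5.0), (0, 1, 3.0), (1, 0, 4.0), (1, 3, 1.0),
           (2, 1, 1.0), (2, 4, 5.0), (3, 2, 4.0), (3, 4, 4.0)]
n_users, n_items = 4, 5

U, V = train_pmf(ratings, n_users, n_items)
density = len(ratings) / (n_users * n_items)  # fraction of observed cells
print("density:", density)
print("training RMSE:", rmse(ratings, U, V))
```

Only the observed cells contribute to the updates, which is why extreme sparsity leaves many user and item factors poorly constrained. Side information from product documents or user profiles, as proposed in the thesis, is one way to constrain them further.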

Item Type: Thesis (Doctoral)
Uncontrolled Keywords: Electronic commerce, Recommender systems (Information filtering), Internet marketing
Subjects: H Social Sciences > H Social Sciences (General)
H Social Sciences > HF Commerce
Divisions: Library > Tesis > FTMK
Depositing User: F Haslinda Harun
Date Deposited: 15 Nov 2022 12:49
Last Modified: 15 Nov 2022 12:49
URI: http://eprints.utem.edu.my/id/eprint/26098