Basic Search / Detailed Display

Author: 何昶毅
Chang-Yi Ho
Thesis Title: 自關注高斯嵌入模型之多訊息融合機制研究-應用於推薦系統
Fusion Mechanism of Multiple Messages on Self-Attentive Gaussian Embedding Model for Recommendation System
Advisor: 林伯慎
Bor-Shen Lin
Committee: 林伯慎
Bor-Shen Lin
楊傳凱
Chuan-Kai Yang
賴源正
Yuan-Cheng Lai
Degree: 碩士
Master
Department: 管理學院 - 資訊管理系
Department of Information Management
Thesis Publication Year: 2023
Graduation Academic Year: 111
Language: 中文
Pages: 64
Keywords (in Chinese): 推薦系統高斯嵌入自關注融合機制
Keywords (in other languages): Recommendation System, Gaussian Embedding, Self-attention, Fusion Mechanism
Reference times: Clicks: 304Downloads: 15
Share:
School Collection Retrieve National Library Collection Retrieve Error Report
  • 近年來,引入時序概念的序列推薦成為推薦系統中的重要研究方向。在序列推薦模型中,引入自關注機制能改善模型容易遺失序列過久以前資訊的問題,且能對序列的項目根據其相關度給予不同權重,以提升預測準確率;引入嵌入模型能學習使用者偏好與項目的表示,若使用高斯嵌入則能進一步能學習到嵌入表示的變異性;附屬資訊融合則是利用項目附屬資訊輔助來提升模型預測能力。這些改進方法都能有效提升模型的預測效能,然而因其各自有複雜網路結構,目前仍沒有方法能結合這些模型。本論文的研究嘗試結合自關注機制、高斯嵌入模型、與多重附屬訊息融合機制,提出了多訊息自關注高斯嵌入融合模型。
    我們在電影推薦資料集MovieLens 1M上對本論文提出的推薦模型進行實驗。首先,我們使用單一附屬資訊「導演」,探討將其融合至高斯嵌入模型的融合架構,比較了「只對平均向量融合」、「只對共變異矩陣融合」以及「同時對兩者融合」等三種架構。實驗結果顯示,同時對平均向量和共變異矩陣都進行融合的架構能獲得最好的效能。接著,我們探討項目與附屬資訊融合的加權方式,比較了「相加」、「串接」、「維度加權」以及「附屬資訊加權」等方法。實驗結果顯示,本論文提出的「附屬資訊加權」的方式有最好的預測效能。基於此融合架構與融合加權方式,考慮所有項目的前5名精確率可達0.0649,MAP可達0.1015,優於未融合附屬資訊的嵌入模型。進一步,我們同時使用「導演」與「風格」兩種附屬資訊進行多訊息融合的實驗;結果發現,多訊息融合比起單一附屬資訊融合,其效能可再往上提升,前5名精確率可達0.0658,MAP可達0.1026。另外,我們將融合模型在資料較充足的遊戲推薦資料集Steam上進行實驗,想了解附屬資訊對此資料集的預測效果。我們發現,在資料充足的情況下,項目資訊就足以訓練出良好的預測模型,若融合鑑別力較低的遊戲類型資訊(23,902個遊戲、13種類型),效能並沒有提升。為了進一步探究其原因,我們將訓練集依照比例切割為不同大小,進行實驗;實驗結果顯示,遊戲類型資訊在訓練資料量不足時,才會有輔助改進的效果。


    Sequential recommendation has recently become an important area of research in recommender systems. State-of-the-art sequential recommendation models were improved with several ways. Self-attention, for example, can make use of long-distance correlation to improve the prediction by weighting the items in the sequence according to their relevance. Embedding model, on the other hand, can learn to represent user preferences and items, and even model their variabilities if Gaussian embedding is further applied. Additionally, fusion of side information for the items can improve the performance since the predictive capability of the model could get benefits from the extra information. Though these models are effective for improve the predictive performance of recommendation systems, there is however no one combining these approaches, perhaps due to their respective complicated network architectures. In this research, a new model integrating self-attention, Gaussian embedding model, and fusion mechanism of multiple messages of side information was proposed and called self-attended Gaussian embedding fusion model.
    A series of experiments were conducted for the proposed model on the recommendation datasets of MovieLens 1M. First, we investigate the fusion of single side information of director. The fusion architecture of side information for Gaussian embedding model was investigated. Experimental results show that fusion on both 'mean’ and 'covariance' for Gaussian embedding is superior to fusion on either 'mean' or 'covariance' alone. Additionally, the fusion weighting method for item and side information was investigated. We compared four methods: 'add', 'concat', 'dimension weighting', and 'side information weighting'. Experimental results show that 'side information weighting' is superior to other fusion weighting methods. Based on the fusion architecture and fusion weighting method, experimental results show that top-five precision and mean average precision (MAP) may reach 0.0649 and 0.1015, respectively, which outperform the models without using side information. When multiple messages of side information, director and genre, were fused, top-five precision and MAP can rise up to 0.0658 and 0.1026, respectively.
    Moreover, the proposed fusion model was tested on another dataset, Steam, which is a game recommendation dataset with sufficient data relatively. From the experiments it was found that incorporation of less discriminative side information (13 genres for 23,902 games) did not help to improve performance. To explore the reason, we further conducted experiments for the training set in different scales, and experimental results show that the less discriminative side information like genre is effective for improving the prediction performance only when the training data is insufficient .

    第1章 序論 1.1 研究背景與動機 1.2 研究主要成果 1.3 論文組織與架構 第2章 文獻回顧 2.1 推薦系統 2.1.1 內容導向過濾 2.1.2 協同過濾 2.1.3 混合型推薦 2.2 序列推薦 2.2.2 序列預測問題類型 2.2.3 序列推薦模型 2.3 使用者偏好不確定性的序列推薦模型 2.4 含有附屬資訊的序列推薦模型 2.5 序列推薦評估指標 2.6 本章摘要 第3章 自關注高斯嵌入融合模型 3.1 模型架構 3.1.1 自關注高斯嵌入融合網路 3.1.2 平均向量自關注融合模組 3.1.3 共變異矩陣自關注融合模組 3.1.4 融合加權方法 3.1.5 使用者與候選項目相似度計算 3.1.6 損失函數 3.2 實驗設定 3.2.1 資料集介紹與資料前處理 3.2.2 訓練設定 3.3 比較實驗 3.3.1 嵌入模型之比較 3.3.2 向量嵌入模型融合附屬資訊之比較 3.4 基礎實驗 3.4.1 高斯嵌入模型與附屬資訊融合架構之比較 3.4.2 附屬資訊融合權重實驗 3.4.3 融合多附屬資訊之實驗 3.4.4 不同融合加權方法比較 3.4.5 不同距離度量方法之比較 3.5 實際案例比較 3.6 本章摘要 第4章 自關注高斯嵌入融合模型應用於遊戲推薦之探討 4.1 實驗設定 4.1.1 實驗資料集與參數設定 4.1.2 基礎實驗 4.1.3 不同比例訓練資料之實驗 4.2 本章摘要 第5章 結論 參考文獻

    [1] W.-C. Kang, J. McAuley, “Self-Attentive Sequential Recommendation,” ICDM2018, 20 8 2018.
    [2] Z. Fan, Z. Liu, L. Zheng, S. Wang, Yu, “Modeling Sequences as Distributions with Uncertainty for Sequential Recommendation,” CIKM 2021, 2021.
    [3] Y. Xie, P. Zhou, S. Kim, “Decoupled Side Information Fusion for Sequential Recommendation,” SIGIR'22, 23 4 2022.
    [4] T. Hofmann, “Latent Semantic Models for collaborative filtering,” ACM Transactions on Information SystemsVol. 22, No. 1, pp. 89-115, 1 2004.
    [5] X. Ning, G. Karypis, “SLIM: Sparse Linear Methods for Top-N Recommender Systems,” 2011 IEEE 11th International Conference on Data Mining, 12 2011.
    [6] H. Fang, D. Zhang, Y. Shu, G. Guo, “Deep Learning for Sequential Recommendation: Algorithms” arXiv:1905.01997, 30 4 2019.
    [7] G. Linden, B. Smith., J.York, “Amazon.com Recommendations Item-to-Item Collaborative Filtering,” IEEE Internet Computing, pp. 76-80, 22 1 2003.
    [8] S. Rendle, C. Freudenthaler, L. Schmidt-Thieme, “Factorizing Personalized Markov Chains for Next-Basket Recommendation,” WWW 2010, pp. 811-820, 26 4 2010.
    [9] R. He, J. McAuley, “Fusing Similarity Models with Markov Chains for Sparse Sequential Recommendation,” Information Retrieval (cs.IR) arXiv:1609.09152, 28 9 2016.
    [10] B. Hidasi, A. Karatzoglou, L. Baltrunas, D. Tikk, “Session-based Recommendations with Recurrent Neural Networks,” Machine Learning (cs.LG) arXiv:1511.06939, 17 2 2016.
    [11] J. Tang, K. Wang, “Personalized Top-N Sequential Recommendation via Convolutional Sequence Embedding,” Information Retrieval (cs.IR); Machine Learning (cs.LG) arXiv:1809.07426, 19 9 2018.
    [12] A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, L. Kaiser, I. Polosukhin, “Attention Is All You Need,” NIPS 2017, 2017.
    [13] K.-h. Chen, “資訊檢索的績效評估 Performance Evaluations for Information Retrieval,” 於 2004年現代資訊組織與檢索研討會, 2004.
    [14] S. Rendle, C. Freudenthaler, Z. Gantner, L. Schmidt-Thieme, “BPR: Bayesian Personalized Ranking from Implicit Feedback,” UAI 2009, 2009.
    [15] C. C. Aggarwal, Recommender Systems: The Textbook, 2016.
    [16] J. Duchi, “Derivations for Linear Algebra and Optimization”.
    [17] R. Kashyap, “The Perfect Marriage and Much More: Combining Dimension Reduction, Distance Measures and Covariance,” 2019.

    QR CODE