簡易檢索 / 詳目顯示

研究生: 范美華
Mei-hua Fan
論文名稱: 在動態式P2P網路上 Non-Derivable 項目集探勘之研究
Mining Non-Derivable Frequent Itemsets on Dynamic P2P Network
指導教授: 陳秋華
Chyou-hwa Chen
口試委員: 邱舉明
none
戴碧如
none
李育杰
Yuh-jye Li
學位類別: 碩士
Master
系所名稱: 電資學院 - 資訊工程系
Department of Computer Science and Information Engineering
論文出版年: 2010
畢業學年度: 98
語文別: 中文
論文頁數: 51
中文關鍵詞: 大項目集探勘非推導式的項目集探勘動態式網路點對點網路P2P網路
外文關鍵詞: Non-Derivable Itemsets Mining, Dynamic Network
相關次數: 點閱:190下載:0
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 本論文針對在動態的P2P網路上,分散式大項目集探勘的問題作研究。本論文利用非推導式大項目集(Non-derivable Itemset Mining : NDI)的方法,在P2P網路上做大項目集的探勘,並且當網路拓撲發生改變(動態事件)時如:有節點加入或離開,需要將之前收集到的資訊做一些修改,本論文提出了一個處理方法,使得當有節點加入或離開時,不用整個網路放棄原本已經探勘過的資訊,我們可以直接做一些修改處理,不用全部重算,如果全部重新探勘,是很耗時又費力的,也浪費了動態事件發生前的資訊。


    We propose a number of distributed frequent itemset mining algorithms by extending previous efficient parallel algorithms that exploit the downward closure property. We show that the approach that exploits downward closure property is much more efficient, and more scalable in a distributed setting than approaches.

    目錄 誌 謝-------------------I 摘 要-------------------II ABSTRACT-------------------III 表目錄-------------------V 圖目錄-------------------VI 一、緒論-------------------1 1.1 研究動機-------------------1 1.2 研究目的-------------------2 二、大項目集探勘-------------------3 2.1 非推導式的項目集探勘(Non-derivable itemset mining:NDI)-------------------3 2.3 Non-Derivable Frequent Itemsets Tree (NDFIT)-------------------8 三、分散式系統架構-------------------16 3.1分散式(Distributed)大項目集探勘技術-------------------16 3.2分散式資料匯集樹作為大項目集探勘基礎-------------------26 3.4動態分散式的非推導式的項目集探勘(Dynamic Distributed NDI : DDNDI)-------------------29 五、結論-------------------49 六、參考文獻-------------------50

    六、參考文獻
    1. “Non-Derivable Itemset Mining” T. Calders, and B. Goethals, Data Mining and Knowledge Discovery;
    2. “ining non-derivable frequent itemsets over data stream”, Haifeng Lia, Hong Chena,b,*
    3. Ran Wolff and Assaf Schuster, “Association Rule Mining in Peer-to-Peer Systems,” IEEE Transactions on Systems, Man, and Cybernetics, part B, 34(6), 2004
    4. R. Agrawal and J. Shafer. Parallel mining of association rules. IEEE Transactions on Knowledge and Data Engineering, 8(6):962 . 969, 1996
    5. R. Agrawal, and R. Srikant, “Fast Algorithms for Mining Association Rules in Large Databases,” Proceedings of the 20th International Conference on Very Large Data Bases, (VLDB'94), Sep. 1994
    6. R. J. Bayardo Jr. Efficiently mining long patterns from databases. In SIGMOD, 1998.
    7. Boyd, A. Ghosh B. Prabhakar and D. Shah, “Randomized gossip algorithms,”IEEE Transactions on Information Theory, vol. 52, no. 6, June 2006
    8. James Cheng, Yiping Ke, and Wilfred Ng. A Survey on Algorithms for Mining Frequent Patterns over Data Streams. In Knowledge and Information Systems Journal (KAIS), 16(1): 1-27, 2008
    9. Y-L. Cheng and A. Fu, “Mining Frequent Itemsets without Support Threshold: With and Without Item Constraints”, Journal of IEEE Transactions on Knowledge and Data Engineering, 16(9), 2004
    10. D.W. Cheung, S.D. Lee, V. Xiao, "Effect of Data Skewness and Workload Balance in Parallel Data Mining," IEEE Transactions on Knowledge and Data Engineering, vol. 14, no. 3, pp. 498-514, 2002
    11. Laukik Chitnis, Alin Dobra, Sanjay Ranka: Aggregation methods for large-scale sensor networks. IEEE Transactions of Sensor Networks 4(2):(2008)
    12. Laukik Chitnis, Alin Dobra, Sanjay Ranka: Fault tolerant aggregation in heterogeneous sensor networks, Journal of Parallel and Distributed Computing 69(2):(Feb 2009) 210--219
    13. Jiawei Han and Micheline Kamber, Data Mining: Concepts and Techniques, 2nd edition, Morgan Kaufmann, 2006
    14. Eui-Hong (Sam) Han, George Karypis, and Vipin Kumar. Scalable parallel data mining for association rules. IEEE Transactions on Knowledge and Data Engineering, 12(3):352~ 377, 2000
    15. J Han, H Cheng, D Xin, X Yan, Frequent pattern mining: current status and future directions, Data Mining and Knowledge Discovery, 2007
    16. R Jin, S McCallen, Y Breitbart, D Fuhry, DWang, Estimating the number of frequent itemsets in a large database, Proc. International Conf. on Extending Database Technology, 2009
    17. David Kempe, Alin Dobra, Johannes Gehrke, Gossip-based computation of aggregate information, 44th Annual IEEE Symposium on Foundations of Computer Science, 2003
    18. Hua-Fu Li, MK Shan, SY Lee, “DSM-FI: an efficient algorithm for mining frequent itemsets in data streams”,Knowledge and Information Systems, 2008 –Springer
    19. Hua-Fu Li and Suh-Yin Lee, Approximate mining of maximal frequent itemsets in data streams with different window models, Expert Systems with Applications, Volume 35, Issue 3, October 2008, Pages 781-789
    20. Hongyan Liu, Yuan Lin and Jiawei Han, Methods for mining frequent items in data streams: an overview, Knowledge and Information Systems, 2009
    21. Park, J.-S., Chen, M.-S., Yu, P.S.: An effective hash based algorithm for mining association rules. In: Proceedings of ACM SIGMOD (1995)
    22. O. Zaiane, M. El-Hajj, P. Lu, Fast parallel association rule mining without candidacy generation, in: IEEE International Conference on Data Mining, 2001
    23. Mohammad El-Hajj, Osmar R. Zaïane: Parallel Leap: Large-Scale Maximal Pattern Mining in a Distributed Environment. ICPADS (1) 2006: 135-142
    24. Samuel Madden, Michael J. Franklin, Joseph M. Hellerstein, Wei Hong, Tag: A tiny aggregation service for ad-hoc sensor networks, SIGOPS Operating Systems Review 36 (SI) (2002) 131_146.
    25. Mengkun Yang, Zongming Fei “A Proactive Approach to Reconstructing Overlay Multicast Trees” , Proceedings of INFOCOM 2004, Hong Kong (March, 2004)

    無法下載圖示 全文公開日期 2015/08/02 (校內網路)
    全文公開日期 本全文未授權公開 (校外網路)
    全文公開日期 本全文未授權公開 (國家圖書館:臺灣博碩士論文系統)
    QR CODE