簡易檢索 / 詳目顯示

研究生: 林冠伯
Guan-bo Lin
論文名稱: 區塊式流程分群演算法之發展與應用
Developing a Process Cluster Analysis Approach Based on Blocked Activities
指導教授: 歐陽超
Chao Ou-Yang
口試委員: 郭人介
Ren-Jieh Kuo
阮業春
Yeh-Chun Juan
學位類別: 碩士
Master
系所名稱: 管理學院 - 工業管理系
Department of Industrial Management
論文出版年: 2010
畢業學年度: 98
語文別: 中文
論文頁數: 89
中文關鍵詞: 流程探勘分群區塊式分群
外文關鍵詞: Process Cluster, Workflow Mining, Workflow Block
相關次數: 點閱:238下載:3
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 隨著電腦軟體的發展,企業利用流程資料庫,儲存流程事件檔已是一個非常普遍的方式,而由於商業流程的越趨複雜,流程資料庫所儲存的事件檔也就相對的非常冗長,而如此探勘出的流程架構圖,不是概括性的描述整個商業流程活動,便是過於複雜讓使用者無法容易了解,也無法輕易的擷取出,使用者所需的正確資訊。因此,如何對一個資料量非常龐大的流程資料庫,進行分群的方式,讓使用者能清楚的了解不同群組間的差異性,以及透過對分群的流程探勘,有效的取得使用者所需的正確資訊,讓使用者在看到流程圖時,就能透過圖的表達方式,快速了解商業流程的詳細架構,是一個非常重要的問題。
    因此,本研究將提出一個方法,針對流程資料庫中的流程事件檔進行分群,以此分析出造成流程差異性的原因,並透過流程探勘的方式,找出潛藏的流程資訊。
    本研究提出以區塊式的方式,對流程事件檔進行切割,將分群的區塊縮小,藉此簡化在進行分群時的計算複雜度。並運用給予不同的Transition權重,增加流程相關性的敏感度,以及不同分群群組間的差異性,最後在針對每個分群結果的流程事件檔,進行流程探勘,建構個別的流程模型,讓使用者了解不同分群間的差異性,並擷取所需的正確資訊。


    Recently, workflow automation has been widely applied in industry. Log files stored the activities sequence for each case can be used to construct the process model in terms of the developed process mining algorithm. However, due to the complexity of the process, the mined model might be very complicated and hence difficult to view and to analysis. Cluster the stored cases and to mine each group of cases can simplified this issue.
     Currently, the developed workflow clustering algorithm tends to compute the sequential relationship of the activities based on the whole record of the case. The computational efficiency will decrease a lot for a case with long sequence of activities.
     In this research, an approach based on blocking the log into several group and clustering the activity data in each group will be proposed. This approach ignores the sections of the records having the common sequential relationship and addresses on the portions where cases have diverse string of activities. That is, based on the mined model, the sequence of activities in the log will be classified as several blocks. Then the activities in each block will be clustered and the inter-block activities relationship also will be analyzed. By applying this approach, the computation efficiency will be increased for the log with long string of activities but contained common sequential relationship. In addition, the attributes for each group of cases can be analyzed by identifying the features of individual block of activities.

    目錄 第一章 緒論       1 1.1 研究背景      1 1.2 研究目的   2 1.3 論文架構 3 第二章 文獻探討與背景介紹 4 2.1 流程分群概念 5 2.1.1 現有流程分群 5 2.1.2 流程事件檔分群 7 2.2 分群演算法 8 2.2.1 分割式分群演算法(Partitional Clustering) 8 2.2.2 階層式分群演算法(Hierarchical Clustering) 9 2.3 相似度計算 10 2.4 PETRI NETS 11 2.4.1 Petri Nets概述 11 2.4.2 以Petri Nets為基之流程探勘 12 2.4.3 Petri Nets之應用:CPN tools 14 第三章 研究方法架構 16 3.1 概念階段 17 3.2 設計階段 18 3.2.1 初始探勘 19 3.2.2 因果矩陣 20 3.2.3 區塊切割 21 3.2.4 分群 23 3.2.5 群組連結 27 3.2.6 結果評估 28 3.2.7 流程探勘 32 第四章 實作研究 33 4.1 隨機流程設計 37 4.1.1 Sequence活動數量大於And&Xor活動數量 39 4.1.2 Sequence活動數量小於And&Xor活動數量 47 4.1.3 Sequence活動數量等於And&Xor活動數量 53 4.2 中風醫療流程資料庫實作 61 4.2.1 本研究之流程分群演算法 61 4.2.2 現有之流程分群演算法 77 4.3 實驗分析 79 第五章 結論與建議 80 5.1 討論 80 5.2 建議 81 參考文獻 82 附錄A 完整流程活動表 85 附錄B 病況轉換欄位表 88

    參考文獻
    [1] J. Jung and J. Bae, Workflow Clustering Model Based on Process Similarity, Springer-Verlag, pp. 379-389, 2006.
    [2] J. Jung, J. Bae, and L. Liu, Hierarchical Business Process Clustering, IEEE International Conference on Services Computing, pp. 613-616, 2008.
    [3] C. Li, M. Reichert, and A. Wombacher, "Discovering Process Reference Models from Process Variants Using Clustering Techniques," IEEE International Conference on Web Services, 2008.
    [4] G. Greco, A. Guzzo, and L. Pontieri, "Discovering Expressive Process Models by Clustering Log Traces," IEEE Transactions on Knowledge and Data Engineering vol. 18 no. 8, pp. 1010-1027, 2006.
    [5] M. Song, C. W. Günther, and W. M. P. Aalst, "Trace Clustering in Process Mining," Business Process Management Workshops, vol. 17, pp. 109-120, 2008.
    [6] V. Guralnik and G. Karypis, "A Scalable Algorithm for Clustering Sequential Data," vol. Proc. IEEE Int'l Conf. Data Maning (ICDM '01), pp. 179-186, 2000.
    [7] A. Guzzo, G. Greco, B. V. Dongen, and D. Saccà, "Process Mining Based on Clustering: A Quest for Precision " vol. 4928/2008, pp. 17-29, 2008.
    [8] K. Alsabti, S. Ranka, and V. Singh, "An Efficient K-Means Clustering Algorithm," http://www.cise.ufl.edu/~ranka/, 1998.
    [9] P. S. Bradley, K. P. Bennett, and A. Demiriz, "Constrained k-means clustering," vol. MSR-TR-2000-65, Microsoft Research 2000.
    [10] K. Lagus, T. Honkela, S. Kaski, and T. Kohonen, "Self-Organizing Maps of Document Collections: A New Approach to Interactive Exploration," Proceedings Second International Conference on Knowledge Discovery & Data Mining, pp. 238-243, 1996.
    [11] A. Ultsch, Self-organizing Neural Networks for Visualization and Classification, Springer, Berlin, pp. 307-313, 1993.
    [12] H. Koga, T. Ishibashi, and T. Watanabe, "Fast agglomerative hierarchical clustering algorithm using Locality-Sensitive Hashing," Knowledge and Information Systems, vol. 12, Number 1 / 2007年5月, pp. 25-53, 2006.
    [13] A. S. Lampropoulos and G. A. Tsihrintzis, "Agglomerative Hierarchical Clustering For Musical Database Visualization and Browsing," In Proc. 3rd Hellenic Conference on Artificial Intelligence, Samos Island, 2004.
    [14] 林惠玲 and 陳正倉, "統計學方法與應用," 雙葉書廊, 1999.
    [15] 林惠玲 and 陳正倉, "應用統計學," 雙葉書廊, pp. 239-592, 1999.
    [16] T. Murata, "Petri Nets: Properties, Analysis and Applications," vol. Proceedings of the IEEE, 77(4), pp. 541-580, 1989.
    [17] A. A. Desrochers and R. Y. Al-Jaar, Applications of Petri nets in manufacturing systems: modeling, control, and performance analysis, The Institute of Electrical and Electronics Engineers, pp. 56-58, 1994.
    [18] W. M. P. V. D. Aalst, A. J. M. M. Weijters, and L. Maruster, "Workflow Mining: Discovering Process Models from Event Logs," IEEE Trans. Knowledge and Data Eng, vol. 16, no. 9, pp. 1128-1142, 2004.
    [19] R. Agrawal, D. Gunopulos, and F. Leymann, "Mining process models from workflow logs " Lecture Notes in Computer Science, vol. Volume 1377/1998, pp. 467-483, 1998.
    [20] M. A. Marsan, G. Conte, and G. Balbo, "A Class of Generalized Stochastic Petri Nets for the Performance Evaluation of Multiprocessor Systems," ACM Transactions on Computer Systems (TOCS) vol. 2 pp. 93-122, 1984.
    [21] W. M. P. V. D. Aalst, B. F. V. Dongena, J. Herbst, L. Marustera, G. Schimm, and A. J. M. M. Weijtersa, "Workflow mining: A survey of issues and approaches " Data & Knowledge Engineering, vol. 47, pp. 237-267, 2003.
    [22] L. Wen, J. Wang, and J. Sun, "Detecting Implicit Dependencies Between Tasks from Event Logs," Lecture Notes in Computer Science, vol. 3841/2006, pp. 591-603, 2006.
    [23] L. Wen, W. M. P. V. D. Aalst, J. Wang, and J. Sun, "Mining process models with non-free-choice constructs," Data Mining and Knowledge Discovery, vol. 15, Number 2 / 2007年10月, pp. 145-180, 2007.
    [24] A. J. M. M. Weijters and W. M. P. V. D. Aalst, "Rediscovering Workflow Models from Event-Based Data Using Little Thumb," Integrated Computer-Aided Eng, vol. 10, no. 2, pp. 151-162, 2003.
    [25] A. Weijters, W. M. P. V. D. Aalst, and A. K. A. D. Medeiros, "Process Mining with the Heuristics Miner Algorithm," Citeseer, 2006.
    [26] A. Medeiros, A. Weijters, and W. Aalst, "Genetic process mining: an experimental evaluation," Data Mining and Knowledge Discovery, vol. 14, Number 2, pp. 245-304, 2007.
    [27] A. K. A. D. Medeiros and C. W. Gunther, "Process mining: Using cpn tools to create test logs for mining algorithms," In K.Jensen, editor, Proceedings of the Sixth Workshop on the Practical Use of Coloured Petri Nets and CPN Tools vol. 576 of DAIMI, 2004.
    [28] C. W. Günther and W. M. P. V. D. Aalst, "A Generic Import Framework For Process Event Logs," Lecture Notes in Computer Science, vol. 4103/2006, pp. 81-92, 2006.
    [29] A. Rozinat and R. S. Mans, "Mining CPN Models: Discovering Process Models With Data from Event Logs," In K. Jensen, editor, Proceedings of the Seventh Workshop on the Practical Use of Coloured Petri Nets and CPN Tools, pp. 57-76, 2006.

    QR CODE