簡易檢索 / 詳目顯示

研究生: 黃盈笛
Ying-Di Huang
論文名稱: 應用蜂群最佳化演算法於自動核心分群之研究
Automatic Kernel Clustering with Bee Colony Optimization Algorithm
指導教授: 郭人介
Ren-Jieh Kuo
口試委員: 楊朝龍
C.-L. Yang
蔡介元
Chieh-Yuan Tsai
學位類別: 碩士
Master
系所名稱: 管理學院 - 工業管理系
Department of Industrial Management
論文出版年: 2012
畢業學年度: 100
語文別: 中文
論文頁數: 82
中文關鍵詞: 自動分群分群分析粒子群最佳化演算法蜂群最佳化演算法核心函數攝護腺癌症
外文關鍵詞: Automatic Clustering, Cluster Analysis, Particle Swarm Optimization Algorithm, Bee Colony Optimization, Kernel Function, Prostate Cancer
相關次數: 點閱:259下載:6
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 由於資訊科技日新月異,新的群集分析方法不斷地被提出,但是大多分群方法都需要先設定欲分群之群數,因此本研究主要提出一自動核心分群之方法-Automatic Kernel Clustering with Bee Colony Optimization Algorithm (AKC-BCO),不需事先決定欲分群的數目,可藉由資料的特徵值自行群聚成適合的群集,也可透過核心函數使得分類比傳統的歐式距離較為正確。
    本研究先分別採用四組標竿資料集Iris、Wine、Glass以及Vowel來進行實驗,並與Dynamic Clustering using Particle Swarm Optimization (DCPSO)、Dynamic Clustering using Particle Swarm Optimization and Genetic Algorithm (DCPG)和Automatic Kernel Clustering with a Multi-Elitist Particle Swarm Optimization Algorithm (AKC-MEPSO)進行比較,以驗證該方法之準確性及有效性。驗證得知,本研究所提出AKC-BCO此一自動核心分群的方法在分群結果上表現較為穩定且優異。最後再將此方法應用於國內北部某教學醫院攝護腺癌症術後的存活狀況資料中,利用AKC-BCO自動核心分群方法找出最佳的分群結果,透過分群結果進行分析,提供醫院在攝護腺患者存活狀況上,給予適當的建議。


    With the advancement of information technology, the methods of cluster analysis are continually proposed. Even though, most of the clustering methods still need to pre-determine the number of clusters. This study intends to propose a novel automatic kernel clustering technique- Automatic Kernel Clustering with Bee Colony Optimization Algorithm (AKC-BCO), which does not need to set the number of clusters in advance. By examining the data features, the proposed method is able to automatically cluster the data into a suitable number of groups. In addition, it also applies the kernel function to make the classification more correct than the conventional Euclidean distance.
    This study employees four benchmark datasets, Iris, Wine, Glass and Vowel, to evaluate the accuracy and validity by comparing AKC-BCO with other three methods, Dynamic Clustering using Particle Swarm Optimization (DCPSO), Dynamic Clustering using Particle Swarm Optimization and Genetic Algorithm (DCPG) and Automatic Kernel Clustering with a Multi-Elitist Particle Swarm Optimization Algorithm (AKC-MEPSO). The experimental results indicate that AKC-BCO outperforms other methods in validity and stability. Finally, the proposed method is further applied to prostate cancer prognosis system. The results of AKC-BCO are still very promising. They can be used to provide appropriste suggestions for doctors regarding the patient survival situation.

    摘要 i Abstract ii 誌謝 iv 目錄 v 圖目錄 vii 表目錄 viii 第一章 緒論 1 1.1 研究背景與動機 1 1.2 研究目的 2 1.3 研究範圍與限制 2 1.4 研究流程 3 第二章 文獻探討 5 2.1 分群 5 2.1.1分群的定義與應用 5 2.1.2分群的程序 6 2.1.3分群的方法 6 2.1.4自動分群方法 9 2.2 蜂群最佳化演算法 19 2.2.1蜂群最佳化演算法之原理 19 2.2.2蜂群最佳化演算法在分群的應用 22 2.3 核心為基準的方法 23 第三章 研究方法 26 3.1 研究架構 27 3.2 AKC-BCO演算法介紹 28 3.2.1AKC-BCO演算法之架構 28 3.2.2AKC-BCO之演算流程 30 3.3 效能驗證 32 第四章 實驗分析 33 4.1 資料集介紹 33 4.2 資料前處理 36 4.3 演算法參數設定 37 4.4 實驗結果分析與檢定 42 4.4.1演算法評估準則 42 4.4.2演算法收斂情形 43 4.4.3演算法分群結果 45 4.4.4統計檢定 49 4.4.5小結 51 第五章 實證分析 52 5.1 案例簡介 52 5.1.1攝護腺癌症的定義及特性 52 5.1.2攝護腺癌症的資料蒐集和型態 53 5.2 實證分析流程 54 5.3 實證資料前處理 55 5.4 實證結果與分析 57 5.4.1分群結果比較 57 5.4.2統計檢定 59 5.4.3實證分析 61 5.4.4小結 64 第六章 結論與建議 65 6.1 研究結論 65 6.2 研究貢獻 66 6.3 未來研究方向 66 參考文獻 67 附錄一 實驗結果以及分群結果正確率 76 附錄二 實證結果以及分群結果正確率 82

    中文部分
    徐儀蓁,整合粒子群最佳化演算法與遺傳演算法於動態分群之研究,台北科技大學工業工程與管理研究所,碩士論文,台北,2009。
    林雪華,應用粒子群最佳化演算法與免疫演算法為基之動態分群於顧客關係管理研究,台北科技大學工業工程與管理研究所,碩士論文,台北,2010。
    蔡政軒,蜂群最佳化演算法於分群型資料探勘之應用與研究,佛光大學資訊應用研究所,碩士論文,宜蘭,2011。
    莊惟丞,蜂群最佳化之實數問題求解研究,東海大學工業工程與經營資訊研究所,碩士論文,台中,2009。
    林書宜,應用蜂群最佳化演算法於智慧型專業工作人員任務指派規劃,佛光大學資訊應用研究所,碩士論文,宜蘭,2011。
    劉易昌,支援向量機於財務預測上之應用,靜宜大學資訊管理研究所,碩士論文,台中,2003。
    余彥儒,圖書館書籍通閱移送之車輛途程問題-巨集啟發式演算法之應用,國立中央大學土木工程研究所,碩士論文,桃園,2010。

    英文部分
    Al-Sultan K., “A Tabu search approach to the clustering problem,” Pattern Recognition, vol. 28, no. 9, 1995, pp. 1443-1451.
    Ankerst, M., Breunig, M., Kriegel, H. and Sander, J., “OPTICS: Ordering Points to Identify the Clustering Structure,” Proceedins ACM SIGMOD’99 International Conference. on Management of Data, 1999.
    Arumugam, M.S. and Rao, M.V.C., “On the improved performances of the partical swarm optimization algorithms with adaptive parameters, cross-over operators and root mean square(RMS) vataiants for computing optimal control of a class of hybrid systems,” Applied Soft Computing, vol. 8, no. 1, 2008, pp. 324-336.
    Barbara, D. and Chen, P., “Using the fractal dimension to cluster datasets,” Proceedings 6th ACM SIGKDD International Conference Knowledge Discovery and Data Mining, 2000, pp. 260-264.
    Baykasoğlu, A., Ozbakir, L., and Tapkan, P., “Bees algorithm for generalized assignment problem,” Applied Mathematics and Computation, vol.215, 2010, pp. 3782-3795.
    Bezdek, J. and Hathaway, R., “Numerical convergence and interpretation of the fuzzy c-shells clustering algorithms,” IEEE Transactions on Neural Network, vol. 3, no. 5, 1992, pp. 787-793.
    Carpenter, G. and Grossberg, S., “A massively parallel architecture for a self-organizing neural pattern recognition machine,” Comput. Vis. Graph. Image Process, vol. 37, 1987, pp. 54-115.
    Carpenter, G. and Grossberg, S., “ART2: Self-organization of stable category recognition codes for analog input patterns,” Appl. Opt., vol. 26, no. 23, 1987, pp. 4919-4930.
    Carpenter, G., Grossberg, S., and Reynolds, J., “ARTMAP: a self-organizing neural network architecture for fast supervised learning and pattern recognition,” Neural Networks, vol. 1, 1991, pp. 863-868.
    Carpenter, G., Grossberg, S. and Rosen, D., “Fuzzy ART: Fast stable learning and categorization of analog patterns by an adaptive resonance system,” Neural Networks, vol. 4, 1991, pp. 759-771.
    Camastra, F., Verri, A., “A novel kernel method for clustering,” Pattern Analysis and Machine Intelligence, vol.27, no.5, 2005, pp. 801-804.
    Chang, D. and X. Zhang, “Dynamic niching genetic algorithm with data attraction for Automatic Clustering,” Tsinghua Science & Technology 14(6), 2009, pp.718-724.
    Chang, D.-X., X.-D. Zhang, et al., “A robust dynamic niching genetic algorithm with niche migration for automatic clustering problem,” Pattern Recognition 43(4), 2010, pp.1346-1360.
    Chiang, J.-H., P.-Y. Hao, “A new kernel-based fuzzy clustering approach: support vector clustering with cell growing,” Fuzzy Systems, 2003, pp.518-527.
    Chen, C.-Y., Feng, H.-M. and Ye, F., “Automatic Particle Swarm Optimization Clustering Algorithm,” International Journal of Electrical Engineering, vol. 13, no. 4, 2006, pp.379-387.
    Clerc M., “The swarm and the queen: towards a deterministic and adaptive particle swarm optimization,” Evolutionary Computation, 1999.
    Cohen, S.C.M. and Castro, L.N.D., “Data clustering with particle swarms,” IEEE Congress on Evolutionary Computations, 2006, pp. 1792-1798.
    Cınar, M., Engin, M., Engin, E. Z., & Ziya Ateşci, Y., “Early prostate cancer diagnosis by using artificial neural networks and support vector machines,” Expert Systems with Applications, 36(3, Part 2), 2009, pp. 6357-6361.
    Das, S., Abraham, A., Konar, A., “Automatic kernel clustering with a Multi-Elitist Particle Swarm Optimization Algorithm,” Pattern Recognition Letters 29, 2008, pp. 688-699.
    Dorigo, M., Maniezzo, V. and Colorni, A., “The Ant System: Optimization by a colony of cooperating agents,” IEEE Transactions on Systems, Man and Cybernetics-Part B, vol. 26, no. 1, 1996, pp. 1-13.
    Estivill-Castro, V. and Lee, I., “AMOEBA: hierarchical clustering based on spatial proximity using delaunay diagram,” Proceedings 9th International Spatial Data Handling (SDH2000) , 2000a, pp. 10-12.
    Estivill-Castro, V. and Lee, I., “AUTOCLUST: Automatic clustering via boundary extraction for massive point data sets,” Proceedings 5th International Conference Geo-Computation, 2000b, pp. 23-25.
    Ester M., Kriegel H., Sander J. and Xu X., “A density-based algorithm for discovering clusters in large spatial databases with noise,” Proceedings 2th International Conference Knowledge Discovery and Data Mining (KDD’96), 1996, pp. 226-231.
    Forgy, E., “Cluster analysis of multivariate data: efficiency vs. interpretability of classification,” Biometrics, vol. 21, 1965, pp. 768-780.
    Garai, G. and B. B. Chaudhuri, “A novel genetic algorithm for automatic clustering,” Pattern Recognition Letters 25(2), 2004, pp.173-187.
    Geva, A.B, “Hierarchical unsupervised fuzzy clustering,” IEEE Transactions on Fuzzy Systems, vol. 7, no. 6, 1999, pp. 723-733.
    Grossberg, S., “Adaptive pattern recognition and universal encoding II: Feedback, expectation, olfaction, and illusions,” Biol. Cybern., vol. 23, 1976, pp. 187-202.
    Graves, D. and Pedrycz, W., “Kernel-based fuzzy clustering and fuzzy clustering: A comparative experimental study,” Fuzzy Sets and Systems 161, 2010, pp. 522-543.
    Guha, S., Rastogi, R. and Shim, K., “CURE: An efficient clustering algorithm for large databases,” Proceedings ACM SIGMOD Int. Conf. Management of Data, 1998, pp. 73-84.
    Guha, S., Rastogi, R. and Shim, K., “ROCK: A robust clustering algorithm for categorical attributes,” Inf. Syst., vol. 25, no. 5, 2000, pp. 345-366.
    Gu, Y. and Hall, L.O., “Kernel Based Fuzzy Ant Clustering with Partition validity,” Fuzzy Systems, 2006, pp.263-267.
    He, H. and Y. Tan, “A two-stage genetic algorithm for automatic clustering,” Neurocomputing, 2011.
    Hinneburg, A. and Keim, D. "An efficient approach to clustering in large multimedia databases with noise," in Proceeding 4th International Conference Knowledge Discovery and Data Mining (KDD'98), 1998, pp. 58-65.
    Jarboui, B., Cheikh, M., Siarry, P. and Rebai, A., “Combinatorial particle swarm optimization (CPSO) for partitional clustering problem,” Applied Mathematics and Computation, vol. 192, no. 2, 2007, pp. 337-345.
    Jerne, N.K., “Towards a Network Theory of the Immune System,” Annual Immunology, 125c, 1974, pp.373-389.
    Juha V. and Esa A., “Clustering of the Self-Organizing Map,” IEEE Transactions on Neural Networks, vol. 11, no. 3, 2000, pp. 586-600.
    Karypis, G., Han, E. and Kumar V., “Chameleon: Hierarchical clustering using dynamic modeling,” IEEE Computer, vol. 32, no. 8, 1999, pp. 68-75.
    Karaboga, D., “An Idea Based On Honey Bee Swarm For Numerical Optimization,” Technical Report-TR06, Erciyes University, Engineering Faculty, Computer Engineering Department, 2005.
    Karaboga, D. and Basturk, B., “On the performance of artificial bee colony (ABC) algorithm,” Applied Soft Computing, vol. 8, 2008, pp. 687-697.
    Karaboga, D. and Ozturk, C., “A novel clustering approach: Artificial Bee Colony (ABC) algorithm,” Applied Soft Computing 11, 2011, pp. 652-657.
    Kaufman, L. and Rousseeuw, P.J., “Finding groups in data: an introduction to cluster analysis.” John Wiley & Sons, 1990.
    Kim, D.-W., Lee, K., Lee, D., Lee, K. H., “A kernel-based subtractive clustering method,” Pattern Recognition Letters 26, 2005, pp. 879-891.
    Kiang, M. Y., Raghu, T. S. and Shang, K. H. M., ”Marketing on the Internet –Who can benefit from an online marketing approach?” Decision Support System, Vol. 27, 2000, pp.383-393.
    Kohonen, T., “The self-organizing map,” Proceeding of IEEE, vol. 78, no. 9, 1990, pp. 1464-1480.
    Kotsiantis, S. and Pintelas, P., "Recent advances in clustering: a brief survey,” WSEAS Trans. Information Science & Applications, vol. 1, no. 1, 2004, pp. 73-81.
    Krishnapuram, R. and Keller, J., “A possibilistic approach to clustering,” IEEE Transactions Fuzzy Systems, vol. 1, no. 2, 1993, pp. 98-110.
    Krishna, K. and Murty, M.N., “Genetic K-means algorithm,” IEEE Transactions on Systems, vol. 29, no. 3, 1999, pp. 433-439.
    Kuo, R.J., Wang, H.S., Hu, T.L. and Chou, S.H. , “Application of ant K-means on clustering analysis,” Computers & Mathematics with Applications, vol. 50, 2005, pp. 1709-1724.
    Kuo, R.J., Syu, Y.J. Chen, Z.Y., and Tien, F.C., “Integration of particle swarm optimization and genetic algorithm for dynamic clustering,” Information Sciences, 2012.
    Li, W. and Yang, Y., “A new approach to TS fuzzy modeling using dual kernel-based learning machines,” Neurocomputing 71, 2008, pp. 3660-3665.
    Liu, Y., X. Wu, et al., “Automatic clustering using genetic algorithms,” Applied Mathematics and Computation 218(4), 2011, pp.1267-1279.
    Lumer, E. and Faieta, B., “Diversity and adaptation in populations of clustering ants,“ Proceedings of the 3th International Conference onSimulation of Adaptive Behavior: From Animals to Animats, vol. 3, 1994, pp. 501–508.
    Mizutani, K. and Miyamoto, S., “Kernel-Based Fuzzy Competitive Learning Clustering,” Fuzzy Systems, 2005, pp. 636-639.
    Monmarche, N., Slimane, M. and Venturini, G., “Antclass: discovery of clusters in numeric data by an hybridization of an ant colony with the kmeans algorithm,” 1999.
    Omran, M.G.H., Salman A. and Engelbrecht A.P., “Dynamic clustering using particle swarm optimization with application in image segmentation,” Pattern Analysis & Applications, vol. 8, no. 4, 2006, pp. 332-344.
    Ouadfel S., Batouch M., Taleb-Ahmed A., “A modified particle swarm optimization algorithm for automatic image clustering,” 2010.
    Pham, D.T., Otri, S., Afify, A., Mahmuddin, M., and Al-Jabbouli, H., “Data Clustering Using the Bees Algorithm,” Proc. 40th CIRP International Manufacturing Systems Seminar, 2007.
    Ressom, H., Wang, D. and Natarajan, P., “Adaptive double self-organizing maps for clustering gene expression profiles,” Neural Networks, vol. 16, 2003, pp. 633-640.
    Samadzadegan F., Saeedi S., “Clustering of Lidar Data Using Particle Swarm Optimization Algorithm in Urban area,” Laserscanning09, vol. XXXVIII, 2009, p. 334-339.
    Saritas, I., Ozkan, I. A., & Sert, I. U., “Prognosis of prostate cancer by artificial neural networks,” Expert Systems with Applications, 37(9), 2010, pp. 6646-6650.
    Selim, S. and Alsultan, K., “A simulated annealing algorithm for the clustering problems,” Pattern Recognition, vol. 24, no. 10, 1991, pp. 1003-1008.
    Sheikholeslami, G., Chatterjee, S., and Zhang, A., “WaveCluster: A multi-resolution clustering approach for very large spatial databases,” Proceedings 1998 International Conference Very Large Databases (VLDB’98), 1998, pp. 428-439.
    Shi, Y. and Eberhart, R., “A modified particle swarm optimizer,” Proceedings of the IEEE International Conference on Evolutionary Computation, 1998a, pp. 69-73.
    Sitao W. and Tommy W.S. Chow, “PRSOM: A New Visualization Method by Hybridizing Multidimensional Scaling and Self-Organizing Map, ” Neural Networks, vol. 16, 2005, pp. 1362-1380.
    Soleymani Baghshah, M. and Bagheri Shouraki, S., “Kernel-based metric learning for semi-supervised clustering,” Neurocomputing 73, 2010, pp. 1352-1361.
    Suganthan, P.N., “Particle swarm optimizer with neighbourhood operator,” Proceedings of the Conference on Evolutionary Computation, 1999, pp. 1958-1961.
    Su, M.C. and Chang, H.T., “Fast self-organizing feature map algorithm,” IEEE Transactions on neural networks, vol. 11, no. 3, 2000, pp. 721-733.
    Tsai, C.F., Tsai, C.W., Wu, H.C. and Yang, T. “ACODF: A Novel Data Clustering Approach for Data Mining in Large Databases,” Journal of Systems and Software, vol. 73, 2004, pp. 133–145.
    Tseng, L. Y. and S. Bien Yang, “A genetic approach to the automatic clustering problem,” Pattern Recognition 34(2), 2001, pp.415-424.
    Tushir, M. and Srivastava, S., “A new Kernelized hybrid c-mean clustering model with optimized parameters,” Applied Soft Computing 10, 2010, pp. 381-389.
    Tushir, M. and Srivastava, S., “A New Kernel based Hybrid c-Means Clustering Model,” Fuzzy Systems, 2007.
    Wang, C.-D., Lai, J.-H., Zhu, J.-Y., “A Conscience On-line Learning Approach for Kernel-Based Clustering,” Data Mining, 2010, pp. 531-540.
    Xie, X.F., Zhang, W.J. and Yang, Z.L., “A dissipative paticle swarm optimization,” Proceedings of the IEEE Congress on Evolutionary Computing (CEC 2002), 2002, pp. 1666-1670.
    Xu, R. and Wunsch, D., “Survey of clustering algorithm,” IEEE Transactions on Neural Networks, vol.16, no. 3, 2005, pp. 645-678.
    Yang, F. and Jiang, T., “Cell Image Segmentation with Kernel-Based Dynamic Clustering and an Ellipsoidal Cell Shape Model,” Journal of Biomedical Informatics 34, 2001, pp. 67-73.
    Yang, M.-S. and Tsai, H.-S., “A Gaussian kernel-based fuzzy c-means algorithm with a spatial bias correction,” Pattern Recognition Letters 29, 2008, pp. 1713-1725.
    Yin, H., “ViSOM-A novel method for multivariate data projection and structure visualization,” IEEE Transactions on Neural Networks, vol. 13, no. 1, 2002, pp. 135-144.
    Zhang, T., Ramakrishnan, R. amd Livny, M., “BIRCH: An efficient data clustering method for very large databases,” Proceedings ACM SIGMOD Conf. Management of Data, 1996, pp. 103-114.
    Zhang, D., Liu, X. and Guan, Z., “A dynamic clustering algorithm based on PSO and Its application in fuzzy identification,” Proceedings of the 2006 International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP'06), 2006, pp. 232 – 235.
    Zhang, C.S., Ouyang, D.T., and Ning, J., “An artificial bee colony approach for clustering,” Expert Systems with Applications 37, 2010, pp. 4761-4767.
    Zhao, B., Guo, C.X., Bai, B.R. and Cao, Y.J., “An improved particle swarm optimization algorithm for unit commitment,” International Journal of Electrical Power & Energy Systems, vol. 28, no.7, 2006, pp. 482-490.
    Zou, W.P., Zhu, Y.L., Chen, H.N., and Sui, X., “A Clustering Approach Using Cooperative Artificial Bee Colony Algorithm,” Hindawi Publishing Corporation Discrete Dynamics in Nature and Society, Article ID 459796, 2010.

    QR CODE