簡易檢索 / 詳目顯示

研究生: 梅智皓
Chih-Hau Mei
論文名稱: 應用萬用演算法為基礎之整合分群法於App顧客區隔之研究
Applying Meta-heuristic Algorithms-based Clustering Ensembles Methods for App Customer Segmentation
指導教授: 郭人介
Ren-Jieh Kuo
口試委員: 歐陽超
Chao Ou-Yang
蔡介元
Chieh-Yuan Tsai
學位類別: 碩士
Master
系所名稱: 管理學院 - 工業管理系
Department of Industrial Management
論文出版年: 2014
畢業學年度: 102
語文別: 英文
論文頁數: 100
中文關鍵詞: 分群分析萬用演算法整合分群基因演算法粒子群演算法人工蜂群演算法應用程式顧客區隔
外文關鍵詞: Clustering Analysis, Meta-heuristic Algorithms, Clustering Ensembles, GA, PSO, ABC, App, Customer Segmentation
相關次數: 點閱:305下載:11
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 由於資訊科技日新月異,新的分群分析方法不斷地被提出,但是沒有任何一個分群演算法可以對所有的資料集進行正確的分群。因此,本研究主要是提出一個萬用演算法結合整合分群演算法的模型,在整合分群的部分使用資料子集合和不同的分群演算法來產生分群結果,再用一致關聯矩陣整合所有的分群結果,在過程中亦使用主成分分析法來減少程式的運算時間。最後,則利用萬用演算法為基礎之K-means分群演算法去決定最終分群結果。本研究提出三個使用此模型的分群演算法,包括了實數型基因演算法為基礎之K-means整合分群演算法、粒子群演算法為基礎之K-means整合分群演算法、人工蜂群演算法為基礎之K-means整合分群演算法。接著,將它們對Iris、Wine、Tae、Flame資料集的分群結果和其他分群演算法進行比較,分群演算法有K-means分群演算法、單一連結聚合分群演算法、完整連結聚合分群演算法、平均連結聚合分群演算法、實數型基因演算法為基礎之分群演算法、粒子群演算法為基礎之分群演算法、人工蜂群演算法為基礎之分群演算法、實數型基因演算法為基礎之K-means分群演算法、粒子群演算法為基礎之K-means分群演算法、人工蜂群演算法為基礎之K-means分群演算法。實驗結果顯示實數型基因演算法為基礎之K-means整合分群演算法和人工蜂群演算法為基礎之K-means整合分群演算法比其他分群演算法能獲得更好的結果,此外,人工蜂群演算法為基礎之K-means整合分群演算法比實數型基因演算法為基礎之K-means整合分群演算法更能穩定地獲得最佳解,所以更應用在體重控制App的紀錄數據進行App顧客區隔分析,並進而為App發展商提供了一些改進策略的建議。


    Recently, many clustering algorithms have been presented. However, there is no clustering method which can process all datasets correctly. This study proposes a model of meta-heuristic algorithms combining with clustering ensembles. The proposed clustering ensembles use subsets of data and different algorithms to generate clustering results. Then, co-association matrix is applied to integrate all the clustering results. In the process, principal component analysis is employed to reduce the computational time. The meta-heuristic algorithms based K-means clustering algorithms are applied to determine the clustering results. There are three clustering algorithms proposed including real-coded genetic algorithm based K-means clustering (GKC), particle swarm optimization based K-means clustering (PSOKC), and artificial bee colony based K-means clustering (ABCKC). Their results are compared with these of other clustering algorithms, such as K-means, single linkage agglomerative hierarchical clustering (SLHC), complete linkage agglomerative hierarchical clustering (CLHC), average linkage agglomerative hierarchical clustering (ALHC), real-coded genetic algorithms based clustering (GC), particle swarm optimization based clustering (PSOC), artificial bee colony based clustering (ABCC), real-coded genetic algorithm based K-means clustering (GKC), particle swarm optimization based K-means clustering (PSOKC), and artificial bee colony based K-means clustering (ABCKC) for iris, wine, tae, and flame datasets. The experimental results indicate that GKCE and ABCKCE are able to obtain better solutions than other methods. Furthermore, ABCKCE can stably obtain the optimal solution than GKCE, so ABCKCE is applied to do the App customer segmentation for log files of the weight control App. Finally, this study provides some improving suggestions for the application developer.

    摘要 I Abstract II 誌謝 III Contents IV List of Tables VII List of Figures IX Chapter 1 Introduction 1 1.1 Background and Motivation 1 1.2 Research Objectives 3 1.3 Research Scope and Constrain 3 1.4 Research Framework 3 Chapter 2 Literature Review 5 2.1 Clustering Methods 5 2.1.1 K-means Clustering 5 2.1.2 Hierarchical Clustering 7 2.2 Meta-heuristic Algorithm Based Clustering Methods 9 2.2.1 Real-Coded Genetic Algorithm Based Clustering 9 2.2.2 Particle Swarm Optimization Based Clustering 11 2.2.3 Artificial Bee Colony Based Clustering 13 2.3 Clustering Ensembles 15 2.3.1 Generation Mechanisms 16 2.3.2 Consensus Functions 17 Chapter 3 Methodology 19 3.1 Methodology Framework 19 3.2 The Proposed Clustering Ensembles 20 3.2.1 Subsets of Data with Different Algorithms 21 3.2.2 Co-association Matrix 22 3.3 Principal Component Analysis 24 3.4 Meta-heuristic Algorithm Based K-means Clustering Method 25 3.4.1 Real-Coded Genetic Algorithm Based K-means Clustering 26 3.4.2 Particle Swarm Optimization Based K-means Clustering 28 3.4.3 Artificial Bee Colony Based K-means Clustering 30 Chapter 4 Experimental Results 34 4.1 Datasets 34 4.2 Performance Measurement 35 4.3 Preprocessing of Datasets 35 4.4 Parameters Determination 36 4.4.1 Parameters for Iris Dataset 40 4.4.2 Parameters for Wine Dataset 43 4.4.3 Parameters for Tae Dataset 46 4.4.4 Parameters for Flame Dataset 49 4.5 Computational Results 52 4.6 Statistical Hypothesis 53 Chapter 5 Model Evaluation Results 58 5.1 Model Dataset 58 5.2 Performance Measurement 60 5.3 Dataset Preprocessing 60 5.4 Parameters Determination 61 5.5 Computational Results 67 5.6 Statistical Hypothesis 68 5.7 Analysis 69 Chapter 6 Conclusions and Future Research 72 6.1 Conclusions 72 6.2 Contributions 72 6.3 Future Research 73 References 74 Appendix 77

    Akay, B. & Karaboga, D., "A modified artificial bee colony algorithm for real-parameter optimization," Information Sciences, vol. 192, pp. 120-142, 2012.
    Azimi, J., Mohammadi, M., Movaghar, A., & Analoui, M., "Clustering ensembles using genetic algorithm," in Computer Architecture for Machine Perception and Sensing, 2006. CAMP 2006. International Workshop on, 2006, pp. 119-123.
    Basturk, B. & Karaboga, D., "An artificial bee colony (ABC) algorithm for numeric function optimization," in IEEE swarm intelligence symposium, 2006, pp. 12-14.
    Chen, Q. & Mo, J., "Optimizing the Ant Clustering Model Based on K-Means Algorithm," in Computer Science and Information Engineering, 2009 WRI World Congress on, 2009, pp. 699-702.
    Cheng, S.-S., Chao, Y.-H., Wang, H.-M., & Fu, H.-C., "A prototypes-embedded genetic K-Means algorithm," in Pattern Recognition, 2006. ICPR 2006. 18th International Conference on, 2006, pp. 724-727.
    Dong, J. & Qi, M., "A new algorithm for clustering based on particle swarm optimization and K-means," in Artificial Intelligence and Computational Intelligence, 2009. AICI'09. International Conference on, 2009, pp. 264-268.
    Eiben, A. E. & Smith, J. E., Introduction to evolutionary computing: springer, 2003.
    Florek, K., Łukaszewicz, J., Perkal, J., Steinhaus, H., & Zubrzycki, S., "Sur la liaison et la division des points d'un ensemble fini," in Colloquium Mathematicae, 1951, pp. 282-285.
    Fred, A. L. & Jain, A. K., "Data clustering using evidence accumulation," in Pattern Recognition, 2002. Proceedings. 16th International Conference on, 2002, pp. 276-280.
    Fu, L. & Medico, E., "FLAME, a novel fuzzy clustering method for the analysis of DNA microarray data," BMC bioinformatics, vol. 8, no. 1, p. 3, 2007.
    Hamka, F., Bouwman, H., De Reuver, M., & Kroesen, M., "Mobile customer segmentation based on smartphone measurement," Telematics and Informatics, vol. 31, no. 2, pp. 220-227, 2014.
    Jain, A. K., Murty, M. N., & Flynn, P. J., "Data clustering: a review," ACM computing surveys (CSUR), vol. 31, no. 3, pp. 264-323, 1999.
    Jiang, B., Wang, N., & Wang, L., "Parameter identification for solid oxide fuel cells using cooperative barebone particle swarm optimization with hybrid learning," International Journal of Hydrogen Energy, vol. 39, no. 1, pp. 532-542, 2014.
    Jiang, D., Pei, J., & Zhang, A., "DHC: a density-based hierarchical clustering method for time series gene expression data," in Bioinformatics and Bioengineering, 2003. Proceedings. Third IEEE Symposium on, 2003, pp. 393-400.
    Jolliffe, I., Principal component analysis: Wiley Online Library, 2005.
    Kackar, R. N., "Off-line quality control, parameter design, and the Taguchi method," Journal of Quality Technology, vol. 17, pp. 176-188, 1985.
    Karaboga, D., "An idea based on honey bee swarm for numerical optimization," Technical report-tr06, Erciyes university, engineering faculty, computer engineering department2005.
    Karaboga, D. & Basturk, B., "On the performance of artificial bee colony (ABC) algorithm," Applied soft computing, vol. 8, no. 1, pp. 687-697, 2008.
    Karaboga, D. & Ozturk, C., "A novel clustering approach: Artificial Bee Colony (ABC) algorithm," Applied Soft Computing, vol. 11, no. 1, pp. 652-657, 2011.
    Kennedy, J. & Eberhart, R., "Particle swarm optimization," in Proceedings of IEEE international conference on neural networks, 1995, pp. 1942-1948.
    Kleinberg, J., "An impossibility theorem for clustering," in NIPS, 2002, pp. 463-470.
    Krishna, K. & Murty, M. N., "Genetic K-means algorithm," Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on, vol. 29, no. 3, pp. 433-439, 1999.
    Krishnamoorthi, M. & Natarajan, A., "A comparative analysis of enhanced Artificial Bee Colony algorithms for data clustering," in Computer Communication and Informatics (ICCCI), 2013 International Conference on, 2013, pp. 1-6.
    Kuncheva, L. I. & Vetrov, D. P., "Evaluation of stability of k-means cluster ensembles with respect to random initialization," Pattern Analysis and Machine Intelligence, IEEE Transactions on, vol. 28, no. 11, pp. 1798-1808, 2006.
    Kuo, R., Zulvia, F. E., & Suryadi, K., "Hybrid particle swarm optimization with genetic algorithm for solving capacitated vehicle routing problem with fuzzy demand–A case study on garbage collection system," Applied Mathematics and Computation, vol. 219, no. 5, pp. 2574-2588, 2012.
    Li, K., Wang, L., & Hao, L., "Comparison of cluster ensembles methods based on hierarchical clustering," in Computational Intelligence and Natural Computing, 2009. CINC'09. International Conference on, 2009, pp. 499-502.
    Li, Z., Li, Y., & Xu, L., "Anomaly intrusion detection method based on k-means clustering algorithm with particle swarm optimization," in Information Technology, Computer Engineering and Management Sciences (ICM), 2011 International Conference on, 2011, pp. 157-161.
    MacQueen, J., "Some methods for classification and analysis of multivariate observations," in Proceedings of the fifth Berkeley symposium on mathematical statistics and probability, 1967, p. 14.
    Maulik, U. & Bandyopadhyay, S., "Genetic algorithm-based clustering technique," Pattern recognition, vol. 33, no. 9, pp. 1455-1465, 2000.
    Michielssen, E., Ranjithan, S., & Mittra, R., "Optimal multilayer filter design using real coded genetic algorithms," IEE Proceedings J (Optoelectronics), vol. 139, no. 6, pp. 413-420, 1992.
    Patel, R., Raghuwanshi, M., & Jaiswal, A. N., "Modifying genetic algorithm with species and sexual selection by using K-means algorithm," in Advance Computing Conference, 2009. IACC 2009. IEEE International, 2009, pp. 114-119.
    Rani, A. & Parthipan, L., "Clustering analysis by Improved Particle Swarm Optimization and K-means algorithm," in Sustainable Energy and Intelligent Systems (SEISCON 2012), IET Chennai 3rd International on, 2012, pp. 1-6.
    Shi, Y. & Eberhart, R. C., "Parameter selection in particle swarm optimization," in Evolutionary Programming VII, 1998, pp. 591-600.
    Sokal, R. R., "A statistical method for evaluating systematic relationships," Univ Kans Sci Bull, vol. 38, pp. 1409-1438, 1958.
    Strehl, A. & Ghosh, J., "Cluster ensembles---a knowledge reuse framework for combining multiple partitions," The Journal of Machine Learning Research, vol. 3, pp. 583-617, 2003.
    Sun, G., Feng, Y.-h., Guo, X.-j., & Zhang, C.-p., "Research on K-means Clustering Algorithm," Journal of Changchun Normal University, vol. 2, p. 001, 2011.
    Tamasauskas, D., Sakalauskas, V., & Kriksciuniene, D., "Evaluation framework of hierarchical clustering methods for binary data," in Hybrid Intelligent Systems (HIS), 2012 12th International Conference on, 2012, pp. 421-426.
    Topchy, A., Jain, A. K., & Punch, W., "Clustering ensembles: Models of consensus and weak partitions," Pattern Analysis and Machine Intelligence, IEEE Transactions on, vol. 27, no. 12, pp. 1866-1881, 2005.
    Topchy, A. P., Jain, A. K., & Punch, W. F., "A Mixture Model for Clustering Ensembles," in SDM, 2004.
    Van der Merwe, D. & Engelbrecht, A. P., "Data clustering using particle swarm optimization," in Evolutionary Computation, 2003. CEC'03. The 2003 Congress on, 2003, pp. 215-220.
    Vega-Pons, S. & Ruiz-Shulcloper, J., "A survey of clustering ensemble algorithms," International Journal of Pattern Recognition and Artificial Intelligence, vol. 25, no. 03, pp. 337-372, 2011.
    Xiuman, D., Guoxia, S., & Tao, Y., "Moving target detection based on Genetic K-means Algorithm," in Communication Technology (ICCT), 2011 IEEE 13th International Conference on, 2011, pp. 819-822.
    Yang, L.-y., Zhang, J.-y., & Wang, W.-j., "Cluster Ensemble Based on Particle Swarm Optimization," in Intelligent Systems, 2009. GCIS'09. WRI Global Congress on, 2009, pp. 519-523.
    Zhang, C., Ouyang, D., & Ning, J., "An artificial bee colony approach for clustering," Expert Systems with Applications, vol. 37, no. 7, pp. 4761-4767, 2010.
    Zheng, L., Li, T., & Ding, C., "Hierarchical ensemble clustering," in Data Mining (ICDM), 2010 IEEE 10th International Conference on, 2010, pp. 1199-1204.

    QR CODE