簡易檢索 / 詳目顯示

研究生: 張舒婷
Shu-Ting Chang
論文名稱: 根據自動分群演算法及多迴歸分析技術以在關聯式資料庫系統中估計空值之新方法
Estimating Null Values in Relational Database Systems Using Automatic Clustering and Multiple Regression Techniques
指導教授: 陳錫明
Shyi-Ming Chen
口試委員: 何正信
Cheng-seen Ho
陳榮靜
Rung-ching Chen
李惠明
Huey-ming Lee
呂永和
Yung-ho Leu
學位類別: 碩士
Master
系所名稱: 電資學院 - 資訊工程系
Department of Computer Science and Information Engineering
論文出版年: 2008
畢業學年度: 96
語文別: 英文
論文頁數: 92
中文關鍵詞: 關連式資料庫空值自動分群演算法分群中心
外文關鍵詞: Relational database, Null value, Automatic clustering algorithm, Cluster center
相關次數: 點閱:297下載:3
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 在本論文中,我們提出一個新的自動分群演算法及利用多迴歸分析技術以在關聯式資料庫系統中估計空值。首先,我們針對數值資料提出一自動分群演算法.本論文所提之自動分群演算法不須事先定義群數及也不須事先將資料予以排序,使得分群時能更有彈性。依據本論文所提出之自動分群演算法所做的分群結果來估計空值,其僅須針對其中一個分群的資料做估計空值的工作,無須針對整個資料庫系統內所有的資料做處理,本論文所提之在關聯式資料庫系統中估計空值的方法比目前已存在的方法具有更高的平均估計準確率。


    In this thesis, we present a new method for estimating null values in relational database systems using automatic clustering and multiple regression techniques. First, we present a new automatic clustering algorithm for clustering numerical data. The proposed automatic clustering algorithm does not need to determine the number of clusters in advance and does not need to sort the data in the database in advance. Then, based on the proposed automatic clustering algorithm and multiple regression techniques, we present a new method to estimate null values in relational database systems. The proposed method for estimating null values in relational database systems only needs to process a particular cluster instead of the whole database. It gets a higher average estimation accuracy rate than the existing methods for estimating null values in relational database systems.

    Chapter 1 Introduction 1 1.1 Motivation 1 1.2 Related Literature 2 1.3 Organization of This Thesis 2 Chapter 2 Fuzzy Set Theory 4 2.1 Basic Concept of Fuzzy sets 4 2.2 Types of Membership Functions 4 2.3 Summary 7 Chapter 3 A Method for Estimating Null Values in Relational Database Systems 8 3.1 Chen-and-Yeh’s Automatic Clustering Algorithm [12] 8 3.2 Hsiao-and-Chen’s Method to Estimated Null Values in Relational Database Systems [18] 9 3.2.1 An Automatic Clustering Algorithm [18] 9 3.2.2 A Review of Chen-and-Hsiao’s Method for Estimating Null Values in Relational Database Systems [17] 15 3.2.3 A Review of Chen-and-Chen’s Method for Estimating Null Values in Relational Database Systems [9] 17 3.3 Summary 20 Chapter 4 A New Method for Estimating Null Values in Relational Database Systems Using Automatic Clustering and Multiple Regression Techniques 22 4.1 A New Automatic Clustering Algorithm 22 4.2 A New Method to Estimate Null Values in Relational Database Systems 38 4.3 An Example of Estimating Null Value in Relation Database Systems 43 4.4 Summary 50 Chapter 5 Estimating Null Values in Relational Database Systems with Negative Dependency Relationships between Attributes 52 5.1An Example of Estimating Null Values in Relational Database Systems with Negative Dependency Relationships between Attributes 52 5.2A New Method to Estimate Null Values in Relational Database Systems with Negative Dependency Relationships between Attributes 65 5.3 Summary 86 Chapter 6 Conclusions 87 6.1 Contributions of This Thesis 87 6.2 Future Research 87 References 89

    [1]P. Alam, D. Booth, K. Lee, and T. Thordarson, “The use of fuzzy clustering algorithm and self-organizing neural networks for identifying potentially failing banks: An experimental study,” Expert Systems with Applications, vol. 18, no. 3, pp. 185-199, 2000.
    [2]J.G. Baek and Co. Kim, “Learning single-issue negotiation strategies using hierarchical clustering method,” Expert Systems with Applications, vol. 32, no. 2, pp. 606-615, 2007.
    [3]M. L. Bernson, D. M. Levine, and M. Goldstein, Intermediate Statistical Methods and Applications. New Jersey: Prentice-Hall, 1983.
    [4]S. K. Bhatia and J. S. Deogun, “Conceptual clustering in information retrieval,” IEEE Transactions on Systems, Man, and Cybernetics-Part B: Cybernetics, vol. 28, no. 3, pp. 427-436, 1998.
    [5]Z. H. Chan, L. Collins, and N. Kasabov, “An efficient greedy K-means algorithm for global gene trajectory clustering,” Expert System with Applications, vol. 30, no. 1, pp. 137-141, 2006.
    [6]S. T. Chang and S. M. Chen, “A new approach for estimating null values in relational database systems using automatic clustering and multiple regression techniques,” Proceedings of the 11th Conference on Artificial Intelligence and Applications, Kaohsiung, Taiwan, Republic of China, 2006.
    [7]S. M. Chen, “A new approach to handle fuzzy decision making problems,” IEEE Transactions on Systems, Man, and Cybernetics, vol. 18, no. 6, pp. 1012-1016, 1998.
    [8]M. S. Chen and S. W. Wang, “Fuzzy clustering analysis for optimizing fuzzy
    membership functions,” Fuzzy Sets and Systems, vol. 103, no. 2, pp. 239-254,
    1999.
    [9]S. M. Chen and H. H. Chen, “Estimating null values in the distributed relational databases environments,” Cybernetics and Systems, vol. 31, no. 8, pp. 851-871, 2000.
    [10]S. M. Chen and C.M. Huang, “Generating weighted fuzzy rules from relational database systems for estimating null values using genetic algorithms,” IEEE Transactions on Fuzzy Systems, vol. 11, no. 4, pp. 495-506, 2003.
    [11]S. M. Chen and S. W. Lee, “A new method to generate fuzzy rules from relational database systems for estimating null values,” Cybernetics and Systems, vol. 34, no. 1, pp. 33-57, 2003.
    [12]S. M. Chen and M. S. Yeh, “Generation fuzzy rules from relational database systems for estimating null values,” Cybernetics and Systems, vol. 29, no. 6, pp. 363-376, 1998.
    [13]S. M. Chen and W. T. Jong, “Fuzzy query translation for relational database systems,” IEEE Transactions on Systems, Man, and Cybernetics, vol. 27, no. 4, pp. 714-721,1997.
    [14]C. H. Cheng and J. W. Wang, “A new approach for estimating null values in relational database,” Soft Computing, vol. 10, no. 2, pp. 104-114, 2006.
    [15]J. H. Chiang, “Support vector learning mechanism for fuzzy rule-based modeling: A new approach,” IEEE Transactions on Fuzzy Systems, vol. 12, no. 1, pp. 1-12, 2004.
    [16]K. Honda and H. Ichihashi, “Linear fuzzy clustering techniques with missing values and their application to local principal component analysis,” IEEE Transactions on Fuzzy Systems, vol. 12, no. 2, pp. 183-193, 2004.
    [17]S. M. Chen and H. R. Hsiao, “A new method to estimate null values in relational
    database systems based on automatic clustering techniques,” Information
    Sciences, vol. 69, no. 1-2, pp. 47-69, 2005.
    [18]H. R. Hsiao and S. M. Chen, “A new automatic clustering algorithm for fuzzy query processing,” Proceedings of the 6th Conference on Artificial Intelligence and Applications, Kaohsiung, Taiwan, Republic of China, pp. 550-555, 2001.
    [19]C. M. Huang and S. M. Chen, “Estimating null values in relational database systems with a negative dependency relationship between attributes,” Proceedings of the 13th International Conference on Information Management, Taipei, Taiwan, Republic of China, vol. 1, pp. 151-158, 2002.
    [20]C. M. Huang and S. M. Chen, “A new method to estimate null values in relational database systems using genetic algorithms,” Proceedings of the Six Conference on Artificial Intelligence and Applications, Kauhsiung, Taiwan, Republic of China, pp. 599-604, 2001.
    [21]A. H. Kvanli, C. S. Guynes, and R. J. Pavur, Introduction to Business Statistics. West Publishing Company, 1986.
    [22]S. W. Lee and S. M. Chen, “A new method for estimating null values in relational database systems based on genetic algorithms,” Proceedings of the Seventh Conference on Artificial Intelligence and Applications, Taichung, Taiwan, Republic of China, pp. 447-452, 2002.
    [23]J. M. Leski, “Generalized weighted conditional fuzzy clustering,” IEEE Transactions on Fuzzy Systems, vol. 11, no. 6, pp. 709-715, 2003.
    [24]Y. S. Lin and S. M. Chen, “Using automatic clustering techniques for fuzzy query processing in relational database systems,” Proceedings of the 11th National Conference on Information Management, Kaohsiung, Taiwan, Republic of China, 2000.
    [25]S. Rıdvan, T. Kemal and A. Novruz, “A fuzzy clustering approach for finding
    similar documents using a novel similarity measure,” Expert Systems with Applications, vol. 1, no. 3, pp. 600-605.
    [26]S. M. Tseng, K. H. Wang, and C. I. Lee, “A preprocessing method to deal with missing values by integrating clustering and regression techniques,” Applied Artificial Intelligence, vol. 17, no. 5, pp. 535-544, 2003.
    [27]H. Wang, and P. M. Bell, “Fuzzy clustering analysis and multifactorial evaluation for students’ imaginative power in physics problem solving,” Fuzzy Sets and Systems, vol. 78, no. 1, pp. 95-105, 1996.
    [28]S. L. Wang and Y. J. Tsai, “Null queries with interval-valued ambiguous attributes,” Proceedings of the 1998 IEEE International Conference on Systems, Man, and Cybernetics, San Diego, USA, vol. 3, pp. 2150-2153, 1998.
    [29]M. S. Yang, P. Y. Hwang, and D. H. Chen, “Fuzzy clustering algorithms for mixed feature variables,” Fuzzy Sets and Systems, vol. 141, no. 2, pp. 301-317, 2004.
    [30]M. S. Yeh and S. M. Chen, “An algorithm for generating fuzzy rules from relational database systems,” Proceedings of the 6th International Conference on Information Management, Taipei, Taiwan, Republic of China, pp. 219-226, 1995.
    [31]M. S. Yeh and S. M. Chen, “A new method for fuzzy query processing using automatic clustering techniques,” Journal of Computers, vol. 6, no. 1, pp. 1-10, 1994.
    [32]M. S. Yeh, S. M. Chen, and P. Y. Hsiao, “A comparison of measures of similarity of fuzzy values,” Proceedings of the 1994 Second National Conference on Fuzzy Theory and Applications, Taipei, Taiwan, Republic of China, pp. 152-159, 1994
    [33]L. A. Zadeh, “Fuzzy sets,” Information and Control, vol. 8, pp. 338-353, 1965.

    QR CODE