研究生: |
劉宇軒 Yu-Hsuan Liu |
---|---|
論文名稱: |
結合主成分分析與費雪資訊之簡單貝氏分類法 Naive Bayes classifier with Principal Components Analysis and Fisher Information |
指導教授: |
楊維寧
Wei-Ning Yang |
口試委員: |
陳雲岫
Yun-Shiow Chen 呂永和 Yung-Ho Leu |
學位類別: |
碩士 Master |
系所名稱: |
管理學院 - 資訊管理系 Department of Information Management |
論文出版年: | 2017 |
畢業學年度: | 105 |
語文別: | 中文 |
論文頁數: | 18 |
中文關鍵詞: | 簡單貝氏分類法 、主成分分析 、費雪資訊 |
外文關鍵詞: | Naive Bayes classifier, Principal Components Analysis, Fisher Information |
相關次數: | 點閱:391 下載:0 |
分享至: |
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
「簡單貝氏分類法」是一種透過各特徵彼此間互相獨立下運用貝氏定理的分類器,現實生活中大部分的資料無法滿足任何兩個特徵皆不相關這項假設。
我們提出一種結合主成分分析與費雪資訊之簡單貝氏分類法,先利用「主成分分析」將各特徵轉化成任何兩個特徵皆不相關的新的特徵,並將各屬性欄位與類別欄位進行費雪資訊的計算,逐一選出具有較多資訊量的屬性,最後估計貝氏分類法所需的相似度,並將物件歸類於最高「事後機率」的類別。
我們以網路常用的標準資料集為研究與效能評估,探討利用主成分分析和費雪資訊降低特徵維度,此分類法準確率的變化。
Naive Bayes classifier is a simple probabilistic classifier which is based on applying Bayes’theorem which strong independence assumptions between the features.
We propose a method based on Naive Bayes classifier with Principal Components Analysis(PCA) and Fisher Information.
We use Principal Components Analysis to make features uncorrelated.
The transformed features are ranked by Fisher Information score which measuring the amount of information and calculate the posterior probability where the likelihood is replaced by p-value.
We conclude our research through the classification accuracy with some examples and present our vision for future research.
Hsu, C. N., Huang, H. J., and Wong, T. T.. “Implications of the Dirichlet assumption for discretization of continuous attributes in naïve Bayesian classifiers”. Machine Learning, 53, 235-263, 2003.
Mohammad Darzi, Ali AsgharLiaei, Mahdi Hosseini, HabibollahAsghari. "Feature Selection for Breast Cancer Diagnosis: A Case-Based Wrapper Approach. " (2011), World academy of Science, Engineering and Technology 77, 2011,pp 1142-1143.
E.Osuna, R.Freund, and F. Girosi, “Training support vector machines: Application to face detection”.
Proceedings of computer vision and pattern recognition, Puerto Rico pp. 130–136.1997.
D. Lavanya, “Ensemble Decision Tree Classifier for Breast Cancer Data,” International Journal of Information Technology Convergence and Services, vol. 2, no. 1, pp. 17-24, Feb. 2012.
Ch. Rakesh, , D.N.D.Harini, , M. Bhanu Sridhar "An Empirical Analysis of Classification Algorithms for Medical Data"
Lichman, M. (2013). UCI Machine Learning Repository [http://archive.ics.uci.edu/ml]. Irvine, CA: University of California, School of Information and Computer Science.
Balint Antal, Andras Hajdu: An ensemble-based system for automatic screening of diabetic retinopathy, Knowledge-Based Systems 60 (April 2014), 20-27.
Frieden, B. R. (2004) Science from Fisher Information: A Unification. Cambridge Univ. Press. ISBN 0-521-00911-1.