簡易檢索 / 詳目顯示

研究生: 張岳庭
Dillon-Fleshman Brandon
論文名稱: 人工智慧技術於房價預測之比較
Comparison of Artificial Intelligence Techniques in House Price Prediction
指導教授: 周瑞生
Jui-Sheng Chou
口試委員: 周瑞生
Jui-Sheng Chou
鄭明淵
Min-Yuan Cheng
曾惠斌
Hui-Ping Tserng
周建成
Chien-Cheng Chou
學位類別: 碩士
Master
系所名稱: 工程學院 - 營建工程系
Department of Civil and Construction Engineering
論文出版年: 2020
畢業學年度: 108
語文別: 英文
論文頁數: 114
中文關鍵詞: 房價預測人工智慧機器學習資料探勘混和模型元啟發式演算法粒子群優化法
外文關鍵詞: House price prediction, Artificial Intelligence, Machine learning, Data mining, Hybrid model, Metaheuristic algorithm, Particle swarm optimization
相關次數: 點閱:248下載:0
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • (無)


    Real estate is one of the most critical investments in the household portfolio, which occupies the greatest proportion of the wealth of the private household in highly-developed countries. This research conducted a comprehensive review of machine learning techniques in house price prediction. The data of the dwelling transaction price were collected from real price registration system of the Ministry of the Interior, from 2013 through to 2017 in Taipei City, Taiwan. Four well-known artificial intelligent techniques, Artificial Neural Network, Support Vector Machine, Classification and Regression Tree, and Linear Regression were implemented for developing both the baseline and ensemble models. Additionally, a hybrid model was built to compare the predictive performance with the models in baseline and ensemble scenarios. The comprehensive comparison demonstrated that the PSO-Bagging-ANNs hybrid model outperforms other models in this investigation, as well as the previously proposed models in works of literature. With the provision of multiple prediction models, users are able to determine the most suitable technique based on their backgrounds, demands, and comprehension of artificial intelligence, for the house price prediction.

    ABSTRACT ACKNOWLEDGMENT TABLE OF CONTENTS LIST OF FIGURES LIST OF TABLES ABBREVIATION AND SYMBOLS Chapter 1: Introduction 1.1 Research Background and Motivations 1.2 Research Objectives 1.3 Research Process Chapter 2: Literature Review 2.1 Traditional Real Estate Valuation Methods 2.2 Hedonic Price Theory 2.3 Artificial Intelligence in House Price Prediction Chapter 3: Methodology 3.1 Box plot 3.2 Feature Selection Method 3.3 Baseline Prediction Models 3.3.1 Artificial Neural Networks 3.3.2 Support Vector Machine for classification and regression 3.3.3 Classification and Regression Tree 3.3.4 Linear Regression 3.4 Ensemble Prediction Models 3.4.1 Voting 3.4.2 Bagging 3.4.3 Stacking 3.5 Hybrid Models 3.6 Performance Evaluation 3.6.1 Cross-Fold Validation 3.6.2 Statistic Indicators Chapter 4: House Price Dataset 4.1 Data Preprocessing 4.1.1 Data Cleaning 4.1.2 Box Plot Elimination 4.2 Feature Description 4.2.1 Feature Collection 4.2.2 Feature Selection 4.2.3 Distance Calculation 4.2.4 Description of Residential School District Chapter 5: Model Development 5.1 Baseline Prediction Models 5.2 Ensemble Prediction Models 5.3 Hybrid Prediction Model Chapter 6: Model Performance and Discussion 6.1 Performance in Baseline Models 6.2 Performance in Ensemble Models 6.3 Performance in Hybrid Model 6.4 Comprehensive Comparison and Discussion Chapter 7: Conclusion and Recommendation References Appendix A. Main codes of the data cleaning process Appendix B. The demonstration of samples after data cleaning Appendix C. Collection of features from works of literature Appendix D. Main codes for calculating the distance between the residences and the nearest 3 MRT exits Appendix E. Main codes of intercepting the distance to the nearest MRT exit Appendix F. The actual distance between the buildings and YIMBYs along with NIMBYs Appendix G. The coordinates of YIMBYs Appendix H. The coordinates of NIMBYs Appendix I. The locations of YIMBYs in Taipei city Appendix J. The locations of NIMBYs in Taipei city Appendix K. MATLAB codes of PSO-Bagging-ANNs model K.1 MATLAB_PSO_Bagging_ANNs.m K.2 Costfun.m K.3 Object_Bagging_ANNs_Fun.m K.4 PSO.m K.5 Bagging_ANNs_Model_Result.m

    [1] C. Chiang, C.-C. Han, Y.-M. Chiang, T.-C. Tsai, F.-S. Wu, D. Seng, Market liquidity, funding liquidity in the news and housing price, Market Liquidity, Funding Liquidity in the News and Housing Price (February 15, 2015) (2015).
    [2] M. Munusamy, C. Muthuveerappan, M. Baba, M.N. Abdullah, M. Asmoni, An Overview of the Forecasting Methods Used in Real Estate Housing Price Modelling, Jurnal Teknologi 73 (5) (2015).
    [3] C. Fan, Z. Cui, X. Zhong, House Prices Prediction with Machine Learning Algorithms, Proceedings of the 2018 10th International Conference on Machine Learning and Computing, ACM, 2018, pp. 6-10.
    [4] K.A. Adetiloye, P.O. Eke, A review of real estate valuation and optimal pricing techniques, Asian Economic and Financial Review 4 (12) (2014) 1878-1893.
    [5] V. Kontrimas, A. Verikas, The mass appraisal of the real estate by computational intelligence, Appl. Soft Comput. 11 (1) (2011) 443-448.
    [6] S. Rosen, Hedonic Prices and Implicit Markets: Product Differentiation in Pure Competition, Journal of Political Economy 82 (1) (1974) 34-55.
    [7] Y. Xiao, Urban morphology and housing market, Springer, 2017.
    [8] V. Limsombunchai, House price prediction: hedonic price model vs. artificial neural network, New Zealand Agricultural and Resource Economics Society Conference, 2004, pp. 25-26.
    [9] N. Chaphalkar, S. Sandbhor, Use of artificial intelligence in real property valuation, International Journal of Engineering and Technology 5 (3) (2013) 4.
    [10] J.-S. Chou, D.-K. Bui, Modeling heating and cooling loads by artificial intelligence for energy-efficient building design, Energy and Buildings 82 (2014) 437-446.
    [11] R. Barzegar, J. Adamowski, A.A. Moghaddam, Application of wavelet-artificial intelligence hybrid models for water quality prediction: a case study in Aji-Chay River, Iran, Stochastic environmental research and risk assessment 30 (7) (2016) 1797-1819.
    [12] J.-S. Chou, C.-C. Ho, H.-S. Hoang, Determining quality of water in reservoir using machine learning, Ecological Informatics 44 (2018) 57-75.
    [13] J.Y. Wu, Housing Price prediction Using Support Vector Regression, (2017).
    [14] G.-Z. Fan, S.E. Ong, H.C. Koh, Determinants of house price: A decision tree approach, Urban Studies 43 (12) (2006) 2301-2315.
    [15] A. Varma, A. Sarma, S. Doshi, R. Nair, House Price Prediction Using Machine Learning And Neural Networks, 2018 Second International Conference on Inventive Communication and Computational Technologies (ICICCT), IEEE, 2018, pp. 1936-1939.
    [16] M. Claesen, B. De Moor, Hyperparameter search in machine learning, arXiv preprint arXiv:1502.02127 (2015).
    [17] A.N. Alfiyatin, R.E. Febrita, H. Taufiq, W.F. Mahmudy, Modeling House Price Prediction using Regression Analysis and Particle Swarm Optimization, INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS (2017).
    [18] X. Wang, J. Wen, Y. Zhang, Y. Wang, Real estate price forecasting based on SVM optimized by PSO, Optik-International Journal for Light and Electron Optics 125 (3) (2014) 1439-1443.
    [19] D.F. Williamson, R.A. Parker, J.S. Kendrick, The box plot: a simple visual method to interpret data, Annals of internal medicine 110 (11) (1989) 916-921.
    [20] K. Potter, H. Hagen, A. Kerren, P. Dannenmann, Methods for presenting statistical information: The box plot, Visualization of large and unstructured data sets 4 (2006) 97-106.
    [21] N.C. Schwertman, M.A. Owens, R. Adnan, A simple more general boxplot method for identifying outliers, Computational Statistics & Data Analysis 47 (1) (2004) 165-174.
    [22] R. Dawson, How Significant Is A Boxplot Outlier?, 2011.
    [23] I. Guyon, Andr, #233, Elisseeff, An introduction to variable and feature selection, J. Mach. Learn. Res. 3 (2003) 1157-1182.
    [24] Z.M. Hira, D.F. Gillies, A review of feature selection and feature extraction methods applied on microarray data, Advances in bioinformatics 2015 (2015).
    [25] J. Li, K. Cheng, S. Wang, F. Morstatter, R.P. Trevino, J. Tang, H. Liu, Feature selection: A data perspective, ACM Computing Surveys (CSUR) 50 (6) (2018) 94.
    [26] R.J. Urbanowicz, M. Meeker, W. La Cava, R.S. Olson, J.H. Moore, Relief-based feature selection: introduction and review, Journal of biomedical informatics (2018).
    [27] A. Fallahi, S. Jafari, An expert system for detection of breast cancer using data preprocessing and bayesian network, International Journal of Advanced Science and Technology 34 (2011) 65-70.
    [28] Y. Huang, P.J. McCullagh, N.D. Black, An optimization of ReliefF for classification in large datasets, Data & Knowledge Engineering 68 (11) (2009) 1348-1356.
    [29] R.-J. Palma-Mendoza, D. Rodriguez, L. De-Marcos, Distributed ReliefF-based feature selection in Spark, Knowledge and Information Systems (2018) 1-20.
    [30] I.S.H. Bahia, A Data Mining Model by Using ANN for Predicting Real Estate Market: Comparative Study, International Journal of Intelligence Science 3 (04) (2013) 162.
    [31] A.B. Khamis, N.K.K.B. Kamarudin, Comparative study on estimate house price using statistical and neural network model, International Journal of Scientific & Technology Research 3 (12) (2014) 126-131.
    [32] V.N. Vapnik, The nature of statistical learning theory, Springer-Verlag, 1995.
    [33] J.-S. Chou, K.-H. Yang, J.-Y. Lin, Peak Shear Strength of Discrete Fiber-Reinforced Soils Computed by Machine Learning and Metaensemble Methods, Journal of Computing in Civil Engineering 30 (6) (2016) 04016036.
    [34] J.-S. Chou, N.-T. Ngo, W.K. Chong, The use of artificial intelligence combiners for modeling steel pitting risk and corrosion rate, Engineering Applications of Artificial Intelligence 65 (2017) 471-483.
    [35] H.I. Erdal, O. Karakurt, Advancing monthly streamflow prediction accuracy of CART models using ensemble learning paradigms, Journal of Hydrology 477 (2013) 119-128.
    [36] N. Fumo, M.A. Rafe Biswas, Regression analysis for prediction of residential energy consumption, Renewable and Sustainable Energy Reviews 47 (2015) 332-343.
    [37] J.-S. Chou, D.-S. Tran, Forecasting energy consumption time series using machine learning techniques based on usage patterns of residential householders, Energy 165 (2018) 709-726.
    [38] B. Qian, K. Rasheed, Stock market prediction with multiple classifiers, Applied Intelligence 26 (1) (2007) 25-33.
    [39] D.H. Wolpert, Stacked generalization, Neural networks 5 (2) (1992) 241-259.
    [40] P. Kazienko, E. Lughofer, B. Trawiński, Hybrid and ensemble methods in machine learning J. UCS special issue, J Univers Comput Sci 19 (4) (2013) 457-461.
    [41] K. Hammouche, M. Diaf, P. Siarry, A comparative study of various meta-heuristic techniques applied to the multilevel thresholding problem, Engineering Applications of Artificial Intelligence 23 (5) (2010) 676-688.
    [42] J. Kennedy, R. Eberhart, Particle swarm optimization, Proceedings of ICNN'95 - International Conference on Neural Networks, Vol. 4, 1995, pp. 1942-1948 vol.1944.
    [43] M. Hasanipanah, M. Noorian-Bidgoli, D.J. Armaghani, H. Khamesi, Feasibility of PSO-ANN model for predicting surface settlement caused by tunneling, Engineering with Computers 32 (4) (2016) 705-715.
    [44] D.J. Armaghani, R.S.N.S.B. Raja, K. Faizi, A.S.A. Rashid, Developing a hybrid PSO–ANN model for estimating the ultimate bearing capacity of rock-socketed piles, Neural Computing and Applications 28 (2) (2017) 391-405.
    [45] R. Kohavi, A study of cross-validation and bootstrap for accuracy estimation and model selection, Ijcai, Vol. 14, Montreal, Canada, 1995, pp. 1137-1145.
    [46] K.W. Chau, T. Chin, A critical review of literature on the hedonic price model, International Journal for Housing Science and Its Applications 27 (2) (2003) 145-165.
    [47] B. Geng, H. Bao, Y. Liang, A study of the effect of a high-speed rail station on spatial variations in housing price based on the hedonic model, Habitat International 49 (2015) 333-339.
    [48] Z. Huang, R. Chen, D. Xu, W. Zhou, Spatial and hedonic analysis of housing prices in Shanghai, Habitat International 67 (2017) 69-78.
    [49] L.H. Rong, Y.M. Sun, The Analysis of Second-hand Housing Price Influencing Factors based on Hedonic Model and WEB Information, Applied Mechanics and Materials, Vol. 587, Trans Tech Publ, 2014, pp. 2285-2289.
    [50] W. Wei, T. Guang-ji, Z. Hong-rui, Empirical analysis on the housing price in Harbin City based on hedonic model, 2010 International Conference on Management Science & Engineering 17th Annual Conference Proceedings, IEEE, 2010, pp. 1659-1664.
    [51] H.-Z. Wen, J. Sheng-hua, G. Xiao-yu, Hedonic price analysis of urban housing: an empirical research on Hangzhou, China, Journal of Zhejiang University-Science A 6 (8) (2005) 907-914.
    [52] J. Wu, M. Wang, W. Li, J. Peng, L. Huang, Impact of urban green space on residential housing prices: Case study in Shenzhen, Journal of Urban Planning and Development 141 (4) (2014) 05014023.
    [53] X. Wang, The application of SPSS in empirical research of housing hedonic price, 2011 International Conference on Multimedia Technology, IEEE, 2011, pp. 3262-3265.
    [54] H. Zhang, M. Zhang, Environment hedonic price analysis: Evidence from Jilin city, 2010 Second International Conference on Communication Systems, Networks and Applications, Vol. 1, IEEE, 2010, pp. 354-356.
    [55] 杜宇璇, 宋豐荃, 曾禹瑄, 葛仲寧, 陳奉瑤, 台灣特徵價格模型之回顧分析, 土地問題研究季刊 12 (2) (2013) 44-57.
    [56] 林素菁, 台北市國中小明星學區邊際願意支付之估計, JOURNAL OF HOUSING 13 (1) (2004).
    [57] 楊宗憲, 地政, 迎毗設施與鄰避設施對住宅價格影響之研究, JOURNAL OF HOUSING 20 (2) (2011).
    [58] 江穎慧, 莊喻婷, 張金鶚, 臺北市公共自行車場站對鄰近住宅價格之影響, 運輸計劃季刊 46 (4) (2017) 399-428.
    [59] 黃怡潔, 江穎慧, 張金鶚, 臺北市公共住宅對周圍住宅價格之影響, Journal of City and Planning 44 (3) (2017) 277-302.
    [60] S. Kamaruddin, V. Ravi, Credit card fraud detection using big data analytics: Use of psoaann based one-class classification, Proceedings of the International Conference on Informatics and Analytics, ACM, 2016, p. 33.
    [61] H. Yu, W. Xiaohui, PSO-based Energy-balanced Double Cluster-heads Clustering Routing for wireless sensor networks, Procedia Engineering 15 (2011) 3073-3077.
    [62] Z. Zhan, J. Zhang, Y. Li, H.S. Chung, Adaptive Particle Swarm Optimization, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics) 39 (6) (2009) 1362-1381.
    [63] H. Kaur, D.S. Salaria, Bayesian regularization based neural network tool for software effort estimation, Global Journal of Computer Science and Technology (2013).
    [64] K. Aggarwal, Y. Singh, P. Ch, M. Puri, Bayesian regularization in a neural network model to estimate lines of code using function points, (2005).
    [65] I. Isa, Z. Saad, S. Omar, M. Osman, K. Ahmad, H.M. Sakim, Suitable MLP network activation functions for breast cancer and thyroid disease detection, 2010 Second International Conference on Computational Intelligence, Modelling and Simulation, IEEE, 2010, pp. 39-44.
    [66] B. Karlik, A.V. Olgac, Performance analysis of various activation functions in generalized MLP architectures of neural networks, International Journal of Artificial Intelligence and Expert Systems 1 (4) (2011) 111-122.
    [67] K. Methaprayoon, C. Yingvivatanapong, W.-J. Lee, J.R. Liao, An integration of ANN wind power estimation into unit commitment considering the forecasting uncertainty, IEEE Transactions on Industry Applications 43 (6) (2007) 1441-1448.
    [68] S. Muralidharan, K. Phiri, S.K. Sinha, B. Kim, ANALYSIS AND PREDICTION OF REAL ESTATE PRICES: A CASE OF THE BOSTON HOUSING MARKET, Issues in Information Systems 19 (2) (2018) 109-118.
    [69] A.G. Sarip, M.B. Hafez, Fuzzy logic application for house price prediction, International Journal of Property Sciences (E-ISSN: 2229-8568) 5 (1) (2015).
    [70] J. Núñez-Tabales, F. Rey Carmona, J. Caridad, Implicit Prices in Urban Real Estate Valuation, 2013.

    無法下載圖示 全文公開日期 2025/07/27 (校內網路)
    全文公開日期 本全文未授權公開 (校外網路)
    全文公開日期 本全文未授權公開 (國家圖書館:臺灣博碩士論文系統)
    QR CODE