簡易檢索 / 詳目顯示

研究生: 林芳婷
Lisayuri - Limanto
論文名稱: A Hybrid Inference Model Based on Synthetic Minority Over-sampling Technique and Evolutionary Least Square SVM for Predicting Construction Contractor Default Status
A Hybrid Inference Model Based on Synthetic Minority Over-sampling Technique and Evolutionary Least Square SVM for Predicting Construction Contractor Default Status
指導教授: 鄭明淵
Min-Yuan Cheng
口試委員: 姚乃嘉
Nie-Jia Yau
晁立中
Chih-Chao Chung
周瑞生
Jui-Sheng Chou
學位類別: 碩士
Master
系所名稱: 工程學院 - 營建工程系
Department of Civil and Construction Engineering
論文出版年: 2013
畢業學年度: 101
語文別: 英文
論文頁數: 118
中文關鍵詞: Contractor Default Status PredictionSynthetic Minority Over-sampling TechniqueLeast Squares Support Vector MachineDifferential EvolutionSyntheticContractor Default Status Prediction
外文關鍵詞: Differential Evolution, Least Squares Support Vector Machine, Synthetic Minority Over-sampling Technique, Contractor Default Status Prediction, Synthetic, Contractor Default Status Prediction
相關次數: 點閱:286下載:3
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報

Construction industry has several typical characteristics that are different compared to other economy sectors, including the dependability among project stakeholders. Thus, financial status of a construction company is an important issue in the construction industry. Assessing the financial status is challenging and the mapping relationship of input factors and the financial status of a company is very complicated. To avoid biased result and represent company’s financial condition, all available construction firm-years data in verified database center is employed in this study which caused imbalanced issue. This paper presents a hybrid inference model based on the financial ratios to estimate the contractor financial performance. The proposed model is constructed by combining Synthetic Minority Over-sampling Technique (SMOTE), Least Squares Support Vector Machine (LS-SVM), and Differential Evolution (DE). In the new model, SMOTE acts imbalanced dataset problem handling of non-default and default samples, LS-SVM is used as a supervised machine learning technique for classification, and DE is employed for specifying the optimal parameter of LS-SVM. A total record of 1695 construction contractor firm-years observations from 84 non-defaulted and 28 defaulted companies is collected and used to train and validate the proposed model. The Area Under the Curve (AUC) is utilized as the performance measurement of prediction results. As shown in the experimental results, the proposed models (AUC=0.98463) outperformed the other benchmark models, including Evolutionary Support Vector Machine Inference Model (ESIM), Support Vector Machine (SVM), Artificial Neural Network (ANN), and logistic regression. Therefore, the proposed approach is a promising alternative for predicting contractor default status.


Construction industry has several typical characteristics that are different compared to other economy sectors, including the dependability among project stakeholders. Thus, financial status of a construction company is an important issue in the construction industry. Assessing the financial status is challenging and the mapping relationship of input factors and the financial status of a company is very complicated. To avoid biased result and represent company’s financial condition, all available construction firm-years data in verified database center is employed in this study which caused imbalanced issue. This paper presents a hybrid inference model based on the financial ratios to estimate the contractor financial performance. The proposed model is constructed by combining Synthetic Minority Over-sampling Technique (SMOTE), Least Squares Support Vector Machine (LS-SVM), and Differential Evolution (DE). In the new model, SMOTE acts imbalanced dataset problem handling of non-default and default samples, LS-SVM is used as a supervised machine learning technique for classification, and DE is employed for specifying the optimal parameter of LS-SVM. A total record of 1695 construction contractor firm-years observations from 84 non-defaulted and 28 defaulted companies is collected and used to train and validate the proposed model. The Area Under the Curve (AUC) is utilized as the performance measurement of prediction results. As shown in the experimental results, the proposed models (AUC=0.98463) outperformed the other benchmark models, including Evolutionary Support Vector Machine Inference Model (ESIM), Support Vector Machine (SVM), Artificial Neural Network (ANN), and logistic regression. Therefore, the proposed approach is a promising alternative for predicting contractor default status.

ABSTRACT ACKNOWLEDGEMENTS ABBREVIATIONS AND SYMBOLS LIST OF FIGURES LIST OF TABLES CHAPTER 1 INTRODUCTION 1.1. Research Background and Motivation 1.2. Research Objective 1.3. Research Scope and Assumption 1.4. Research Methodology 1.4.1. Problem Formulation 1.4.2. Literature Review 1.4.3. Construction Contractor Default Status Model Construction 1.4.4. Prediction Result 1.4.5. Conclusion 1.5. Research Outline CHAPTER 2 LITERATURE REVIEW 2.1. Default Prediction in the Construction Industry 2.2. Least Square Support Vector Machine for Classification 2.3. Differential Evolution 2.4. Synthetic Minority Over-sampling Technique for Solving Imbalance Problem CHAPTER 3 MODEL DEVELOPMENT 3.1. Data Collection 3.1.1. Data Resource 3.1.2. Financial Ratio Selection 3.2. Data Preprocessing and Assumption 3.3. Hybrid SMOTE-ELSIM for Construction Contractor Default Status 3.4. Performance Validation and Measurement Approach 3.4.1. Cross Fold Validation 3.4.2. Receiver Operating Characteristic (ROC) Curve CHAPTER 4 RESULT AND DISCUSSION 4.1. Experimental Type 1 Result 4.2. Experimental Type 2 Result 4.3. Result Comparison and Discussion CHAPTER 5 CONCLUSION AND RECOMMENDATION 5.1. Conclusion 5.2. Future Recommendation REFERENCES APPENDIX

(2011). Wharton Research Data Services, The Wharton School of the University of Pennsylvania.
Abidali, A. F. and F. Harris (1995). "A methodology for predicting company failure in the construction industry." Construction Management and Economics 13(3): 189-196.
Abou El Ela, A. A., M. A. Abido and S. R. Spea (2010). "Optimal power flow using differential evolution algorithm." Electric Power Systems Research 80(7): 878-885.
Adeyemo, J. and F. Otieno (2010). "Differential evolution algorithm for solving multi-objective crop planning model." Agricultural Water Management 97(6): 848-856.
Agarwal, V. and R. Taffler (2008). "Comparing the performance of market-based and accounting-based bankruptcy prediction models." Journal of Banking & Finance 32(8): 1541-1551.
Ahn, H., K. Lee and K.-j. Kim (2006). Global Optimization of Support Vector Machines Using Genetic Algorithms for Bankruptcy Prediction. Neural Information Processing. I. King, J. Wang, L.-W. Chan and D. Wang, Springer Berlin Heidelberg. 4234: 420-429.
Al-Sobiei, O., D. Arditi and G. Polat (2005). "Managing Owner’s Risk of Contractor Default." Journal of Construction Engineering and Management 131(9): 973-978.
Altman, E. I. (1968). "Financial Ratios, Discriminant Analysis and The Prediction of Corporate Bankruptcy." The Journal of Finance 23(4): 589-609.
Arlot, S. and A. Celisse (2010). "A survey of cross-validation procedures for model selection." Statistics Surveys 4: 40-79.
Beaver, W. (1966). "Financial Ratios As Predictors of Failure." Journal of Accounting Research 4: 71-102.
Bishop, C. M. (2006). Pattern recognition and machine learning, springer New York.
Black, F. and M. Scholes (1973). "The Pricing of Options and Corporate Liabilities." The Journal of Political Economy 81(3): 637-654.
Brockman, P. and H. J. Turtle (2003). "A barrier option framework for corporate security valuation." Journal of Financial Economics 67(3): 511-529.
Chakraborty, U. K. (2008). Advances in differential evolution, Springer-Verlag New York Incorporated.
Chawla, N., K. Bowyer, L. Hall and P. Kegelmeyer (2002). "SMOTE: Synthetic Minority Over-sampling Technique." Journal of Artificial Intelligence Research 16: 321-357.
Chawla, N. V., K. W. Bowyer, L. O. Hall and W. P. Kegelmeyer (2002). "SMOTE: Synthetic Minority Over-sampling Technique." Journal of Artificial Intelligence Research 16: 321-357.
Cheng, M.-Y. and N.-D. Hoang (2013). "Groutability Prediction of Microfine Cement Based Soil Imprevoment Using Evolutionary LS-SVM Inference Model." Journal of Civil Engineering and Management (In Press).
Cheng, M.-Y., N.-D. Hoang, A. F. V. Roy and Y.-W. Wu (2012). "A novel time-depended evolutionary fuzzy SVM inference model for estimating construction project at completion." Eng. Appl. Artif. Intell. 25(4): 744-752.
Cheng, M.-Y. and A. F. V. Roy (2011). "Evolutionary fuzzy decision model for cash flow prediction using time-dependent support vector machines." International Journal of Project Management 29(1): 56-65.
Cheng, M.-Y. and Y.-W. Wu (2009). "Evolutionary support vector machine inference system for construction management." Automation in Construction 18(5): 597-604.
Chua, K. S. (2003). "Efficient computations for large least square support vector machine classifiers." Pattern Recognition Letters 24(1–3): 75-80.
Coelho, L. d. S., R. C. T. Souza and V. C. Mariani (2009). "Improved differential evolution approach based on cultural algorithm and diversity measure applied to solve economic load dispatch problems." Mathematics and Computers in Simulation 79(10): 3136-3147.
De Brabanter, K., P. Karsmakers, F. Ojeda, C. Alzate, J. De Brabanter, K. Pelckmans, B. De Moor, J. Vandewalle and J. Suykens (2011). "LS-SVMlab Toolbox User’s Guide." KU Leuven Leuven, Belgium.
Deakin, E. B. (1976). Distributions of financial accounting ratios: some empirical evidence.
Dichev, I. D. (1998). "Is the Risk of Bankruptcy a Systematic Risk?" The Journal of Finance 53(3): 1131-1147.
Edum-Fotwe, F., A. Price and A. Thorpe (1996). "A review of financial ratio tools for predicting contractor insolvency." Construction Management and Economics 14(3): 189-198.
Fan, A. and M. Palaniswami (2000). Selecting bankruptcy predictors using a support vector machine approach. Neural Networks, 2000. IJCNN 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on.
Fawcett, T. (2004). "ROC graphs: Notes and practical considerations for researchers." Machine Learning 31: 1-38.
Flach, P. and S. Wu (2003). Repairing concavities in ROC curves. Proc. 2003 UK workshop on computational intelligence, Citeseer.
Gao, Y., X. Wang, Y. Cheng and J. Pan (2010). "Fault diagnosis using a probability least squares support vector classification machine." Mining Science and Technology (China) 20(6): 917-921.
Geisser, S. (1993). Predictive inference: an introduction, CRC Press.
Glotsos, D., I. Kalatzis, P. Spyridonos, S. Kostopoulos, A. Daskalakis, E. Athanasiadis, P. Ravazoula, G. Nikiforidis and D. Cavouras (2008). "Improving accuracy in astrocytomas grading by integrating a robust least squares mapping driven support vector machine classifier into a two level grade classification scheme." Computer Methods and Programs in Biomedicine 90(3): 251-261.
Gupta, L. (1983). Financial ratios for monitoring corporate sickness, Oxford University Press, Delhi, India.
Haifeng, W. and H. Dejin (2005). Comparison of SVM and LS-SVM for Regression. Neural Networks and Brain, 2005. ICNN&B '05. International Conference on.
Kangari, R., F. Farid and H. M. Elgharib (1992). "Financial performance analysis for construction industry." Journal of Construction Engineering and Management 118(2): 349-361.
Kiranyaz, S., T. Ince, A. Yildirim and M. Gabbouj (2009). "Evolutionary artificial neural networks by multi-dimensional particle swarm optimization." Neural Networks 22(10): 1448-1462.
Li, X., P. Nie, Z.-J. Qiu and Y. He (2011). "Using wavelet transform and multi-class least square support vector machine in multi-spectral imaging classification of Chinese famous tea." Expert Systems with Applications 38(9): 11149-11159.
Lima, C. A. M., A. L. V. Coelho and M. Eisencraft (2010). "Tackling EEG signal classification with least squares support vector machines: A sensitivity analysis study." Computers in Biology and Medicine 40(8): 705-714.
Lu, Y., J. Zhou, H. Qin, Y. Li and Y. Zhang (2010). "An adaptive hybrid differential evolution algorithm for dynamic economic dispatch with valve-point effects." Expert Systems with Applications 37(7): 4842-4849.
Luts, J., G. Molenberghs, G. Verbeke, S. Van Huffel and J. A. K. Suykens (2012). "A mixed effects least squares support vector machine model for classification of longitudinal data." Computational Statistics & Data Analysis 56(3): 611-628.
Merton, R. C. (1974). "On The Pricing of Corporate Debt: The Risk Structure of Interest Rates." The Journal of Finance 29(2): 449-470.
Metz, C. E. (1978). "Basic principles of ROC analysis." Seminars in Nuclear Medicine 8(4): 283-298.
Meyer, P. A. and H. W. Pifer (1970). "Prediction of Bank Failures." The Journal of Finance 25(4): 853-868.
Min, J. H. and Y.-C. Lee (2005). "Bankruptcy prediction using support vector machine with optimal choice of kernel function parameters." Expert Systems with Applications 28(4): 603-614.
Newton, G. W. (1994). Bankruptcy and Insolvency Accounting, Forms and Exhibits, John Wiley & Sons.
Ohlson and A. James (1980). "Financial Ratios and the Probabilistic Prediction of Bankruptcy." Journal of Accounting Research 18(1): 109.
Onwubolu, G. and D. Davendra (2009). Differential Evolution for Permutation—Based Combinatorial Problems, Springer.
Price, K., R. Storn and J. Lampinen (2005). Differential Evolution A Practical Approach to Global Optimization, Springer-Verlag.
Russell, J. and H. Zhai (1996). "Predicting Contractor Failure Using Stochastic Dynamics of Economic and Financial Variables." Journal of Construction Engineering and Management 122(2): 183-191.
Samui, P. (2011). "Least square support vector machine and relevance vector machine for evaluating seismic liquefaction potential using SPT." Natural hazards 59(2): 811-822.
Samui, P. and D. P. Kothari (2011). "Utilization of a least square support vector machine (LSSVM) for slope stability analysis." Scientia Iranica 18(1): 53-58.
Severson, G., J. Russell and E. Jaselskis (1994). "Predicting Contract Surety Bond Claims Using Contractor Financial Data." Journal of Construction Engineering and Management 120(2): 405-420.
Storn, R. and K. Price (1997). "Differential evolution–a simple and efficient heuristic for global optimization over continuous spaces." Journal of global optimization 11(4): 341-359.
Sun, F., W. Zhong, H. Cheng and F. Qian (2013). "Novel Control Vector Parameterization Method with Differential Evolution Algorithm and Its Application in Dynamic Optimization of Chemical Processes." Chinese Journal of Chemical Engineering 21(1): 64-71.
Suykens, J. A. and J. Vandewalle (1999). "Least squares support vector machine classifiers." Neural processing letters 9(3): 293-300.
Suykens, J. A. K., T. V. Gestel, J. D. Brabanter, B. D. Moor and J. Vandewalle (2002). Least Squares Support Vector Machines, World Scientific Publishing Co. Pte. Ltd.
Taffler, R. J. (1984). "Empirical models for the monitoring of UK corporations." Journal of Banking & Finance 8(2): 199-227.
Theodossiou, P. T. (1993). "Predicting Shifts in the Mean of a Multivariate Time Series Process: An Application in Predicting Business Failures." Journal of the American Statistical Association 88(422): 441-449.
Thien, M. H. (1997). "Off-Line, Handwritten Numeral Recognition by Perturbation Method." IEEE Transactions on Pattern Analysis and Machine Intelligence 19(5): 535-539.
Thomas Ng, S., J. M. W. Wong and J. Zhang (2011). "Applying Z-score model to distinguish insolvent construction companies in China." Habitat International 35(4): 599-607.
Tsai, L.-K., H.-P. Tserng, H.-H. Liao, P.-C. Chen and W.-P. Wang (2012). "Integration of Accounting-Based and Option-Based Models to Predict Construction Contractor Default." Journal of Marine Science and Technology 20(5): 479-484.
Tserng, H., H. Liao, E. Jaselskis, L. Tsai and P. Chen (2012). "Predicting Construction Contractor Default with Barrier Option Model." Journal of Construction Engineering and Management 138(5): 621-630.
Tserng, H. P., G.-F. Lin, L. K. Tsai and P.-C. Chen (2011). "An enforced support vector machine model for construction contractor default prediction." Automation in Construction 20(8): 1242-1249.
Tserng, H. P. and T. A. Nguyen (2012). Integration of Merton Model and Logit Model with Over-sampling Tehnique to Predict Construction Contractor Default, National Taiwan University. Master Degree.
Wilson, R. L. and R. Sharda (1994). "Bankruptcy prediction using neural networks." Decision Support Systems 11(5): 545-557.
Wong, K. P. and Z. Dong (2005). Differential evolution, an alternative approach to evolutionary algorithm. Intelligent Systems Application to Power Systems, 2005. Proceedings of the 13th International Conference on, IEEE.
Xin, N., X.-F. Gu, H. Wu, Y.-Z. Hu and Z.-L. Yang (2012). "Discrimination of raw and processed Dipsacus asperoides by near infrared spectroscopy combined with least squares-support vector machine and random forests." Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy 89(0): 18-24.
Xu, X.-s., T. Chen and R.-e. Ren (2006). Studying on Forecasting the Enterprise Bankruptcy Based on SVM. Management Science and Engineering, 2006. ICMSE '06. 2006 International Conference on.
Yu-Kun, B., L. Zhi-Tao, G. Lei and W. Wen (2005). Forecasting stock composite index by fuzzy support vector machines regression. Machine Learning and Cybernetics, 2005. Proceedings of 2005 International Conference on.
Yu, L., H. Chen, S. Wang and K. K. Lai (2009). "Evolving least squares support vector machines for stock market trend mining." Evolutionary Computation, IEEE Transactions on 13(1): 87-102.
Yuan, Y., R. Zhang and L. Luo (2009). "Classification study of novel piperazines as antagonists for the melanocortin-4 receptor based on least-squares support vector machines." Chemometrics and Intelligent Laboratory Systems 96(2): 144-148.
Zhang, G., M. Y. Hu, B. Eddy Patuwo and D. C. Indro (1999). "Artificial neural networks in bankruptcy prediction: General framework and cross-validation analysis." European Journal of Operational Research 116(1): 16-32.
Zhang, S., C. Zhang and Q. Yang (2003). "Data preparation for data mining." Applied Artificial Intelligence 17(5-6): 375-381.
Zmijewski, M. E. (1984). "Methodological issues related to the estimation of financial distress prediction models." Journal of Accounting Research 22: 59-82.
Zou, D., H. Liu, L. Gao and S. Li (2011). "An improved differential evolution algorithm for the task assignment problem." Engineering Applications of Artificial Intelligence 24(4): 616-624.

QR CODE