The Review of Attributes Influencing Housing Prices using Data Mining Methods

Authors

  • Pelin Kasap Department of Statistics, Ondokuz May?s University, Samsun, Turkey
  • Bur Department of Statistics, Ondokuz May?s University, Samsun, Turkey

Keywords:

Data mining, CRISP-DM, CART algorithm, C5 algorithm, Neural networks model.

Abstract

Prices of housing show an alteration throughout the world. The reason of this is characteristics of housing such as age, tax, number of rooms, per capita crime rate by town and so on. In this study, we carry out the stages of the CRISP-DM process to investigate attributes influencing prices of housing. We use CART, C5 decision tree algorithms and Neural networks model in modeling phase. Also, we use cross-validation method in modeling evaluation phase. It is shown that C5 model is the most appropriate model with the highest validation rate.

References

Breiman, L. Friedman, J.H., Olshen, R.A., Stone, C.J. Classification and Regression Trees. Chapman & Hall, New York, 1984.

Chapman P., Clinton J., Kerber R., Khabaz T., Reinartz T., Shearer C. and Wirth R. 2000. Step-by step Data Mining Guide, SPSS Inc..

Cortez, P. 2006. Data mining with Neural Networks and Support Vector Machines using the R/rminer tool, FCT grant PTDC/EIA/64541/2006, http://www.r-project.org/.

Fayyad, U.M. 1996. Advances in Knowledge Discovery and Data Mining, Menlo Park, CA: AAAI&MIT Press, USA.

Fayyad, U., Shapiro, G.P. and Smyth, P. 1996. From Data Mining to Knowledge Discovery in Databases, Al Magazine, Volume 17, Number 3, pp. 37-54, AAAI.

Friedman, J.H. 1997. Data Mining and Statistics: What is the Connection?, Proceedings of the 29th Symposium on the Interface Between Computer Science and Statistics, Houston, Texas, May 14-17, University of Huston.

Gatnar E., Rozmus D. Data Mining-The Polish Experience. In Innovations in Classification, Data Science, and Information Systems, pp. 217-223, Springer Berlin Heidelberg, 2005.

Gaur, P. 2012. Neural networks in data mining, International Journal of Electronic and Computer Science Engineering (IJECSE, ISSN-2277-1956), Vol.1, pp.1449-1453.

Harrison, D. and Rubinfeld, D.L. 1978. 'Hedonic prices and the demand for clean air', J. Environ. Economics & Management, Vol. 5, 81-102.

John GH., Enhancements to the data mining process (Doctoral dissertation, stanford university), 1997.

Kantardzic, M. 2011. Data Mining Concepts, Models, Methods and Algorithms, Second Edition, IEEE Press, A John Wiley&Sons, Inc.

Kasih, J., Ayub, M and Susanto, S. 2013. Predicting students

Kovalerchuk, B., Vityaev, E., 2000. Data Mining in Finance: Advances in Relational and Hybrid Methods. Springer Science & Business Media, 2000 edition, ISBN-10: 0792378040, ISBN-13: 978-0792378044

Marmelstein, R.E., Hammack, L.P. and Lamont, G.B. 1999. "Concurrent approach for evolving compact decision rule sets", Proc. SPIE 3695, Data Mining and Knowledge Discovery: Theory, Tools, and Technology, 52 (February 25, 1999); doi:10.1117/12.339990; http://dx.doi.org/10.1117/12.339990

Quinlan, J.R. C4.5: Programs for Machine Learning. Morgan Kaufman. 1993.

Parsaye, K. 1997. OLAP and Data Mining: Bridging the Gap. Database Programming and Design, 10, pp.30-37.

Patil, N., Lathi, R. and Chitre V. 2012. Comparison of C5 & CART Classification algorithms using pruning technique, International Journal of Engineering Research&Technology (IJERT), ISSN:2278-0181, Vol.1, Issue 4.

Pnandya, R. and Pandya, J. 2015. C5 Algorithm to improved decision tree with feature selection and reduced error pruning, International Journal of Computer Applications, Vol. 117, No. 16.

Ramakrishnan, R. and Gehrke, J. 2002. Database Management Systems, 3rd Edition, McGraw-Hill Professional.

Ripundeep Singh Gill and Ashima, 2014. Neural networks in data mining, IOSR Journal of Engineering (IOSRJEN), Vol. 4, Issue 3, pp.1-6.

Romero, C., Ventura, S., Espejo, P.G. and Hervas, C. 2008. Data Mining Algorithms to Classify Students, The 1st International Conference on Educational Data Mining, Montreal, Quebec, Canada, pp. 8-18. (June 20-21).

Shamim, A., Shaikh, M.U. and Malik, S.R. 2010. Intelligent Data Mining in Autonomous Heterogeneous Distributed Bio Databases, Second International Conference on Computer Engineering and Applications.

Singh Y. and Chauhan, A.S. 2009. Neural networks in data mining, Journal of Theoretical and Applied Information Technology, Vol.5, No.1, pp.37-42.

Wirth, R. and Hipp, J. 2000. CRISP-DM: Towards a Standard Process Model for Data Mining, Proceedings of the 4th International Conference on the Practical Applications of Knowledge Discovery and Data Mining, pp.29-39, Manchester, UK.

Zekulin AD, Busche FD, U.S. Patent No.6,430,547. Washington, DC: U.S. Patent and Trademark Office. 2002.

Downloads

Published

2017-06-15

How to Cite

Kasap, P., & Bur. (2017). The Review of Attributes Influencing Housing Prices using Data Mining Methods. International Journal of Sciences: Basic and Applied Research (IJSBAR), 34(1), 155–165. Retrieved from https://www.gssrr.org/index.php/JournalOfBasicAndApplied/article/view/7580

Issue

Section

Articles