Document Type : Original Research Paper


1 Department of industrial Engineering, university of Torbat Heydarieh, Torbat Heydarieh,Iran

2 Department of Industrial Engineering, Ferdowsi University of Mashhad, Iran


Background and Objectives: Nowadays, significant advancements in information technology and communication field in different societies are seen. Given that these advancements, universities as a leading institution in the field of science, have moved towards electronic processes in the management of education and educational environments, there are databases with a large amount of information. By analyzing this massive data of educational systems, methods can be provided to improve the educational status of students. Educational data mining has sought to discover the knowledge contained in the data of the educational system. One of the applications of educational data mining is to predict students' academic performance. Predicting students' academic performance and providing useful solutions is of particular importance in the success of educational systems and can help managers make the right decisions to increase the efficiency of the educational system and better student performance. The purpose of this paper is to identify the effective indicators on academic performance, predict students' academic status using data mining techniques, and finally present a new trend for modifying unit selection and educational strategies to increase the efficiency of the education system.
Methods: steps of this research are determined according to CRISP model. In current research, Databases containing 9 datasets of specialized courses in industrial engineering were used. The students' grade was bachelor's degree. Indicators affecting student performance have been identified based on previous researches and expert opinions. Demographic data and academic records of undergraduate students are entered in database. After data preprocessing, 13 attributes are selected, different models were proposed to predict student's academic status in the next semester. Then, a comparison between the results of 4 different algorithms has been done.
Findings: All 13 attributes are identified to be effective according to information gain and gain ratio. This 13 attributes as follow: GPA, Total passed units, Number of conditional terms, Type of admission, Marital status, Gender, University admission year, Living place , Age, Current semester, Prerequisite course score, instructor of the course, Repeat the course. Between of 4 considered models, the Logit Boost algorithm is known as the best model in categorizing in two class and multi-class according to the accuracy rate and ROC.
Conclusion: Because of acceptable performance of data mining algorithms, the use of these algorithms in predicting student performance is appropriate and the proposed model can be used as a support tool for decision making in educational systems. Finally, according to the obtained results and the opinion of academic experts, the unit selection process was redesigned. The proposed model can be used as a decision support tool in educational systems. Finally, due to the results obtained and the opinions of the academic experts, the process of unit selection was redesigned. The presented process uses the available data in educational systems and data mining science, provides useful knowledge to decision-makers to make the right and appropriate decision. Decision makers can make appropriate decisions by examining the predictions made by the data mining algorithm and obtaining useful information, in order to make the educational system more efficient.


Main Subjects

©2020 The author(s). This is an open access article distributed under the terms of the Creative Commons Attribution (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, as long as the original authors and source are cited. No permission is required from the authors or the publishers. 

[1] Pokay P, Blumenfeld PC. Predicting achievement early and late in the semester: the role of motivation and use of learning strategies. Journal of Educational Psychology. 1990; 82(1): 41-50.
[6] Akour I. Factors influencing faculty computer literacy and use in Jordan: A multivariate analysis. D.B.A. dissertation. Louisiana Tech University, United States, Louisiana; 2010.
[10] Yang M. Data Mining Techniques Applied to Texas Woman’s University’s Enrollment data – What Can the Data Tell us? [master’s thesis]. US: Texaz Woman’s University; 2006.
[19] Shahiri AM, Husain W, Rashid NA. A Review on Predicting Student’s Performance using Data Mining Techniques. The Third Information Systems International Conference. Procedia Computer Science. 2015; 72: 414-422.
[25] Strecht P, Cruz L, Soares C, Merdes-Moreria J, Abren R. A Comparative Study of Classification and Regression Algorithms for Modelling Students’ Academic Performance. Madrid, Spain: Paper presented at the 8th International Conference on Educational Data Mining. 392-395; 2015.
[27] Nghe TN, Janecek P, Haddawy P. A Comparative Analysis of Techniques for Predicting Academic Performance. Paper presented in the 37th ASEE /IEEE Frontiers in Education Conference; 2007.
[28] Kabakchieva D, Stefanova K, Kismov VS. Analyzing University Data for Determining Student Profiles and Predicting Performance. In 4th International Conference on Educational Data Mining. the Netherlands; 2011.
[32] Kaur P, Singh M, Josan GS. Classification and prediction based data mining algorithms to predict slow learners in education sector. In 3th International Conference on Recent Trends in Computing 2015(ICRTC-2015). Procedia Computer Science. 2015; 57: 500-508. 
[39] Tari M, Minai B, Farahi A, Niknam Pirzadeh M. Prediction of educational performance using CART algorithm, neural network and predictive accuracy increase using combination model. 3rd Iranian Conference on Electrical and Electronics Engineering (ICEEE2011), Gonabad; 2011. Persian.
[41] Rahmati A, Lesani M, Khalilzadeh R. The related factors to the students of Shahid Bahonar Kerman in 2009-2010 and its analytical model. Kerman: Shahid Bahonar University; 2012.  Persian.
[43] Shukor NA, Tasir Z, Meijden HV.  An examination of online learning effectiveness using data mining. Global Conference on Business & Social Science. Kuala Lumpur; 2014.
[47] Gitue A. The intelligent algorithm is to advise on the selection of student units based on the analysis of the chart of courses, the prediction of the grades and the pattern chosen by the previous students. University of Kordestan; 2014. Persian.
[49] Wirth R, Hipp J. CRISP-DM: Towards a Standard Process Model for Data Mining. In Proceedings of the Fourth International Conference on the Practical Application of Knowledge Discovery and Data Mining; 2000.
[54] Han J, Kamber M, Pei J.  Data Mining: Concepts and Techniques. Morgan Kaufman; 2012.
[56] Hand D, Mannila H, Smyth P. Principles of Data Mining. Cambridge: MIT Press; 2001.
[57] Giudici P. Applied data mining: statistical methods for business and industry. New York: John Wiley & Sons Ltd; 2003.