
The Effect of Imbalanced Classes on Students' Academic Performance Prediction: An Evaluation Study

Osama Mohammed El-Deeb, Walid Elbadawy, Doaa Saad Elzanfaly

Source Title: International Journal of e-Collaboration (IJeC) 18(1)

DOI: 10.4018/IJeC.304373

Abstract

Class imbalance poses particular challenges in educational data mining because most datasets collected from educational records are imbalanced by nature: some classes dominate others and cause biased predictions. This paper studies the effects of imbalanced classes on the performance of seven classifiers: J48, Random Forest, k-Nearest Neighbors, Naïve Bayes, Random Tree, SVM, and Linear Regression. Moreover, the effectiveness of the SMOTE technique for handling imbalanced data is evaluated against these classifiers. This is done through the proposal of an early predictive model that predicts students' academic performance and recommends an appropriate department in a multi-disciplinary institute. According to our results, the Random Forest technique performs best, achieving the highest accuracy of 94.585%.

Introduction

Educational Data Mining is a data mining field that aims to derive useful information from raw data obtained from educational systems (Rawat & Malhan, 2019). This information can be used to analyze student performance more closely and so improve the decision-making process. One of the most challenging problems of educational data is its distribution, which has exceptional characteristics over time. Among these characteristics is the imbalanced class distribution (Member & Fellow, 2012).

An imbalanced class distribution is characterized by the ratio of the number of samples in the majority class to the number in the minority class (Anjana & Sardana, 2017). The literature offers different techniques for handling the class imbalance problem, of which oversampling and under-sampling are the most common (Mohammed, Rawashdeh, & Abdullah, 2020). However, most of these techniques deal with binary class imbalance, and only a few studies address multi-class imbalance. Multi-class imbalance occurs when the target variable consists of more than two classes with unequal sample sizes (Moubayed, Injadat, Shami, & Lutfiyya, 2018; Wang & Yao, 2012). Techniques that are commonly used for handling binary-class imbalance may become inefficient for multi-class imbalance.
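The ratio and the oversampling idea described above can be illustrated with a minimal sketch. It assumes plain Python lists; the toy labels, the function names `imbalance_ratio` and `random_oversample`, and the choice of balancing every class up to the largest one are illustrative assumptions, not the authors' procedure.

```python
import random
from collections import Counter


def imbalance_ratio(labels):
    """Size of the largest class divided by the size of the smallest class."""
    counts = Counter(labels)
    return max(counts.values()) / min(counts.values())


def random_oversample(rows, labels, seed=0):
    """Duplicate randomly chosen rows of each smaller class until every
    class has as many samples as the largest class (multi-class safe)."""
    rng = random.Random(seed)
    counts = Counter(labels)
    target = max(counts.values())
    out_rows, out_labels = list(rows), list(labels)
    for cls, n in counts.items():
        pool = [r for r, y in zip(rows, labels) if y == cls]
        for _ in range(target - n):
            out_rows.append(rng.choice(pool))
            out_labels.append(cls)
    return out_rows, out_labels


# Hypothetical toy data: three classes with unequal sample sizes.
labels = ["A"] * 8 + ["B"] * 3 + ["C"] * 1
rows = [[float(i)] for i in range(len(labels))]

ratio_before = imbalance_ratio(labels)
new_rows, new_labels = random_oversample(rows, labels)
ratio_after = imbalance_ratio(new_labels)

print(ratio_before, ratio_after)
print(Counter(new_labels))
```

Random oversampling only repeats existing samples, so it balances class counts without adding new information; that limitation is one reason interpolation-based methods are often considered instead.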

The purpose of this paper is three-fold. First, it explores different techniques for handling the imbalanced dataset and evaluates their effects on the accuracy of predicting students' academic performance. Second, it proposes a predictive model for the performance of students at an early academic stage in a multi-disciplinary institute. Third, the model recommends a study path for each student based on their performance. The main goal of predicting academic performance is to reduce the risk of student dropout, the risk of course failure, and poor graduation rates. Most current studies focus only on the prediction part. In this paper, however, the authors add a recommendation part to guide students when choosing their specialization. This recommendation will help students make better decisions about their educational path and enhance their performance. The proposed model is based on a real dataset gathered from the Giza Higher Institute of Management Sciences. The Random Forest classifier achieved the best results among the classifiers after the class imbalance problem in the collected dataset was handled.
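The abstract names SMOTE as the technique evaluated for handling the imbalanced data. As a rough illustration of the idea behind it, the sketch below creates synthetic minority samples by interpolating between a minority sample and one of its nearest minority-class neighbours. It is a simplified stand-in, not the authors' implementation or a library API: the function name `smote_like`, the parameter `k`, and the toy points are all hypothetical.

```python
import math
import random


def smote_like(minority, n_new, k=2, seed=0):
    """Generate n_new synthetic points, each placed on the line segment
    between a random minority sample and one of its k nearest neighbours
    from the same class. Requires at least two minority samples."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # Nearest neighbours of `base` among the other minority samples.
        others = [p for p in minority if p is not base]
        neighbours = sorted(others, key=lambda p: math.dist(base, p))[:k]
        neighbour = rng.choice(neighbours)
        gap = rng.random()  # interpolation factor in [0, 1)
        synthetic.append([b + gap * (n - b) for b, n in zip(base, neighbour)])
    return synthetic


# Hypothetical minority-class samples with two numeric features.
minority = [[1.0, 1.0], [2.0, 1.5], [1.5, 3.0]]
synthetic = smote_like(minority, n_new=5)

for point in synthetic:
    print(point)
```

Because each synthetic point lies between two real minority samples, the new data stays inside the region the minority class already occupies, rather than merely duplicating existing rows.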

Machine learning (ML) is a part of artificial intelligence (AI) that uses data to improve its performance. Machine learning algorithms are used in many fields, such as speech recognition, image classification, text recognition, and educational data mining. They play an important role in computer science because they are trained on data to make predictions and classifications. The authors use machine learning algorithms in educational data mining to predict student performance and recommend student specialization through regression and classification. Accordingly, in this paper, the authors train and evaluate the J48, Random Forest, and Naïve Bayes classifiers to recommend student specialization, and the Random Forest, Linear Regression, and K-Nearest Neighbor regression models to predict student performance.

The rest of this paper is organized as follows. Section 2 highlights the research significance. Section 3 presents related work on predicting student performance and handling imbalanced educational data. Section 4 outlines the proposed model, covering the methods applied to handle imbalanced data, the choice of appropriate techniques for class imbalance, and the classification techniques. Section 5 reports the performance of the classifiers before and after a resampling technique is applied to handle the imbalanced data and enhance the accuracy of predicting student performance. Finally, Section 6 presents the conclusion and future work.
