Problem Statement:
The "Predict Student Dropout Rates and Academic Success" data set was created to reduce the number of dropouts and failures in higher levels of education through machine learning techniques.
Methods:
We will be prioritizing Porier’s reading data sets method in order to analyze our data. We will need to use a critical, detailed approach to analyze this data. This is what Porier’s method is good for. In order to understand our question; “Predict Students Dropout and Academic Success data” is created to reduce the number of dropouts and failures we must use Porier’smethod to analyze as well as taking a head on approach like Koopman does. We will use critical thinking by asking ourselves “what does this data tell us?” Critical thinking will allow us to bring attention to deeper meanings in the data itself.
Biggest Predictors of Success:
- Students with tuition payments on time
- Students who parents have graduated themselves or achieved an education
- Students who attend classes regularly
- Students who were given scholarships
Data Analysis:
Background info and Introduction:
Analyzing student dropout data would help us further understand the deeper message behind the dataset. It's important to get all different sides of the story, and recognizing the exact percentage of dropouts from each variable was key to analyzing our data.
We looked at data from:
Students enrolled between years: 2008–2019
Databases that were used in the case study were: Academic Management System (AMC), General Directorate of Higher Education (DGES), National Competition for Access to Higher Education (CNAES), Contemporary Portugal Database (PORDATA)
Data collected from 17 different fields of study stemming from: Advertising and Marketing Management, Journalism, Nursing, etc.