This project was implemented during the Udacity FWD Data Analysis Nanodegree Program
A person makes a doctor appointment, receives all the instructions and no-show. Who to blame?
- Q1: Which certain gender show more commitment to their medical appointment that the other?
- Q2: Does patient's age correlate with them showing up to the appointment?
- Q3: Does the patient's Scholarship availability affect their decision to attend their medical appointment?
- Q4: Does patient's disease affect their decision to appear to the appointment?
- Q5: Most requent neighborhood, and day for appointments and scheduling?
- Q6: Duration between scheduling and appointment days, in terms of: minimum, maximum, etc..?
The dataset didn't require much handeling, as no missing values, nor duplicates, etc.. were found.
Some minor issues was detected:
- Issue 1: Some column names needs handling -> re-formated for neat and easy work
- Issue 2: Data types of some attribute
- Patient_ID is float -> converted to string
- Appointment_ID is int -> converted to string
- ScheduledDay and AppointmentDay are strings -> converted to datetime format
- Issue 3: Age attribute has negative values
Accordingly to the above figure, it is detectable that Female patients represent more than 50% of the data. Hence why, we need precise measuring when it comes to patient's gender
-
Cross tabulation of gender againtst No-Show variable:
No-Show/Gender No Yes F 0.796851 0.203149 M 0.800321 0.199679
Q3: Does the patient's Scholarship availability affect their decision to attend their medical appointment?
-
Cross tabulation of scholarship against No-Show variable:
No-Show/Scholarship No Yes 0 0.801926 0.198074 1 0.762637 0.237363
Patients with schoalrship:
- Around 80% of these patients attend their appointment
- 19% of them didn't show up
Patients without schoalrship:
- Around 76% of these patients attend their appointment
- Around 23% of them didn't show up
- Diabetes:
7.18% of patients have diabetes
-
Cross tabulation of diabetic/non-diabetic against No-Show variable
No-Show/Diabetes No Yes 0 0.796370 0.203630 1 0.819967 0.180033 -
Hypertension:
19.7% of patients have hypertension
-
Cross tabulation of patients with/without hypertension
No-Show/Hypertension No Yes 0 0.790961 0.209039 1 0.826980 0.173020 -
Alcoholism:
3% of patients are alcohol addicts
-
Cross tabulation of patients addicted/un-addicted to alcohol
No-Show/Alcoholism No Yes 0 0.798052 0.201948 1 0.798512 0.201488
- Dataset contained 81 different neighbourhoods
- The top 3 frequent neighbourhoods in appointments and scheduling:
Most frequent day for scheduling is Tuseday
Most frequent for appointments is Wednesday
We can also detect that Saturday is the least frequent day for both scheduling and appointments.
- Minimum waiting time = 0 days with 38562 patients
- Maximum waiting time = 179 days with 10 pateints
That alot!! A patient might wait for 179 days to get their appointment! Fortunately, it present rarely in the data, which means it's not the usual situation in appointments duration.
Now lets see if patient/s who waited this long, ended up attending their medical appointment of not? and vise versa, did patients who got immediate appointment, ended up commiting to their appointment?
- There are 10 patients who waited 179 days to attend their appointment.
- Among these patients, 80% of them attended their medical appointment despite the long wait, and 20% tend to not show up to the appointment.
Now lets see if patients who get immediate scheduling commit to their appointment:
We can finally say that, the average waiting time for patients to get their appointments is 10 days, 4 hours, and 24 minutes.
- 79.8% of patients showed up to their appointmens, and 20.2% didn't attend.
- Both female and male patients has almost the same commitment to attend their medical appointment.
- Most of the patients is of a young age
- Most of the patients do not hold a scholarship with percentage of 90.2%. While, 9.8% of the patients hold a scholarship. Both of these sections show almost similar commitment to their appointments.
- 7.18% of patients have diabetes: 81.9% of them showed up to their appointment, while, around 18% of them didn't attend.
- 19.7% of patients have hypertension: 82.6% of them showed up to their appointment, while, 17.3% of them didn't attend.
- 3% of patients are alcoholism: 79.8% of them showed up to their appointment; while, 0.2% of them didn't attend.
- Top 3 neighbourhoods frequently presented.
- Most frequent day for scheduling is Tuseday.
- Most frequent for appointments is Wednesday.
- Most patients patients might wait for a few seconds, or even get the appointment immediately.
- A patient might wait for 179 days to get their appointment! Fortunately, it present rarely in the data, which means it's not the usual situation in appointments duration. Despite that, 80% of them show up to their appointments.
- The average waiting time for patients to get their appointments is 10 days, 4 hours, and 24 minutes.