Graduate Student Satisfaction Exit Surveys

Photo by Nik on Unsplash

This post was originally designed because I was interested in working on student experience exit survey data from my department to see if there was a change from 2012 to 2015. These questions are given to every student that graduates from a graduate program at the University of Oregon (UO). This includes responses for terminal masters degrees, students that get masters degrees and move on to a doctoral degree, and the same potential students that respond again when they get their doctorate. This data is open to any UO student, staff, or faculty member that has login information for this data.

This data ended up becoming a time commitment as there was no efficient way to collect data from the pdf files for each college at the UO. An example can be seen here. One great resource for collecting data from pdfs was to use the pdftools package, but if you look at the example link provided above, the UO Graduate School decided to color code cells in the table, which threw off any function to extract all the values in an efficient manner. Anyway…

The data and other existing data files can be found here. When I have some more free time, I may decide to join the other datasets to the student experience data to examine some more interesting questions regarding this data. But for now, lets look at the student experience data.

Code

library(tidyverse)

theme_set(theme_minimal())

exit <- read_csv(here::here("posts", "2021-04-30-grad-student-exit-surveys/exit_data.csv")) |> 
  janitor::clean_names() |>
  mutate(
    program = str_replace_all(program, "_", " "),
    program = str_to_title(program)
  )

These exit surveys have several questions that are broken down into percentages about how many of the students agreed or disagreed with the statement. For instance, from the pdf, the first statement is Quality of the faculty in a student’s department. So we can look at that with this first plot. At the same time, we can also look at the difference between the two years of data. In order to look at all the variables at the same time that have the starting string of fac_qual, I’ll use pivot_longer to collect any variable that has that variable string about faculty quality. Since the first and second table on the pdf refer to excellent or good or excellent levels of student satisfaction about faculty quality, I decided to filter out the excellent student satisfaction and move on with only student satisfaction that is either good or excellent.

Code

exit |> 
  pivot_longer(
    matches(
      "^fac_qual"
    ),
    names_to = "fac_qual",
    values_to = "fac_values"
  ) |> 
  filter(fac_qual == "fac_qual_ex_good") |>
  ggplot(aes(fct_reorder(program, fac_values), fac_values)) +
  geom_col(aes(fill = as.factor(year)), position = "dodge2") +
  labs(title = "Student Experiences by Academic Program",
       x = "",
       y = "Specific Student Experience",
       caption = "Ex = Excellent") +
  coord_flip() +
  facet_wrap(~fac_qual, scales = "free") +
  theme(legend.position = "bottom",
        legend.title = element_blank())

Code

exit |> 
  pivot_longer(
    matches(
      "^fac_qual"
    ),
    names_to = "fac_qual",
    values_to = "fac_values"
  ) |> 
  filter(fac_qual == "fac_qual_fair_poor") |>
  ggplot(aes(fct_reorder(program, fac_values), fac_values)) +
  geom_col(aes(fill = as.factor(year)), position = "dodge2") +
  labs(title = "Student Experiences by Academic Program",
       x = "",
       y = "Specific Student Experience",
       caption = "Ex = Excellent") +
  coord_flip() +
  facet_wrap(~fac_qual, scales = "free") +
  theme(legend.position = "bottom",
        legend.title = element_blank())

So the first shot at making a visual for the two years looks a little cluttered because of using geom_col(). My first decision was to remove the columns and change those to points to make it a little less cluttered and clearer. I already enjoyed the way this looked better. I also decided to clean some things up by changing the names of the variables to better describe what the variables were assessing. I also decided to go back and change the programs to be title case and with spaces rather than underscores.

Code

exit |> 
  pivot_longer(
    matches(
      "^fac_qual"
    ),
    names_to = "fac_qual",
    values_to = "fac_values"
  ) |> 
  filter(fac_qual == "fac_qual_ex_good") |>
  mutate(fac_qual = recode(fac_qual, "fac_qual_ex_good" = "Excellent/Good Faculty Quality",
                           "fac_qual_fair_poor" = "Fair/Poor Faculty Quality")) |> 
  ggplot(aes(fct_reorder(program, fac_values), fac_values)) +
  geom_point(aes(color = as.factor(year), shape = as.factor(year)), size = 2) +
  labs(title = "Faculty Quality by Academic Program",
       x = "",
       y = "Faculty Quality",
       caption = "Data from University of Oregon's (UO)\nstudent satisfaction surveys after graduation") +
  coord_flip() +
  facet_wrap(~fac_qual, scales = "free") +
  scale_color_manual(values = c("#d74122","#669b3e")) +
  theme(legend.position = "bottom",
        legend.title = element_blank())

Code

exit |> 
  pivot_longer(
    matches(
      "^fac_qual"
    ),
    names_to = "fac_qual",
    values_to = "fac_values"
  ) |> 
  filter(fac_qual == "fac_qual_fair_poor") |>
  mutate(fac_qual = recode(fac_qual, "fac_qual_ex_good" = "Excellent/Good Faculty Quality",
                           "fac_qual_fair_poor" = "Fair/Poor Faculty Quality")) |> 
  ggplot(aes(fct_reorder(program, fac_values), fac_values)) +
  geom_point(aes(color = as.factor(year), shape = as.factor(year)), size = 2) +
  labs(title = "Faculty Quality by Academic Program",
       x = "",
       y = "Faculty Quality",
       caption = "Data from University of Oregon's (UO)\nstudent satisfaction surveys after graduation") +
  coord_flip() +
  facet_wrap(~fac_qual, scales = "free") +
  scale_color_manual(values = c("#d74122","#669b3e")) +
  theme(legend.position = "bottom",
        legend.title = element_blank())

Just in case anyone else is interested in this data, I also created a quick function to see how this visual looked like for other variables in the dataset. For instance, I’ll look at a couple of different variables.

Code

program_experience_agree <- function(name){
  exit |> 
    pivot_longer(
      matches(
          {{name}}
      )
    ) |>
    # filter(name != paste0({{name}}, "_ex") &
    #          name != paste0({{name}}, "_strong")) |> 
  filter(str_detect(name, "_agree")) |>
  ggplot(aes(fct_reorder(program, value), value)) +
  geom_point(aes(color = as.factor(year), shape = as.factor(year)), size = 2) +
  labs(title = "Student Experiences by Academic Program",
       x = "",
       y = "") +
  coord_flip() +
  # facet_wrap(~name, scales = "free") +
  scale_color_manual(values = c("#d74122","#669b3e")) +
  theme(legend.position = "bottom",
        legend.title = element_blank())
}

program_experience_disagree <- function(name){
  exit |> 
    pivot_longer(
      matches(
          {{name}}
      )
    ) |>
    # filter(name != paste0({{name}}, "_ex") &
    #          name != paste0({{name}}, "_strong")) |> 
  filter(str_detect(name, "_disagree")) |>
  ggplot(aes(fct_reorder(program, value), value)) +
  geom_point(aes(color = as.factor(year), shape = as.factor(year)), size = 2) +
  labs(title = "Student Experiences by Academic Program",
       x = "",
       y = "") +
  coord_flip() +
  # facet_wrap(~name, scales = "free") +
  scale_color_manual(values = c("#d74122","#669b3e")) +
  theme(legend.position = "bottom",
        legend.title = element_blank())
}

Below are all the variables from the dataset.

  [1] "year"                          "program"                      
  [3] "number_respondents"            "fac_qual_ex"                  
  [5] "pro_qual_ex"                   "money_sup_ex"                 
  [7] "field_dev_pace_ex"             "advising_qual_ex"             
  [9] "smart_community_ex"            "prof_dev_ex"                  
 [11] "equipment_ex"                  "grad_involve_ex"              
 [13] "research_opp_ex"               "grad_fair_assess_ex"          
 [15] "promote_inclu_ex"              "grant_train_ex"               
 [17] "teach_prep_ex"                 "grad_clear_assess_ex"         
 [19] "inter_sup_ex"                  "prof_ethic_train_ex"          
 [21] "fac_qual_ex_good"              "pro_qual_ex_good"             
 [23] "money_sup_ex_good"             "field_dev_pace_ex_good"       
 [25] "advising_qual_ex_good"         "smart_community_ex_good"      
 [27] "prof_dev_ex_good"              "equipment_ex_good"            
 [29] "grad_involve_ex_good"          "research_opp_ex_good"         
 [31] "grad_fair_assess_ex_good"      "promote_inclu_ex_good"        
 [33] "grant_train_ex_good"           "teach_prep_ex_good"           
 [35] "grad_clear_assess_ex_good"     "inter_sup_ex_good"            
 [37] "prof_ethic_train_ex_good"      "fac_qual_fair_poor"           
 [39] "pro_qual_fair_poor"            "money_sup_fair_poor"          
 [41] "field_dev_pace_fair_poor"      "advising_qual_fair_poor"      
 [43] "smart_community_fair_poor"     "prof_dev_fair_poor"           
 [45] "equipment_fair_poor"           "grad_involve_fair_poor"       
 [47] "research_opp_fair_poor"        "grad_fair_assess_fair_poor"   
 [49] "promote_inclu_fair_poor"       "grant_train_fair_poor"        
 [51] "teach_prep_fair_poor"          "grad_clear_assess_fair_poor"  
 [53] "inter_sup_fair_poor"           "prof_ethic_train_fair_poor"   
 [55] "encourage_agree"               "idea_resp_agree"              
 [57] "construct_feed_agree"          "time_feed_agree"              
 [59] "avail_agree"                   "career_sup_agree"             
 [61] "stu_equit_agree"               "ethic_emp_agree"              
 [63] "help_secure_fund_agree"        "help_prof_dev_agree"          
 [65] "publish_help_agree"            "encourage_intel_diff_agree"   
 [67] "comfort_talk_issue_agree"      "encourage_disagree"           
 [69] "idea_resp_disagree"            "construct_feed_disagree"      
 [71] "time_feed_disagree"            "avail_disagree"               
 [73] "career_sup_disagree"           "stu_equit_disagree"           
 [75] "ethic_emp_disagree"            "help_secure_fund_disagree"    
 [77] "help_prof_dev_disagree"        "publish_help_disagree"        
 [79] "encourage_intel_diff_disagree" "comfort_talk_issue_disagree"  
 [81] "collegial_strong"              "encouraging_strong"           
 [83] "supportive_strong"             "intel_open_strong"            
 [85] "inter_open_strong"             "inclu_stu_color_strong"       
 [87] "inclu_gender_strong"           "inclu_intern_stu_strong"      
 [89] "inclu_stu_disab_strong"        "inclu_first_gen_strong"       
 [91] "inclu_stu_sex_orient_strong"   "collegial_agree"              
 [93] "encouraging_agree"             "supportive_agree"             
 [95] "intel_open_agree"              "inter_open_agree"             
 [97] "inclu_stu_color_agree"         "inclu_gender_agree"           
 [99] "inclu_intern_stu_agree"        "inclu_stu_disab_agree"        
[101] "inclu_first_gen_agree"         "inclu_stu_sex_orient_agree"   
[103] "collegial_disagree"            "encouraging_disagree"         
[105] "supportive_disagree"           "intel_open_disagree"          
[107] "inter_open_disagree"           "inclu_stu_color_disagree"     
[109] "inclu_gender_disagree"         "inclu_intern_stu_disagree"    
[111] "inclu_stu_disab_disagree"      "inclu_first_gen_disagree"     
[113] "inclu_stu_sex_orient_disagree"

Code

# student equitable treatment
program_experience_agree(name = "stu_equit")

Code

program_experience_disagree(name = "stu_equit")

Code

# inclusive of students of color
program_experience_agree(name = "inclu_stu_color")

Code

program_experience_disagree(name = "inclu_stu_color")

Code

# inclusive of gender
program_experience_agree(name = "inclu_gender")

Code

program_experience_disagree(name = "inclu_gender")

Code

# inclusive of international students
program_experience_agree(name = "inclu_intern_stu")

Code

program_experience_disagree(name = "inclu_intern_stu")

Code

# inclusive of students with disabilities
program_experience_agree(name = "inclu_stu_disab")

Code

program_experience_disagree(name = "inclu_stu_disab")

Code

# inclusive of first generation students
program_experience_agree(name = "inclu_first_gen")

Code

program_experience_disagree(name = "inclu_first_gen")

Code

# inclusive of students of all sexual orientations
program_experience_agree(name = "inclu_stu_sex_orient")

Code

program_experience_disagree(name = "inclu_stu_sex_orient")

Lastly, I decided to look into the difference between the variables I’m most interested in. First, I wanted to look at how graduate students perceive inclusiveness of students of color within their departments. Another variable I was interested in was inclusiveness of first-generation graduate students. Thanks to the plotly package I was able to include some interactive components to the visuals. Specifically zooming in to specific departments give a better idea of the difference between agreeing and disagreeing on these topics. With plotly, you can also click on an option in the legend to only see those values. I also removed the strongly agree option since the agree applied to students that strongly agreed or agreed with the statement.

Code

library(plotly)

stu_color <- exit |> 
  pivot_longer(
    matches(
      "^inclu_stu_color"
    ),
    names_to = "stu_color",
    values_to = "stu_color_values"
  ) |>
  filter(stu_color != "inclu_stu_color_strong") |> 
  mutate(stu_color = recode(stu_color, "inclu_stu_color_agree" = "Agree with Inclusive Environment for Students of Color",
                           "inclu_stu_color_disagree" = "Disagree with Inclusive Environment for Students of Color")) |> 
  ggplot(aes(fct_reorder(program, stu_color_values), stu_color_values)) +
  geom_point(aes(color = as.factor(year), shape = as.factor(stu_color)), size = 2) +
  labs(title = "Faculty Quality by Academic Program",
       x = "",
       y = "Faculty Quality",
       caption = "Data from University of Oregon's (UO)\nstudent satisfaction surveys after graduation") +
  coord_flip() +
  scale_color_manual(values = c("#d74122","#669b3e"))

stu_plot <- ggplotly(stu_color)
  # layout(legend = list(orientation = "h",
                       # xanchor = "center",
                       # x = 0,
                       # y = -60)) 
stu_plot

Code

firstgen <- exit |> 
  pivot_longer(
    matches(
      "^inclu_first_gen"
    ),
    names_to = "first_gen",
    values_to = "first_gen_values"
  ) |>
  filter(first_gen != "inclu_first_gen_strong") |> 
  mutate(first_gen = recode(first_gen, "inclu_first_gen_agree" = "Agree with Inclusive Environment for First Gen",
                           "inclu_first_gen_disagree" = "Disagree with Inclusive Environment for First Gen")) |> 
  ggplot(aes(fct_reorder(program, first_gen_values), first_gen_values)) +
  geom_point(aes(color = as.factor(year), shape = as.factor(first_gen)), size = 2) +
  labs(title = "Faculty Quality by Academic Program",
       x = "",
       y = "Faculty Quality",
       caption = "Data from University of Oregon's (UO)\nstudent satisfaction surveys after graduation") +
  coord_flip() +
  scale_color_manual(values = c("#d74122","#669b3e"))

first_plot <- ggplotly(firstgen) 
  # layout(legend = list(orientation = "h",
  #                      xanchor = "center",
  #                      x = 0,
  #                      y = -60)) 
first_plot