Improving Data Quality in the Survey of Graduate Students and - - PowerPoint PPT Presentation

improving data quality in the survey of graduate students
SMART_READER_LITE
LIVE PREVIEW

Improving Data Quality in the Survey of Graduate Students and - - PowerPoint PPT Presentation

Improving Data Quality in the Survey of Graduate Students and Postdoctorates in Science and Engineering (GSS) Peter Einaudi, RTI International Jonathan Gordon, RTI International Stephanie Eckman, RTI International Herschel Sanders, RTI


slide-1
SLIDE 1

www.rti.org

RTI International is a registered trademark and a trade name of Research Triangle Institute.

Improving Data Quality in the Survey of Graduate Students and Postdoctorates in Science and Engineering (GSS)

Peter Einaudi, RTI International Jonathan Gordon, RTI International Stephanie Eckman, RTI International Herschel Sanders, RTI International Mike Yamaner, National Science Foundation

slide-2
SLIDE 2

What is the GSS?

  • Survey of Graduate Students

and Postdoctorates in Science and Engineering (GSS)

– Sponsored by the National Center

for Science and Engineering Statistics (NCSES) within the National Science Foundation (NSF) and the National Institutes

  • f Health (NIH)
  • Establishment survey of all

U.S. academic institutions that grant graduate degrees in science, engineering, and health (SEH) fields

  • Institutions report enrollment

and financial support data for graduate students, postdoctoral researchers, and doctorate- holding nonfaculty researchers (NFRs)

slide-3
SLIDE 3

Sample data grid

  • Voluntary survey
  • High response rates

– 98.6% RR3 in 2016

  • Burdensome

– 56.4 hours average

  • Electronic Data

Interchange (EDI) limited prior to 2016

slide-4
SLIDE 4

Key Challenges

  • Improving the value of the data

– Users want to analyze master’s and

doctoral data separately

  • Aligning taxonomy of disciplines

across federal surveys

  • Ensuring all eligible units are reported

and classified appropriately

  • Mitigating response burden and

maintaining high response rates

slide-5
SLIDE 5

Goals of Redesign

Collect more data Reduce Burden Improve Quality

  • Separate master’s and doctoral

enrollment data

  • Increase use of EDI
  • Simplify coding of academic units
  • Eliminate data entry errors
  • Align request with administrative data
  • Improve coding of disciplines
slide-6
SLIDE 6

Proposed Solutions

  • Simplify and expand use of EDI

– Align data request with

respondents’ administrative data systems

– Reduce reliance on manual

data entry

– Facilitate separate master’s

and doctoral reporting without added burden

  • Replace GSS codes with

Classification of Instructional Programs (CIP)

– Already utilized and maintained

by institutions for mandatory reporting

– Improved comparability with

  • ther federal surveys

– More granular coding scheme – Shifts burden of taxonomy

changes from data reporters to data collectors

slide-7
SLIDE 7

Fall 2015– Summer 2016

Institutional site visits Recordkeeping and data reporting practices Coordinator survey Understand impact of changes

Fall 2016– Summer 2017

Pilot Feasibility Estimate reporting burden

Fall 2017– Summer 2018

Full implementation

slide-8
SLIDE 8

Fall 2015– Summer 2016

Institutional site visits Recordkeeping and data reporting practices Coordinator survey Understand impact of changes

Spring 2016– Summer 2017

Pilot Feasibility Estimate reporting burden

Fall 2017– Summer 2018

Full implementation

slide-9
SLIDE 9

What we learned

  • Coordinators can distinguish

between students enrolled in master’s and doctoral programs in their records

  • CIP codes available in

student information systems

– Codes less available to

report postdocs

  • Most coordinators willing to

use EDI

– Concerns about programming

effort required

– Privacy concerns about

transmitting individual-level data

slide-10
SLIDE 10

Integrating lessons learned into the Redesign

  • Flexibility in field coding method

– CIP codes strongly encouraged for

student data

– GSS or CIP codes for

postdoctoral data

  • Make EDI as simple as possible

– Excel templates – Eliminate additional programming

steps to format data

  • Allay concerns about privacy

– Avoid Personally Identifiable

Information

– Option to aggregate data

before transmission

slide-11
SLIDE 11

Fall 2015– Summer 2016

Institutional site visits Recordkeeping and data reporting practices Coordinator survey Understand impact of changes

Spring 2016– Summer 2017

Pilot Feasibility Estimate reporting burden

Fall 2017– Summer 2018

Full implementation

slide-12
SLIDE 12

Trial Run: Pilot Data Collection GSS 2016

  • Ran parallel to GSS 2016

survey cycle

– Stratified random sample of

coordinators

  • Master’s-only institutions

(n = 15)

  • Doctorate-granting institutions

with 15 or fewer units (n = 25)

  • Doctorate-granting institutions

with over 15 units (n = 40)

– Data request included:

  • Separate reporting of master’s

and doctoral students

  • Use of CIP codes to report

student data

  • Use of EDI to transmit data
slide-13
SLIDE 13

Pilot Results

  • 98.7% of coordinators

able to upload at least some data

  • All schools with both

master’s and doctoral students able to report these data separately

  • Nearly 90% of schools

reported student data with CIP codes

  • Most coordinators reported

similar or lower response burden compared to the previous year

slide-14
SLIDE 14

Average response burden for pilot coordinators

slide-15
SLIDE 15

Fall 2015– Summer 2016

Institutional site visits Recordkeeping and data reporting practices Coordinator survey Understand impact of changes

Spring 2016– Summer 2017

Pilot Feasibility Estimate reporting burden

Fall 2017– Summer 2018

Full implementation

slide-16
SLIDE 16

Preparation for full implementation

  • Targeted Communication

– Mail – E-mail – Conference presentations

  • Redesign of Survey Website

– Highlight changes – Taxonomy tool

  • Training

– Webinars – Training videos – Sandbox

slide-17
SLIDE 17

2017 Response method

slide-18
SLIDE 18

Average response burden

slide-19
SLIDE 19

Impact on response

  • Response rates declined…

but more data

– Increase in school-level

nonresponse

  • Item nonresponse declined

– Despite increase in items

  • Lower item nonresponse

with EDI

slide-20
SLIDE 20

Item nonresponse rates by section and response method

slide-21
SLIDE 21

Discussion

  • Understand your

respondents and their data systems

  • Design around

constraints: one size does not fit all

  • Pilot your changes
  • Communicate early

and often

slide-22
SLIDE 22

Thank You

Peter Einaudi Jonathan Gordon Mike Yamaner peinaudi@rti.org jgordon@rti.org myamaner@nsf.gov