AAS Statement on Limiting the Use of GRE Scores in Graduate Admissions in the Astronomical Sciences

Adopted 4 January 2016

Each year, roughly 55,000 physical science majors take the Graduate Record Exam (GRE)[1] and 5,000[2] take the Physics Subject Exam (PGRE). Both the GRE and PGRE are widely used in the astronomical community as a metric to rank graduate talent. Most US graduate programs in the astronomical sciences require the GRE and PGRE to evaluate applicants. In addition, GRE scores are required by several major fellowships and are used to rank graduate programs by organizations such as US News and World Report[3] and the National Research Council[4].

The evidence, however, suggests that GRE and PGRE scores are poor predictors of success in graduate study in the astronomical sciences. Glanz (1996)[5] demonstrated that GRE scores are weakly correlated with average grades in graduate physics courses at Harvard University. Sternberg & Williams (1997)[6] demonstrated that GRE scores fail to correlate with several key skills for graduate study, including analytical thinking, creativity, research acumen and teaching, and correlate only modestly with first-year grade point average. Preliminary research indicates similarly weak predictive power for the PGRE[7]. To be clear, the predictive power of these exams is not zero; longitudinal meta-analytic studies do find statistically significant linear correlation coefficients at the 0.1-0.2 level between test scores and long-term outcomes such as citations and scholarly output decades later. However, these correlations emerge only through multivariate analyses that control for the more dominant correlations of test scores with demographic variables — systematics for which graduate admissions committees rarely correct quantitatively.

Indeed, because the tests have such strong systematics, the use of GRE and PGRE scores as a measure of potential success has well-documented and powerful effects on the demographics of the resulting graduate cohorts. Halley et al. (1991)[8] showed that GRE performance correlates with whether the undergraduate institution has a graduate program, implicitly penalizing students from many liberal arts colleges. Research by the Education Testing Service (ETS), and more recently by Miller & Stassun (2014)[9], demonstrate that GRE scores correlate with demographic characteristics unrelated to potential for graduate study, such as gender, race and socioeconomic status. These correlations persist even in the GRE's recently revised general test[10]. These demographic correlations are a feature of standardized exams more generally (e.g., Helms 2009)[11] and may well be the result of stereotype threat, the fear of confirming negative stereotypes about one's own group (Steele & Aronson 1995)[12],[13]. Miller & Stassun show that misusing GRE scores, particularly by establishing score thresholds, fuels the underrepresentation of white women and minorities in graduate programs. ETS itself states, “A cutoff score [on the GRE] should never be used as the only criterion for denial of admission or awarding of a fellowship.”

A third issue with the GRE exam is its financial burden on test takers. Students currently pay $195 to take the GRE[14] and $150 to take the PGRE[15], as well as $27 for each institution/fellowship they designate to receive an official score beyond an initial four. Considering that students often take these exams multiple times (particularly the PGRE) and apply to 5-10 graduate programs, these tests require a significant investment. While ETS has a Fee Reduction Program[16] that covers 50% of exam costs, it applies to a single test and has stringent eligibility requirements. Fulfilling the GRE requirement is thus beyond the means of many students.

Based on this research, several physics and astronomy graduate programs and fellowships, notably the NSF Graduate Research Fellowship Program (GRFP) and the Ford Foundation Fellowship, have dropped the GRE and/or PGRE from their admissions or application requirements[17]. The National Society of Hispanic Students (NSHP) recently called for a critical reevaluation of the use of the GRE as an admissions metric[18]. Nevertheless, Miller (2013)[19] found that 96% of physics programs retain them, and over half specify cutoffs. As an alternative, some programs have begun to incorporate measures of non-cognitive skills (e.g., structured interviews that specifically assess these skills[20]) as less biased and much stronger predictors of potential for long-term success.

Recommendation: Given the research indicating that the GRE and PGRE are poor predictors of graduate student success, that their use in graduate admissions has a particularly negative impact on underrepresented groups, and that they represent a financial burden for many students in pursuing advanced degrees in the astronomical sciences, the AAS recommends that graduate programs eliminate or make optional the GRE and PGRE as metrics of evaluation for graduate applicants. If GRE or PGRE scores are used, the AAS recommends that admissions criteria account explicitly for the known systematics in scores as a function of gender, race, and socioeconomic status, and that cutoff scores not be used to eliminate candidates from admission, scholarships/fellowships, or financial support, in accordance with ETS recommendations.

[1] http://www.ets.org/Media/Tests/GRE/pdf/gre_0809_factors_200607.pdf 
[2] http://www.ets.org/s/gre/pdf/practice_book_physics.pdf
[3] http://www.usnews.com/education/best-graduate-schools/articles/2014/03/10/how-us-news-calculated-the-2015-best-graduate-schools-rankings 
[4] http://www.nap.edu/rdp/ 
[5] Glanz, J. (1996). How Not to Pick a Physicist? Science 274, 710
[6] Sternberg, R. & Williams, W. (1997). Does the Graduate Record Examination Predict Meaningful Success in the Graduate Training of Psychologists? American Psychologist 52, 630-641 
[7] Miller, C. (2015), preliminary analysis presented at Inclusive Astronomy 2015, https://www.youtube.com/watch?v=96vJQCov8Do
[8] Halley, J. W. et al. (1991). The Graduate Record Examination as an indicator of learning of the curriculum taught to physics majors in US institutions. American Journal of Physics 59, 403 
[9] Miller, C. & Stassun, K.G. (2014). A test that fails: A standard test for admission to graduate school misses potential winners, Nature Careers 510, 303 
[10 ]https://www.ets.org/s/gre/pdf/snapshot_test_taker_data_2014.pdf
[11] Helms, J. E. (2009). Defense of tests prevents objective considerations of validity and fairness. American Psychologist 64, 283-284
[12] Steele, C.M., & Aronson, J. Stereotype Threat and the Intellectual Test Performance of African Americans. Journal of Personality and Social Psychology 69, 797 
[13] A great resource on stereotype threat is http://www.reducingstereotypethreat.org/ 
[14] http://www.ets.org/gre/revised_general/about/fees
[15] http://www.ets.org/gre/subject/about/fees/ 
[16] http://www.ets.org/gre/subject/about/fees/reduction/
[17] See http://ainsleydiduca.com/gradschoolsdontrequiregre/#Sciences for a subset of these institutions
[18] http://www.hispanicphysicists.org/news/GREandDiversity.html 
[19] http://www.aps.org/publications/apsnews/201302/backpage.cfm
[20] Stassun et al. (2011). “The Fisk-Vanderbilt Master’s-to-Ph.D. Bridge Program: Recognizing, enlisting, and cultivating unrealized or unrecognized potential in underrepresented minority students”, American Journal of Physics79, 374. See also http://fisk-vanderbilt-bridge.org/tool-kit/