Threats to the Validity of Locally Developed Multiple-Choice Tests in Medical Education: Construct-Irrelevant Variance and Construct Underrepresentation

Downing, Steven M.

doi:10.1023/A:1021112514626

Published: November 2002

Volume 7, pages 235–241, (2002)

Advances in Health Sciences Education

Abstract

Construct-irrelevant variance (CIV), the erroneous inflation or deflation of test scores due to certain types of uncontrolled or systematic measurement error, and construct underrepresentation (CUR), the under-sampling of the achievement domain, are discussed as threats to the meaningful interpretation of scores from objective tests developed for local medical education use. Several sources of CIV and CUR are discussed and remedies are suggested. Test score inflation or deflation, due to the systematic measurement error introduced by CIV, may result from poorly crafted test questions, insecure test questions and other types of test irregularities, testwiseness, guessing, and test item bias. Using indefensible passing standards can interact with test scores to produce CIV. Sources of content underrepresentation are associated with tests that are too short to support legitimate inferences to the domain and that are composed of trivial questions written at low levels of the cognitive domain. "Teaching to the test" is another frequent contributor to CUR in examinations used in medical education. Most sources of CIV and CUR can be controlled or eliminated from the tests used at all levels of medical education, given proper training and support of the faculty who create these important examinations.


Author information

Authors and Affiliations

Department of Medical Education (MC 591), University of Illinois at Chicago, College of Medicine, 808 South Wood Street, Chicago, IL, 60612-7309, USA
Steven M. Downing


About this article

Cite this article

Downing, S.M. Threats to the Validity of Locally Developed Multiple-Choice Tests in Medical Education: Construct-Irrelevant Variance and Construct Underrepresentation. Adv Health Sci Educ Theory Pract 7, 235–241 (2002). https://doi.org/10.1023/A:1021112514626

Issue Date: November 2002
DOI: https://doi.org/10.1023/A:1021112514626

Keywords: construct-irrelevant variance
