{"id":72819,"date":"2025-03-18T09:45:33","date_gmt":"2025-03-18T01:45:33","guid":{"rendered":"https:\/\/medicine.nus.edu.sg\/taps\/?post_type=issues&#038;p=72819"},"modified":"2025-07-01T08:59:37","modified_gmt":"2025-07-01T00:59:37","slug":"enhancing-standard-setting-a-judges-guide-for-the-angoff-method-in-assessing-borderline-students","status":"publish","type":"issues","link":"https:\/\/medicine.nus.edu.sg\/taps\/issues\/enhancing-standard-setting-a-judges-guide-for-the-angoff-method-in-assessing-borderline-students\/","title":{"rendered":"Enhancing standard setting: A judge&#8217;s guide for the Angoff method in assessing borderline students"},"content":{"rendered":"<p>Submitted: 14 March 2024<br \/>\r\nAccepted: 13 November 2024<br \/>\r\nPublished online: 1 April, TAPS 2025, 10(2), 91-93<br \/>\r\n<a href=\"https:\/\/doi.org\/10.29060\/TAPS.2025-10-2\/II3264\">https:\/\/doi.org\/10.29060\/TAPS.2025-10-2\/II3264<\/a><\/p>\r\n<p style=\"text-align: justify;\"><span lang=\"EN-GB\">Han Ting Jillian Yeo &amp; <\/span><span lang=\"EN-US\">Dujeepa D. Samarasekera<\/span><\/p>\r\n<p style=\"text-align: justify;\"><i><span lang=\"EN-GB\">Centre for Medical Education (CenMED), Yong Loo Lin School of Medicine, National University of Singapore, Singapore<\/span><\/i><\/p>\r\n<p style=\"text-align: center;\"><span lang=\"EN-GB\"><\/span><strong><span lang=\"EN-GB\">I. INTRODUCTION<\/span><\/strong><\/p>\r\n<p style=\"text-align: justify;\"><span lang=\"EN-GB\">Assessment is an important component of training in ensuring that graduating students are competent to provide safe and effective medical care to patients. Typically, the passing score is set as a fixed mark, but this approach does not account for the varying difficulty of exams. As a result, students who have achieved the required level of competence might fail if the exam items are particularly challenging (false negative), while students who have not attained the necessary competence might pass if the items are unusually easy (false positive). Hence, deciding on the right pass mark is important for each assessment. To mitigate this issue, criterion referenced standard setting was adopted in medical education (Norcini, 2003). It determines the minimum competence level expected of a candidate and whether a candidate would pass or fail the assessments (Norcini, 2003). The Angoff method is one of the more commonly used standard setting techniques. It is an examinee centred method and requires a panel of judges to estimate the probability that a borderline candidate would get the item correct.<\/span><\/p>\r\n<p style=\"text-align: justify;\"><span lang=\"EN-GB\">Literature have questioned the reliability of the Angoff method. Variations in pass mark have been reported when the different panels of judges were engaged (Tavakol &amp; Dennick, 2017; Taylor et al., 2017). Judges reportedly faced challenges in visualising and defining the knowledge and skills required of borderline students and hence have difficulty estimating the probability that a borderline student would answer an item correctly (Tavakol &amp; Dennick, 2017). A study by Yeates et al. (2019) also reported the complexity judges faced in the standard setting process due to interaction between the environment, individual judgments, and interaction between the judges. Such variations in pass marks might lead to unfairness to students who were meant to pass but did not due to a higher pass mark. It is of a greater concern to patient safety if students who were meant to fail passed the examination due to a lowered pass mark. \u00a0To assist the judges, a guide was developed to set standards for medical and health professions examinations using a probability estimate.<\/span><\/p>\r\n<p style=\"text-align: center;\"><span lang=\"EN-GB\"><\/span><strong><span lang=\"EN-GB\">II. DEVELOPING A GUIDE<\/span><\/strong><\/p>\r\n<p style=\"text-align: justify;\"><span lang=\"EN-GB\">Judges were to rate each item based on three criteria: relevance, frequency, and difficulty. The guide focused on these areas to assist the judges in their evaluations. The relevance of an item was rated on a 5-point scale ranging from \u201c1 \u2013 not knowing will not harm a patient\u201d to \u201c5 \u2013 not knowing will cause possible death to the patient\u201d. A highly relevant item was one which assessed a foundational knowledge or a core skill. A less relevant item assessed on knowledge or skill which was good to know or acquire but not required for progression to the next level of education. The difficulty of an item was rated on a 5-point scale ranging from \u201c1 \u2013 very easy\u201d to \u201c5 \u2013 very difficult\u201d. The difficulty of the item was dependent on the ease of understanding the item construction or the difficulty of the disease condition assessed. For instance, the inclusion of multiple comorbidities in the item stem, as opposed to one comorbidity, required the student to synthesise information before responding. The difficulty of the item was also associated with the level of learning that was assessed. Hence, an item which was assessed on application would be more challenging to the student compared to an item assessing recall. The frequency of an item was rated on a 4-point scale from \u201c1 \u2013 very rarely seen in practice of a basic doctor\u201d to \u201c4 \u2013 seen very often in practice of a basic doctor\u201d. For example, in the local context, influenza is a clinical condition commonly seen in clinical practice while tetanus is a rarer clinical condition. <\/span><\/p>\r\n<p style=\"text-align: justify;\"><span lang=\"EN-GB\">Judge\u2019s ratings of each criterion were converted into a probability estimate that a borderline candidate would get the item correct ranging from 0 to 100 percent for each item. An item with a low relevance and frequency but a high difficulty would be assigned a probability estimate between 0 to 30 percent suggesting that a borderline candidate was less likely to get the item correct. An item with a high relevance and frequency but a low difficulty would be assigned a probability estimate between 70 to 100 percent suggesting that there was a high probability a borderline candidate would get this item correct. Judges were given the freedom to assign an estimate from the range provided in the guide or to assign a probability estimate based on their own judgement or expertise.<\/span><\/p>\r\n<p style=\"text-align: center;\"><span lang=\"EN-GB\"><\/span><strong><span lang=\"EN-GB\">III. IMPLEMENTATION<\/span><\/strong><\/p>\r\n<p style=\"text-align: justify;\"><span lang=\"EN-GB\">To date, the guide was shared with judges during the Angoff standard setting sessions for the medical undergraduate assessments. The guide was given at the start of the session when calibrating judges to a similar mental model on what a borderline candidate was. Judges were free to use the guide in the decision-making process when providing a probability estimate for each item. During the calibration phase and discussion phase of the Angoff standard setting session, we observed that judges provided justifications for their probability estimates by referring to the three criteria. This was more prevalent among judges who were new to the Angoff method.\u00a0 We believed that the well-defined and objective criteria provided in the guide served as a useful framework for judges to develop a mental model on what a borderline candidate was.<\/span><\/p>\r\n<p style=\"text-align: center;\"><span lang=\"EN-GB\"><\/span><strong><span lang=\"EN-GB\">IV. LIMITATIONS AND FUTURE DIRECTIONS<\/span><\/strong><\/p>\r\n<p style=\"text-align: justify;\"><span lang=\"EN-GB\">Several limitations have been identified. While we have attempted to implement the guide, judge\u2019s ratings remained influenced by their own criteria set by their personal experiences and beliefs which were often deeprooted and independent of the three identified criteria. This is especially so for judges who had prior experience in standard setting with Angoff method and had formed their own set of criteria. We see greater value in the use of the guide for training judges who were participating in Angoff standard setting for the first time.<\/span><\/p>\r\n<p style=\"text-align: justify;\"><span lang=\"EN-GB\">The guide was developed within a specific medical school in Southeast Asia with its own unique curriculum and learning objectives. Its applicability and effectiveness may be limited in different educational contexts with varying curricula and assessment methods. These limitations highlighted the need for ongoing evaluation and adaptation of the guide and standard-setting methods to ensure they meet the needs of diverse educational settings and provide reliable assessment outcomes. The team is working on validating the use of the guide in our own local context. This would be conducted by quantifying the level of agreement between judges\u2019 ratings, correlating with other standard setting methods and soliciting feedback from judges on the utility of the guide.<\/span><\/p>\r\n<p style=\"text-align: center;\"><span lang=\"EN-GB\"><\/span><strong><span lang=\"EN-GB\">V. CONCLUSION<\/span><\/strong><\/p>\r\n<p style=\"text-align: justify;\"><span lang=\"EN-GB\">As more medical schools begin to adopt criterion referenced standard setting methods to set a defensible pass mark for assessments and given the complex process judges face when rating items, there is value in the provision of a guide to judges with defined criteria to facilitate the process of rating items. <\/span><\/p>\r\n<p style=\"text-align: justify;\"><span lang=\"EN-GB\">By focusing on criteria such as relevance, frequency, and difficulty, the guide aimed to provide a structured framework for judges to make more consistent and objective probability estimates of a borderline candidate\u2019s performance. Preliminary observations suggested that the guide has been useful in standardising judges\u2019 evaluations and aligning them with the intended competence levels of a borderline candidate. However, variability in judges\u2019 personal criteria and context-specific development posed potential issues. Pilot testing, inter-rater reliability studies, and expert reviews were essential in evaluating the guide\u2019s impact on the pass marks. Ultimately, a well-validated guide has the potential to improve the fairness and reliability of assessments in medical and health professions education, ensuring that graduating students are competently prepared to provide safe and effective patient care.<\/span><\/p>\r\n<p align=\"center\"><strong><span lang=\"EN-GB\">Notes on Contributors<\/span><\/strong><\/p>\r\n<p style=\"text-align: justify;\"><span lang=\"EN-GB\">Han Ting Jillian Yeo contributed to writing and editing the manuscript. <\/span><\/p>\r\n<p style=\"text-align: justify;\"><span lang=\"EN-GB\">Dujeepa Samarasekera contributed to the concept and development of the manuscript.<\/span><\/p>\r\n<p align=\"center\"><strong><span lang=\"EN-GB\">Ethical Approval<\/span><\/strong><\/p>\r\n<p style=\"text-align: justify;\"><span lang=\"EN-GB\">No ethical approval was required for this study as no data were collected.<\/span><\/p>\r\n<p align=\"center\"><strong><span lang=\"EN-GB\">Funding<\/span><\/strong><\/p>\r\n<p style=\"text-align: justify;\"><span lang=\"EN-GB\">No funding sources are associated with this paper.<\/span><\/p>\r\n<p align=\"center\"><strong><span lang=\"EN-GB\">Declaration of Interest<\/span><\/strong><\/p>\r\n<p><span lang=\"EN-GB\">There are no conflicts of interests related to the content presented in the paper.<\/span><\/p>\r\n<p align=\"center\" style=\"text-align: center;\"><strong><span lang=\"EN-GB\">References<\/span><\/strong><\/p>\r\n<p style=\"text-align: left;\">Norcini J. J. (2003). Setting standards on educational tests. <i>Medical Education, 37<\/i>(5), 464\u2013469. <a href=\"https:\/\/doi.org\/10.1046\/j.1365-2923.%202003.01495.x\">https:\/\/doi.org\/10.1046\/j.1365-2923. 2003.01495.x<\/a><span><\/span><\/p>\r\n<p style=\"text-align: left;\">Tavakol, M., &amp; Dennick, R. (2017). The foundations of measurement and assessment in medical education. <i>Medical Teacher<\/i>, <i>39<\/i>(10), 1010\u20131015. <a href=\"https:\/\/doi.org\/10.1080\/0142159X.%202017.1359521\">https:\/\/doi.org\/10.1080\/0142159X. 2017.1359521<\/a><span><\/span><\/p>\r\n<p style=\"text-align: left;\">Taylor, C. A., Gurnell, M., Melville, C. R., Kluth, D. C., Johnson, N., &amp; Wass, V. (2017). Variation in passing standards for graduation-level knowledge items at UK medical schools.\u00a0<i>Medical Education<\/i>,\u00a0<i>51<\/i>(6), 612\u2013620. <a href=\"https:\/\/doi.org\/10.1111\/medu.13240\">https:\/\/doi.org\/10.1111\/medu.13240<\/a><span><\/span><\/p>\r\n<p style=\"text-align: left;\">Yeates, P., Cope, N., Luksaite, E., Hassell, A., &amp; Dikomitis, L. (2019). Exploring differences in individual and group judgements in standard setting. <i>Medical Education, 53<\/i>(9), 941\u2013952. <a href=\"https:\/\/doi.org\/10.1111\/medu.13915\">https:\/\/doi.org\/10.1111\/medu.13915<\/a><\/p>\r\n<p>*<span lang=\"EN-GB\">Han Ting Jillian Yeo<\/span><br \/>\r\n<span lang=\"EN-GB\">10 Medical Drive<\/span><br \/>\r\n<span lang=\"EN-GB\">Singapore 117597<\/span><br \/>\r\n<span lang=\"EN-GB\">Email: jillyeo@nus.edu.sg<\/span><\/p>","protected":false},"featured_media":0,"parent":0,"menu_order":12,"template":"","issues_category":[61890],"archive_category":[],"issue_type":[25],"volume_category":[61894],"class_list":["post-72819","issues","type-issues","status-publish","hentry","issues_category-innovations-and-insights","issue_type-past-issue","volume_category-volume-10-number-2-april-2025"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v26.4 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Enhancing standard setting: A judge&#039;s guide for the Angoff method in assessing borderline students - The Asia Pacific Scholar<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/medicine.nus.edu.sg\/taps\/issues\/enhancing-standard-setting-a-judges-guide-for-the-angoff-method-in-assessing-borderline-students\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Enhancing standard setting: A judge&#039;s guide for the Angoff method in assessing borderline students - The Asia Pacific Scholar\" \/>\n<meta property=\"og:description\" content=\"Submitted: 14 March 2024 Accepted: 13 November 2024 Published online: 1 April, TAPS 2025, 10(2), 91-93 https:\/\/doi.org\/10.29060\/TAPS.2025-10-2\/II3264 Han Ting Jillian Yeo &amp; Dujeepa D. Samarasekera Centre for Medical Education (CenMED), Yong Loo Lin School of Medicine, National University of Singapore, Singapore I. INTRODUCTION Assessment is an important component of training in ensuring that graduating students [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/medicine.nus.edu.sg\/taps\/issues\/enhancing-standard-setting-a-judges-guide-for-the-angoff-method-in-assessing-borderline-students\/\" \/>\n<meta property=\"og:site_name\" content=\"The Asia Pacific Scholar\" \/>\n<meta property=\"article:modified_time\" content=\"2025-07-01T00:59:37+00:00\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data1\" content=\"7 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/medicine.nus.edu.sg\/taps\/issues\/enhancing-standard-setting-a-judges-guide-for-the-angoff-method-in-assessing-borderline-students\/\",\"url\":\"https:\/\/medicine.nus.edu.sg\/taps\/issues\/enhancing-standard-setting-a-judges-guide-for-the-angoff-method-in-assessing-borderline-students\/\",\"name\":\"Enhancing standard setting: A judge's guide for the Angoff method in assessing borderline students - The Asia Pacific Scholar\",\"isPartOf\":{\"@id\":\"https:\/\/medicine.nus.edu.sg\/taps\/#website\"},\"datePublished\":\"2025-03-18T01:45:33+00:00\",\"dateModified\":\"2025-07-01T00:59:37+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/medicine.nus.edu.sg\/taps\/issues\/enhancing-standard-setting-a-judges-guide-for-the-angoff-method-in-assessing-borderline-students\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/medicine.nus.edu.sg\/taps\/issues\/enhancing-standard-setting-a-judges-guide-for-the-angoff-method-in-assessing-borderline-students\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/medicine.nus.edu.sg\/taps\/issues\/enhancing-standard-setting-a-judges-guide-for-the-angoff-method-in-assessing-borderline-students\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/medicine.nus.edu.sg\/taps\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Enhancing standard setting: A judge&#8217;s guide for the Angoff method in assessing borderline students\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/medicine.nus.edu.sg\/taps\/#website\",\"url\":\"https:\/\/medicine.nus.edu.sg\/taps\/\",\"name\":\"The Asia Pacific Scholar\",\"description\":\"\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/medicine.nus.edu.sg\/taps\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Enhancing standard setting: A judge's guide for the Angoff method in assessing borderline students - The Asia Pacific Scholar","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/medicine.nus.edu.sg\/taps\/issues\/enhancing-standard-setting-a-judges-guide-for-the-angoff-method-in-assessing-borderline-students\/","og_locale":"en_US","og_type":"article","og_title":"Enhancing standard setting: A judge's guide for the Angoff method in assessing borderline students - The Asia Pacific Scholar","og_description":"Submitted: 14 March 2024 Accepted: 13 November 2024 Published online: 1 April, TAPS 2025, 10(2), 91-93 https:\/\/doi.org\/10.29060\/TAPS.2025-10-2\/II3264 Han Ting Jillian Yeo &amp; Dujeepa D. Samarasekera Centre for Medical Education (CenMED), Yong Loo Lin School of Medicine, National University of Singapore, Singapore I. INTRODUCTION Assessment is an important component of training in ensuring that graduating students [&hellip;]","og_url":"https:\/\/medicine.nus.edu.sg\/taps\/issues\/enhancing-standard-setting-a-judges-guide-for-the-angoff-method-in-assessing-borderline-students\/","og_site_name":"The Asia Pacific Scholar","article_modified_time":"2025-07-01T00:59:37+00:00","twitter_card":"summary_large_image","twitter_misc":{"Est. reading time":"7 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/medicine.nus.edu.sg\/taps\/issues\/enhancing-standard-setting-a-judges-guide-for-the-angoff-method-in-assessing-borderline-students\/","url":"https:\/\/medicine.nus.edu.sg\/taps\/issues\/enhancing-standard-setting-a-judges-guide-for-the-angoff-method-in-assessing-borderline-students\/","name":"Enhancing standard setting: A judge's guide for the Angoff method in assessing borderline students - The Asia Pacific Scholar","isPartOf":{"@id":"https:\/\/medicine.nus.edu.sg\/taps\/#website"},"datePublished":"2025-03-18T01:45:33+00:00","dateModified":"2025-07-01T00:59:37+00:00","breadcrumb":{"@id":"https:\/\/medicine.nus.edu.sg\/taps\/issues\/enhancing-standard-setting-a-judges-guide-for-the-angoff-method-in-assessing-borderline-students\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/medicine.nus.edu.sg\/taps\/issues\/enhancing-standard-setting-a-judges-guide-for-the-angoff-method-in-assessing-borderline-students\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/medicine.nus.edu.sg\/taps\/issues\/enhancing-standard-setting-a-judges-guide-for-the-angoff-method-in-assessing-borderline-students\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/medicine.nus.edu.sg\/taps\/"},{"@type":"ListItem","position":2,"name":"Enhancing standard setting: A judge&#8217;s guide for the Angoff method in assessing borderline students"}]},{"@type":"WebSite","@id":"https:\/\/medicine.nus.edu.sg\/taps\/#website","url":"https:\/\/medicine.nus.edu.sg\/taps\/","name":"The Asia Pacific Scholar","description":"","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/medicine.nus.edu.sg\/taps\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"}]}},"_links":{"self":[{"href":"https:\/\/medicine.nus.edu.sg\/taps\/wp-json\/wp\/v2\/issues\/72819","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/medicine.nus.edu.sg\/taps\/wp-json\/wp\/v2\/issues"}],"about":[{"href":"https:\/\/medicine.nus.edu.sg\/taps\/wp-json\/wp\/v2\/types\/issues"}],"wp:attachment":[{"href":"https:\/\/medicine.nus.edu.sg\/taps\/wp-json\/wp\/v2\/media?parent=72819"}],"wp:term":[{"taxonomy":"issues_category","embeddable":true,"href":"https:\/\/medicine.nus.edu.sg\/taps\/wp-json\/wp\/v2\/issues_category?post=72819"},{"taxonomy":"archive_category","embeddable":true,"href":"https:\/\/medicine.nus.edu.sg\/taps\/wp-json\/wp\/v2\/archive_category?post=72819"},{"taxonomy":"issue_type","embeddable":true,"href":"https:\/\/medicine.nus.edu.sg\/taps\/wp-json\/wp\/v2\/issue_type?post=72819"},{"taxonomy":"volume_category","embeddable":true,"href":"https:\/\/medicine.nus.edu.sg\/taps\/wp-json\/wp\/v2\/volume_category?post=72819"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}