2026 ASEE Annual Conference & Exposition

Toward Scalable Assessment of Undergraduate Reflective Practice: Comparing Multiple Reflection Quality Codebook Validation Approaches

Presented at Reflection

This empirical research brief presents preliminary findings on the development and validation of a theory-driven, automation-ready qualitative codebook for assessing reflection quality in STEM learning environments. Reflective writing is increasingly common in STEM higher education, supporting students’ metacognition, conceptual understanding, professional identity development, and lifelong learning. However, researchers and instructors face a shared challenge: how can reflection quality be assessed at scale to support systematic analyses in specific courses? Current approaches to qualitative analysis of reflection quality are time- and labor-intensive, vary across raters, and do not typically produce actionable, personalized feedback, especially in large classrooms. In response, this research asks: How can varying codebook validation methods support the reliability of qualitative codebook applications in STEM undergraduate reflective writing assignments?

To answer this question, we first present our two-dimensional codebook, with example excerpts illustrating levels of reflection quality along two dimensions: abstraction and situatedness. Abstraction refers to the level of reflective thinking exhibited, grounded in Bain et al.’s (2024) 5R framework for reflection quality (spanning Reporting/Responding, Relating, Reasoning, and Reconstruction). Situatedness refers to course contextual factors expected to surface in reflections for a given learning environment (e.g., course activities, learning objectives, academic discipline). The codebook was created using abductive coding methods and applied to a subset (N = 400 undergraduate student reflection sessions) of our larger corpus of reflective writing (N = 3,000+) to capture all forms of situatedness and associated reflection quality in given text units.

The second portion of this paper presents a codebook interrater reliability analysis, an adaptive comparative judgement analysis with expert instructors, and a comparative analysis of the affordances and constraints of these two codebook validation approaches. Our methods push the boundaries of validating and applying qualitative codebooks in STEM education research toward a reality where researchers and practitioners alike can adapt codebooks for reflection quality to specific course contexts and automate their application with the help of AI. This work stands in contrast to traditional codebook generation and use in discipline-based education research (DBER) today, where face validity is often the primary (and only) form of codebook validation and where codebooks tend to remain bounded to particular educational contexts.

These findings aim toward AI-enabled scaling of qualitative analyses of reflection quality, which requires codebooks that are automation-ready: validated, reliable, and contextualizable. Future work will advance understanding of how STEM undergraduate students develop as reflective practitioners while providing validated, AI-leveraged tools for scalable qualitative analysis of reflection quality.

Note

The full paper will be available to logged-in, registered conference attendees once the conference starts on June 21, 2026, and to all visitors after the conference ends on June 24, 2026.