2025 ASEE Annual Conference & Exposition

FlexiGrader: an LLM-based personalized autograder to enable flexible and open-ended creative exploration in CS1

Presented at DSAI Technical Session 7: Natural Language Processing and LLM Applications

Computer Science courses often rely on programming assignments for learning assessment. Automatic grading (autograding) is a common mechanism to provide quick feedback to students and reduce teacher workload, especially in large classes. However, traditional autograders offer limited personalized feedback and often require all students to solve the same predefined problem, restricting creativity. In this paper, we address these limitations by developing an AI-based autograder that (1) can grade diverse, open-ended assignments where students work on independent, creative projects, enabling a new set of assessments in CS1 (introductory programming) courses, and (2) provides personalized feedback using large language models (LLMs). We present the design of a new assessment strategy in introductory programming courses where each student works on an open-ended problem for their summative assessment. We design generalized scaffolds (project proposal, schematic development, pseudocode, integration of files, and graphs) for these open-ended assessments so that each student completes a project of the desired complexity. Existing autograders require a rigid structure of inputs and outputs and therefore cannot grade such assessments. Our tool, FlexiGrader, integrates code execution verification and unit testing tailored to each student's individual specifications, followed by code analysis using our fine-tuned Llama model to generate feedback and grades. FlexiGrader is capable of handling submissions from large classes and ensures flexibility in grading free-form assignments, making it easier for instructors to design and assess varied projects. The input to our tool is a cover sheet that describes the individualized project, provides paths to external files, and describes the inputs needed to run the program that the student submits.
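As a rough illustration of the cover-sheet idea described above, the sketch below parses a hypothetical student cover sheet and assembles the command used for the execution-verification step. The JSON schema, field names, and project details here are our own illustrative assumptions, not the exact format used by FlexiGrader.

```python
import json

# Hypothetical cover-sheet format; every field name and value below is
# illustrative, not taken from the actual FlexiGrader specification.
COVER_SHEET = """
{
  "project_title": "Budget Tracker",
  "description": "Tracks monthly expenses and plots spending by category.",
  "entry_point": "main.py",
  "external_files": ["data/expenses.csv"],
  "sample_inputs": ["2024-01", "food"]
}
"""

def parse_cover_sheet(raw: str) -> dict:
    """Parse a student cover sheet and validate the fields an
    autograder would need before attempting to run the submission."""
    sheet = json.loads(raw)
    required = {"entry_point", "external_files", "sample_inputs"}
    missing = required - sheet.keys()
    if missing:
        raise ValueError(f"cover sheet missing fields: {sorted(missing)}")
    return sheet

def build_run_command(sheet: dict) -> list:
    """Build the per-student command for execution verification."""
    return ["python", sheet["entry_point"], *sheet["sample_inputs"]]

sheet = parse_cover_sheet(COVER_SHEET)
print(build_run_command(sheet))
# -> ['python', 'main.py', '2024-01', 'food']
```

Because every student supplies their own entry point, file paths, and inputs, the same harness can drive execution checks for arbitrarily different projects.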
Beyond this student-driven cover sheet, FlexiGrader provides options for the instructor to describe the grading rubric and choose the criteria that will be graded by the AI model. We hypothesize that live implementation of FlexiGrader in CS1 classrooms can enhance student self-efficacy and creativity in CS education by fostering independent project development. We plan to study this hypothesis in future research. Additionally, we discuss the operational costs of our autograding system, its compatibility with existing autograding frameworks, and the current limitations of our approach. By enabling more creative and personalized assignments, FlexiGrader has the potential to transform assessment practices in introductory computer science courses.
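The instructor-side rubric option might look like the following minimal sketch: the instructor marks which criteria are checked by tests and which are delegated to the language model, and only the latter are assembled into the grading prompt. The criterion names, weights, and prompt wording are assumptions for illustration, not the paper's actual rubric.

```python
# Illustrative rubric schema; criterion names and weights are assumptions.
RUBRIC = {
    "runs_without_errors": {"weight": 30, "graded_by": "tests"},
    "meets_proposal_goals": {"weight": 40, "graded_by": "llm"},
    "code_readability":    {"weight": 30, "graded_by": "llm"},
}

def build_llm_prompt(rubric: dict, description: str, code: str) -> str:
    """Assemble a grading prompt containing only the criteria the
    instructor delegated to the language model."""
    llm_criteria = [name for name, c in rubric.items()
                    if c["graded_by"] == "llm"]
    lines = [
        "Grade the following CS1 project against these criteria:",
        *[f"- {name} (weight {rubric[name]['weight']})"
          for name in llm_criteria],
        f"Project description: {description}",
        "Student code:",
        code,
    ]
    return "\n".join(lines)

prompt = build_llm_prompt(RUBRIC, "Budget tracker", "print('hello')")
print(prompt.splitlines()[1])
```

Splitting the rubric this way keeps objective criteria on deterministic unit tests while routing only the open-ended judgments to the LLM.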

Authors
  1. Mr. Alexis Frias, University of California, Merced
  2. Shrivaikunth Krishnakumar, San Jose State University
Note

The full paper will be available to logged-in, registered conference attendees once the conference starts on June 22, 2025, and to all visitors after the conference ends on June 25, 2025.