2026 ASEE Annual Conference & Exposition

Identifying, categorizing, and evaluating the taxonomy of STEM content and the obstacles in converting handwritten lecture materials of engineering courses for improved accessibility

Presented at Minorities in Engineering Division(MIND) Technical Session 5

The digital accessibility of handwritten STEM materials remains a barrier to inclusive
engineering education, particularly for students who rely on screen readers and text-to-speech technologies. The goal of this study is to better understand the challenges involved in converting handwritten mathematical and other STEM content into structured digital formats. We present a comprehensive evaluation of the current state of Handwritten Mathematical Expression Recognition (HMER) models, with a focused analysis of the MinerU-2.5 Vision Language Model’s performance. By analyzing the equations from multimodal engineering and computing lecture slides, we identify a critical performance gap between symbolic recognition and structural representation. Our findings show that while the model achieves a high rate (87.0%) in recovering mathematical meaning, spatial failures such as collapsed formatting, misinterpreted
sub/superscripts, and dropped summation bounds result in a lower overall interpretability rate (72.4%). We formalize these challenges into a three-part taxonomy of obstacles to symbolic accuracy: structural and spatial failures in preserving 2D arrangement or delimiters, environmental interference from multimodal elements like arrows or overlapping boundaries, and semantic inaccuracies as a result of ambiguous character substitutions. Furthermore, our layout analysis reveals a 66.5% accuracy rate for page-level decomposition, with a 9.6% failure rate primarily caused by multimodal complexity. These obstacles are also formalized into three categories based on whether they stem from issues with type-casting, environmental boundaries, or hierarchical structure. The paper concludes that modern HMER systems are currently bottlenecked more by spatial hierarchy modeling than raw symbol detection. The classification of obstacles in converting handwritten mathematical and other STEM content in engineering courses
to LaTeX helps increase the digital accessibility of STEM resources by providing a roadmap for developing practical guidelines for instruction design, benchmarking, and tool development.

Authors
  1. Advita Gelli University of Illinois at Urbana - Champaign
  2. Alan Tao University of Illinois at Urbana - Champaign
  3. YANGYANG ZHANG University of Illinois at Urbana - Champaign
  4. Sonika Tamilarasan The University of Illinois at Chicago
  5. Jonathan Hogg University of Illinois at Urbana - Champaign
  6. Yang Victoria Shao University of Illinois at Urbana - Champaign [biography]
  7. Dr. Chrysafis Vogiatzis Orcid 16x16http://orcid.org/0000-0003-0787-9380 University of Illinois at Urbana - Champaign [biography]
  8. Dr. Pablo D Robles-Granda University of Illinois at Urbana - Champaign [biography]
  9. Prof. Lawrence Angrave Orcid 16x16http://orcid.org/0000-0001-9762-7181 University of Illinois at Urbana - Champaign [biography]
  10. Dr. Hongye Liu University of Illinois at Urbana - Champaign [biography]
Note

The full paper will be available to logged in and registered conference attendees once the conference starts on June 21, 2026, and to all visitors after the conference ends on June 24, 2026