In this GIFTS paper (Great Ideas for Teaching, and Talking with, Students), we present a module around natural language processing (NLP), a branch of machine learning. It is the basis of many generative artificial intelligence (GAI) tools such as Chat GPT. Given the recent emphasis on GAI within universities and beyond, it has become important for first-year engineering students to develop a sense of how this technology functions. In this guide, we present a self-paced module for first-year engineering students focused on NLP. This lesson can be coupled with MATLAB, a common platform taught in the first year. The goal is for students to be able to articulate what natural language processing is and how it works, as well as to apply it to an example with real-world data. This module walks students through setting up the necessary MATLAB add-ons and addresses the following topics: loading in text, tokenization, removing stop words, lemmatization, sentiment analysis, bag of words, classifying text, and visualizing NLP. No prior experience with NLP is required to complete this module. Assessment is based on a challenge problem in which students select public data sets and apply the techniques learned through the lesson.
The full paper will be available to logged in and registered conference attendees once the conference starts on June 22, 2025, and to all visitors after the conference ends on June 25, 2025