The emergence of what is increasingly referred to as ‘vibe coding,’ formalized here as an AI-assisted development framework grounded in natural language prompting, iterative reasoning, and contextual evaluation, may represent an important shift in Data Science (DS) education. Instead of emphasizing syntax mastery and procedural execution, vibe coding reframes computational work as an interactive modeling process between human reasoning and intelligent systems. This paradigm can help rebalance the implementation of algorithms, analytical modeling, and theoretical understanding in DS instruction. This article develops a curriculum framework that integrates vibe coding into Data Science programs to support conceptual learning, model interpretability, and quantitative rigor. Automating routine programming tasks may allow faculty to reallocate instructional time toward the mathematical and statistical foundations of the discipline, including probability theory, linear algebra, optimization, and statistical inference. These areas form the computational core of Data Science and underpin reproducibility, verification, and uncertainty quantification. The curriculum structure is organized around three technical components. Conceptual alignment ensures that AI-assisted competencies are mapped to DS learning outcomes in data literacy, algorithmic reasoning, and model transparency. Pedagogical implementation embeds AI-supported programming exercises into courses such as Applied Data Modeling and AI Collaboration in Data Science, emphasizing prompt precision, data validation, and computational reproducibility. Ethical and analytical assurance introduces evaluation protocols for AI-generated code using statistical benchmarking, bias detection, and verification metrics. Integrating vibe coding into DS curricula can enhance accessibility, reduce redundant technical overhead, and strengthen students' ability to connect computational processes with statistical reasoning. This approach offers an instructional model that aligns automation with analytical depth, helping scientists prepare data to design, interpret, and validate AI-driven systems with both technical and mathematical precision. The approach may also be adapted, with discipline-specific modifications, to Computer Science curricula.
The full paper will be available to logged in and registered conference attendees once the conference starts on June 21, 2026, and to all visitors after the conference ends on June 24, 2026