LLM Reasoning 相关度: 8/10

Generative AI Spotlights the Human Core of Data Science: Implications for Education

Nathan Taback
arXiv: 2604.02238v1 发布: 2026-04-02 更新: 2026-04-02

AI 摘要

生成式AI凸显数据科学的人本核心,教育应聚焦人类推理能力。

主要贡献

  • 强调数据科学教育中人类推理的重要性
  • 分析了生成式AI对数据科学工作流程的影响
  • 提出了数据科学课程改革的建议

方法论

通过分析数据科学的发展历程和生成式AI的影响,提出了教育建议。

原文摘要

Generative AI (GAI) reveals an irreducible human core at the center of data science: advances in GAI should sharpen, rather than diminish, the focus on human reasoning in data science education. GAI can now execute many routine data science workflows, including cleaning, summarizing, visualizing, modeling, and drafting reports. Yet the competencies that matter most remain irreducibly human: problem formulation, measurement and design, causal identification, statistical and computational reasoning, ethics and accountability, and sensemaking. Drawing on Donoho's Greater Data Science framework, Nolan and Temple Lang's vision of computational literacy, and the McLuhan-Culkin insight that we shape our tools and thereafter our tools shape us, this paper traces the emergence of data science through three converging lineages: Tukey's intellectual vision of data analysis as a science, the commercial logic of surveillance capitalism that created industrial demand for data scientists, and the academic programs that followed. Mapping GAI's impact onto Donoho's six divisions of Greater Data Science shows that computing with data (GDS3) has been substantially automated, while data gathering, preparation, and exploration (GDS1) and science about data science (GDS6) still require essential human input. The educational implication is that data science curricula should focus on this human core while teaching students how to contribute effectively within iterative prompt-output-prompt cycles using retrieval-augmented generation, and that learning outcomes and assessments should explicitly evaluate reasoning and judgment.

标签

生成式AI 数据科学 教育 人工智能

arXiv 分类

cs.CY cs.AI stat.AP