Award Abstract # 1918751
Advancing the Science of Learning Data Science with Adaptive Learning for Future Workforce Development

NSF Org: DUE
Division Of Undergraduate Education
Recipient: UNIVERSITY OF MEMPHIS
Initial Amendment Date: January 13, 2020
Latest Amendment Date: January 13, 2020
Award Number: 1918751
Award Instrument: Standard Grant
Program Manager: Nasser Alaraje
nalaraje@nsf.gov
 (703)292-8063
DUE
 Division Of Undergraduate Education
EDU
 Directorate for STEM Education
Start Date: January 15, 2020
End Date: August 31, 2025 (Estimated)
Total Intended Award Amount: $3,439,035.00
Total Awarded Amount to Date: $3,439,035.00
Funds Obligated to Date: FY 2020 = $3,439,035.00
History of Investigator:
  • Andrew Olney (Principal Investigator)
    aolney@memphis.edu
  • Vasile Rus (Co-Principal Investigator)
  • Scott Fleming (Co-Principal Investigator)
  • Dale Bowman (Co-Principal Investigator)
  • Andrew Tawfik (Co-Principal Investigator)
Recipient Sponsored Research Office: University of Memphis
115 JOHN WILDER TOWER
MEMPHIS
TN  US  38152-0001
(901)678-3251
Sponsor Congressional District: 09
Primary Place of Performance: The University of Memphis
Applicant Services 101 Wilder To
Memphis
TN  US  38152-3370
Primary Place of Performance
Congressional District:
09
Unique Entity Identifier (UEI): F2VSMAKDH8Z7
Parent UEI:
NSF Program(s): IUSE
Primary Program Source: 04002021DB NSF Education & Human Resource
Program Reference Code(s): 7967, 9178
Program Element Code(s): 199800
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.076

ABSTRACT

This project aims to serve the national interest by improving training in data science. Data scientists are needed to power the ongoing revolution in Big Data that is transforming virtually every sector of the economy. Progress in training data scientists is currently limited by a lack of understanding about how data science is learned and by a lack of techniques to optimize that learning. This project will advance understanding of how data science is learned by weaving together statistics, programming, and machine learning and experimental results about student learning. It will use this understanding to create an innovative Artificial Intelligence-enabled data science tutor called ?DataWhys.? The DataWhys tutor can be integrated into JupyterLab, an established professional data science tool, and will provide 250 hours of training content.

To advance understanding of how data science is learned and how to optimize that learning, this project will identify the most effective scaffolds for worked examples across varying levels of expertise and identify when scaffolds should be removed. It will then compare a data science intelligent tutoring condition that implements these findings against worked example and pure problem-solving controls. This approach will synthesize previous work in the related fields of statistics, programming, and machine learning education, each of which has used only a few of the scaffolds and techniques that will be comprehensively investigated in this project. In addition to cross-sectional studies with college freshman, STEM majors, and graduate students, longitudinal studies will be conducted in partnership with the data science division of St. Jude Children's Research Hospital and through a summer internship for STEM majors from LeMoyne-Owen College. These longitudinal studies will provide additional evidence regarding workforce relevance through usability metrics and progress in personal learning plans. Source code and training material produced under the project will be publicly shared on GitHub where it can be freely used and modified by anyone under the open-source Apache license. This project is supported by the Accelerating Discovery: Educating the Future STEM Workforce program, which funds projects to educate the STEM workforce in the critical scientific areas defined by the Big Ideas for NSF Investment.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

(Showing: 1 - 10 of 61)
Banjade, R and Oli, P and Sajib, MH and Rus, V "Identifying Gaps in Students Explanations of Code Using LLMs" , v.14830 , 2024 Citation Details
Banjade, R. and Oli, P. and Tamang, L.J. and Rus, V. "Preliminary Experiments with Transformer based Approaches To Automatically Inferring Domain Models from Textbooks" Proceedings of the 15th International Conference on Educational Data Mining , 2022 Citation Details
Banker, Amanda and Pavlik, Philip and Olney, Andrew and Eglington, Luke "Online Tutoring System (MoFaCTS) for Anatomy and Physiology: Implementation and Initial Impressions" HAPS Educator , v.26 , 2022 https://doi.org/10.21692/haps.2022.012 Citation Details
Barboza, Luiz and Ferreira_Mello, Rafael and Souza_Teixeira, Erico and Olney, Andrew M "Visual Data Science with Blockly-DS" , 2024 https://doi.org/10.1145/3626253.3633408 Citation Details
Bridson, Kathryn and Atkinson, Jeffrey and Fleming, Scott D. "Delivering Round-the-Clock Help to Software Engineering Students Using Discord: An Experience Report" Proceedings of the 53rd ACM Technical Symposium on Computer Science Education , v.1 , 2022 https://doi.org/10.1145/3478431.3499385 Citation Details
Bridson, Kathryn and Fleming, Scott D. "Frequent, Timed Coding Tests for Training and Assessment of Full-Stack Web Development Skills: An Experience Report" Proceedings of the 52nd ACM Technical Symposium on Computer Science Education , 2021 https://doi.org/10.1145/3408877.3432549 Citation Details
Chapagain, Jeevan and Risha, Zak and Banjade, Rabin and Oli, Priti and Tamang, Lasang and Brusilovsky, Peter and RUs, Vasile "SelfCode: An Annotated Corpus and a Model for Automated Assessment of Self-Explanation During Source Code Comprehension" The International FLAIRS Conference Proceedings , v.36 , 2023 https://doi.org/10.32473/flairs.36.133385 Citation Details
DeFalco, J. A. and Blake-Plock, S. and Hampton, A. J. "The Renovated Room: Ethical Implications of Intentional AI in Learning Technology" Proceedings of the Ninth Annual GIFT Users Symposium (GIFTsym9) , 2021 Citation Details
Deng, Lih-Yuan and Yang, Ching-Chi and Bowman, Dale and Lin, Dennis K. and Lu, Henry Horng-Shing "Big Data Model Building Using Dimension Reduction and Sample Selection" Journal of Computational and Graphical Statistics , 2023 https://doi.org/10.1080/10618600.2023.2260052 Citation Details
Gatewood, Jessica and Tawfik, Andrew and Gish-Lieberman, Jaclyn J. "From Singular Design to Differentiation: A History of Adaptive Systems" TechTrends , v.66 , 2022 https://doi.org/10.1007/s11528-022-00702-3 Citation Details
Gish-Lieberman, Jaclyn J. and Tawfik, Andrew and Gatewood, Jessica "Micro-Credentials and Badges in Education: a Historical Overview" TechTrends , v.65 , 2021 https://doi.org/10.1007/s11528-020-00567-4 Citation Details
(Showing: 1 - 10 of 61)

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page