Award Abstract # 2411298
Collaborative Research: Frameworks: hpcGPT: Enhancing Computing Center User Support with HPC-enriched Generative AI

NSF Org: OAC
Office of Advanced Cyberinfrastructure (OAC)
Recipient: OHIO STATE UNIVERSITY, THE
Initial Amendment Date: April 2, 2024
Latest Amendment Date: April 2, 2024
Award Number: 2411298
Award Instrument: Standard Grant
Program Manager: Varun Chandola
vchandol@nsf.gov
 (703)292-2656
OAC
 Office of Advanced Cyberinfrastructure (OAC)
CSE
 Directorate for Computer and Information Science and Engineering
Start Date: August 1, 2024
End Date: July 31, 2027 (Estimated)
Total Intended Award Amount: $360,000.00
Total Awarded Amount to Date: $360,000.00
Funds Obligated to Date: FY 2024 = $360,000.00
History of Investigator:
  • Xia Wang (Principal Investigator)
    wang.1037@osu.edu
Recipient Sponsored Research Office: Ohio State University
1960 KENNY RD
COLUMBUS
OH  US  43210-1016
(614)688-8735
Sponsor Congressional District: 03
Primary Place of Performance: Ohio State University
1960 KENNY RD
COLUMBUS
OH  US  43210-1016
Primary Place of Performance
Congressional District:
03
Unique Entity Identifier (UEI): DLWBSLWAJWR1
Parent UEI: MN4MDDMN8529
NSF Program(s): Software Institutes
Primary Program Source: 01002425DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 077Z, 4444, 8004
Program Element Code(s): 800400
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.070

ABSTRACT

hpcGPT is a question answering service for academic computing centers such as the National Center for Supercomputing Applications, Ohio Supercomputer Center, San Diego Supercomputer Center, and Texas Advanced Computing Center. These Centers provide high-performance computing (HPC) platforms to tens of thousands of users for science and engineering research. In collaboration with Princeton University and Rutgers University, hpcGPT uses generative artificial intelligence (AI) and integrates heterogeneous data sources with different update frequencies to enhance the user support service quality and efficiency, decrease the response time, and improve precision of the support. With hpcGPT, user support teams can leverage the historical knowledge, real-time system status, and external technical expertise to better support the HPC users. With the high-quality and timely answers from hpcGPT, HPC users can resolve many technical issues, thus reducing the workload of the user support teams. This will allow the support teams to focus more on new and novel support issues. hpcGPT will significantly enhance the user support service quality, capacity, and efficiency without increasing the human effort.

hpcGPT combines the fine-tuning and Retrieval Augmented Generation (RAG) techniques to incorporate recent knowledge, past experience, domain expertise, documentations, and real-time system status of versatile computing. By building upon existing and recognized capabilities in large language model fine-tuning and hosting, retrieval augmentation generation, and external data source integration, hpcGPT reduces the complexity and effort required to align information and identify dependencies between questions, answers, and the supporting information. This is particularly beneficial for research groups and computing centers with diverse application requirements and limited staff. hpcGPT extends and translates a suite of Cyberinfrastructure building blocks and technologies such as large language model training and inference service hosting.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page