Award Abstract # 1824303
SPX: Collaborative Research: Harnessing the Power of High-Bandwidth Memory via Provably Efficient Parallel Algorithms
NSF Org: |
CCF
Division of Computing and Communication Foundations
|
Recipient: |
CARNEGIE MELLON UNIVERSITY
|
Initial Amendment Date:
|
February 27, 2018 |
Latest Amendment Date:
|
February 27, 2018 |
Award Number: |
1824303 |
Award Instrument: |
Standard Grant |
Program Manager: |
Damian Dechev
CCF
Division of Computing and Communication Foundations
CSE
Directorate for Computer and Information Science and Engineering
|
Start Date: |
January 1, 2018 |
End Date: |
August 31, 2022 (Estimated) |
Total Intended Award
Amount: |
$249,999.00 |
Total Awarded Amount to
Date: |
$249,999.00 |
Funds Obligated to Date:
|
FY 2017 = $249,999.00
|
History of Investigator:
|
-
Benjamin
Moseley
(Principal Investigator)
moseleyb85@gmail.com
|
Recipient Sponsored Research
Office: |
Carnegie-Mellon University
5000 FORBES AVE
PITTSBURGH
PA
US
15213-3815
(412)268-8746
|
Sponsor Congressional
District: |
12
|
Primary Place of
Performance: |
Carnegie-Mellon University
5000 Forbes Avenue
Pittsburgh
PA
US
15213-3815
|
Primary Place of
Performance Congressional District: |
12
|
Unique Entity Identifier
(UEI): |
U3NKNFLNQ613
|
Parent UEI: |
U3NKNFLNQ613
|
NSF Program(s): |
PPoSS-PP of Scalable Systems
|
Primary Program Source:
|
01001718DB NSF RESEARCH & RELATED ACTIVIT
|
Program Reference
Code(s): |
026Z,
9150
|
Program Element Code(s):
|
042Y00
|
Award Agency Code: |
4900
|
Fund Agency Code: |
4900
|
Assistance Listing
Number(s): |
47.070
|
ABSTRACT

An important bottleneck for many parallel scientific applications is memory performance. Recently, vendors have introduced a new memory called high-bandwidth memory (HBM) as an approach to alleviate this bottleneck. This project will develop an algorithmic foundation for using HBM. The project has the potential for a broad economic, technological, and scientific impact, since industry has an investment in this technology and many of the nation's strategic codes are being run on HBM-capable machines. The PIs will integrate research with education at the graduate and undergraduate levels by training PhD, MS, and honors program BS students in cross-cutting issues encompassing algorithm design, high-performance software, and processor architecture.
The new approach offered by vendors is to bond memory (HBM) directly to the processor chip, which allows for more connections, enabling higher bandwidth. Although the size of the new memory is larger than modern on-chip caches, physical constraints limit the capacity of the memory to be significantly smaller than DRAM. HBM does not cleanly fit in the standard memory hierarchy. This project will develop a foundational understanding of how to algorithmically design codes for HBM enhanced architectures. Overcoming these intellectual challenges to achieve multi-core scalability using HBM requires new algorithms, models, and abstractions, spearheaded by this collaboration between researchers who study hardware issues, high performance computing challenges, and theoretical modeling and analysis.
This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:
When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external
site maintained by the publisher. Some full text articles may not yet be available without a
charge during the embargo (administrative interval).
Some links on this page may take you to non-federal websites. Their policies may differ from
this site.
(Showing: 1 - 10 of 41)
(Showing: 1 - 41 of 41)
Lu, K. and Lavastida, T. and Moseley, B. and Wang, Y.
"Scaling Average-Linkage via Sparse Cluster Embeddings"
Asian Conference on Machine Learning
, 2021
Citation
Details
Abo Khamis, Mahmoud and Curtin, Ryan and Im, Sungjin and Moseley, Ben and Ngo, Hung and Pruhs, Kirk and Samadian, Alireza
"An Approximation Algorithm for the Matrix Tree Multiplication Problem"
Symposium on Mathematical Foundations of Computer Science
, v.202
, 2021
https://doi.org/10.4230/LIPIcs.MFCS.2021.6
Citation
Details
Abo Khamis, Mahmoud and Curtin, Ryan R. and Moseley, Benjamin and Ngo, Hung Q. and Nguyen, XuanLong and Olteanu, Dan and Schleich, Maximilian
"On Functional Aggregate Queries with Additive Inequalities"
Symposium on Principles of Database Systems
, 2019
https://doi.org/10.1145/3294052.3319694
Citation
Details
Agrawal, Kunal and Lee, I-Ting Angelina and Li, Jing and Lu, Kefu and Moseley, Benjamin
"Practically Efficient Scheduler for Minimizing Average Flow Time of Parallel Jobs"
2019 IEEE International Parallel and Distributed Processing Symposium (IPDPS)
, 2019
10.1109/IPDPS.2019.00024
Citation
Details
Ahmadian, S and Epasto, A and Knittel, M and Kumar, R and Mahdian, M and Moseley, B and Pham, P and Vassilvitskii, S and Wang, Y
"Fair Hierarchical Clustering"
Advances in neural information processing systems
, 2020
https://doi.org/
Citation
Details
Berg, Benjamin and Harchol-Balter, Mor and Moseley, Benjamin and Wang, Weina and Whitehouse, Justin
"Optimal Resource Allocation for Elastic and Inelastic Jobs"
Proceedings of the 32nd ACM Symposium on Parallelism in Algorithms and Architectures (SPAA 2020)
, 2020
10.1145/3350755.3400265
Citation
Details
Berg, Benjamin and Whitehouse, Justin and Moseley, Benjamin and Wang, Weina and Harchol-Balter, Mor
"The case for phase-aware scheduling of parallelizable jobs"
Performance Evaluation
, 2021
https://doi.org/10.1016/j.peva.2021.102246
Citation
Details
Chakrabarti, A. and Moseley, B.
"Backprop with Approximate Activations for Memory-efficient Network"
NuerIPS
, 2019
Citation
Details
Curtin, R and Moseley, B and Ngo, H and Nguyen, X and Olteanu, D and Schleich, M
"Rk-means: Fast Clustering for Relational Data"
AISTATS
, 2020
Citation
Details
Das, Rathish and Agrawal, Kunal and Bender, Michael A. and Berry, Jonathan and Moseley, Benjamin and Phillips, Cynthia A.
"How to Manage High-Bandwidth Memory Automatically"
Symposium on Parallelism in Algorithms and Architectures
, 2020
https://doi.org/10.1145/3350755.3400233
Citation
Details
DeLayo, Daniel and Zhang, Kenny and Agrawal, Kunal and Bender, Michael A. and Berry, Jonathan W. and Das, Rathish and Moseley, Benjamin and Phillips, Cynthia A.
"Automatic HBM Management: Models and Algorithms"
Proc. 34th ACM Symposium on Parallelism in Algorithms and Architectures (SPAA)
, 2022
https://doi.org/10.1145/3490148.3538570
Citation
Details
Dinitz, M. and Im, S. and Lavastida, T. and Moseley, B and Vassilvitskii, S
"Algorithms with Prediction Portfolios"
Advances in neural information processing systems
, 2022
Citation
Details
Dinitz, M. and Im, S. and Lavastida, T. and Moseley, B. and Vassilvitskii, S.
"Faster Matchings via Learned Duals"
Advances in neural information processing systems
, 2021
Citation
Details
Dinitz, Michael and Moseley, Benjamin
"Scheduling for Weighted Flow and Completion Times in Reconfigurable Networks"
IEEE INFOCOM 2020 - IEEE Conference on Computer Communications
, 2020
https://doi.org/10.1109/INFOCOM41043.2020.9155537
Citation
Details
Giorgio Lucarelli, Benjamin Moseley
"Online Non-Preemptive Scheduling to Minimize Maximum Weighted Flow-Time on Related Machines"
FSTTCS
, 2019
https://doi.org/10.4230/LIPIcs.FSTTCS.2019.24
Citation
Details
Gupta, A. and Moseley, B. and Zhou, R.
"Structural Iterative Rounding for Generalized k-Median Problems"
International Colloquium on Automata, Languages, and Programming
, 2021
https://doi.org/10.4230/LIPIcs.ICALP.2021.77
Citation
Details
Im, S. and Moseley, B. and Pruhs, K. and Purohit, M
"Matroid Coflow Scheduling."
ICALP
, 2019
https://doi.org/10.4230/LIPIcs.ICALP.2019.145
Citation
Details
Im, Sungjin and Li, Shi and Moseley, Benjamin
"Breaking 1 - 1/e Barrier for Nonpreemptive Throughput Maximization"
SIAM Journal on Discrete Mathematics
, v.34
, 2020
https://doi.org/10.1137/17M1148438
Citation
Details
Im, Sungjin and Moseley, Benjamin and Munagala, Kamesh and Pruhs, Kirk
"Dynamic Weighted Fairness with Minimal Disruptions"
Sigmetrics
, 2020
10.1145/3393691.3394184
Citation
Details
Im, Sungjin and Moseley, Benjamin and Pruhs, Kirk
"The matroid intersection cover problem"
Operations Research Letters
, v.49
, 2021
https://doi.org/10.1016/j.orl.2020.11.003
Citation
Details
Im, Sungjin and Moseley, Benjamin and Zhou, Rudy
"The Matroid Cup Game"
Operations Research Letters
, v.49
, 2021
https://doi.org/10.1016/j.orl.2021.04.005
Citation
Details
Lattanzi, S. and Lavastida, T and Moseley, B Vassilvitskii
"Online Scheduling via Learned Weights"
vACM-SIAM Symposium on Discrete Algorithm
, 2020
10.1137/1.9781611975994.114
Citation
Details
Lattanzi, S and Moseley, B. and Vassilvitskii, S. and Wang, Y and Zhou, R.
"Robust Online Correlation Clustering"
Advances in neural information processing systems
, 2021
Citation
Details
Lavastida, T. and Moseley, B. and Ravi, R. and Xu, C.
"Learnable and Instance-Robust Predictions for Online Matching, Flows and Load Balancing"
European Symposium on Algorithms
, 2021
https://doi.org/10.4230/LIPIcs.ESA.2021.59
Citation
Details
Lavastida, T. and Ravi R. and Moseley, B. and Xu, C.
"Using Predicted Weights for Ad Delivery"
SIAM Conference on Applied and Computational Discrete Algorithms
, 2021
https://doi.org/10.1137/1.9781611976830.3
Citation
Details
Leichter, M. and Moseley, B. and Pruhs, P.
"An Efficient Reduction of a Gammoid to a Partition Matroid"
European Symposium on Algorithms
, 2021
https://doi.org/10.4230/LIPIcs.ESA.2021.62
Citation
Details
Moseley, B. and Vassilvitskii, S and Wang, Y.
"Hierarchical Clustering in General Metric Spaces using Approximate Nearest Neighbors"
International Conference on Artificial Intelligence and Statistics
, 2021
Citation
Details
Moseley, B. and \ Vassilvtiskii, S. and Wang, Y.
"Hierarchical Clustering in General Metric Spaces using Approximate Nearest Neighbors"
International Conference on Artificial Intelligence and Statistics
, 2021
Citation
Details
Moseley, B and Wang, J
"Approximation Bounds for Hierarchical Clustering: Average Linkage, Bisecting K-means, and Local Search"
Journal of machine learning research
, 2021
https://doi.org/
Citation
Details
Moseley, Benjamin and Pruhs, Kirk and Samadian, Alireza and Wang, Yuyan
"Relational Algorithms for k-Means Clustering"
International Colloquium on Automata, Languages, and Programming
, v.198
, 2021
https://doi.org/10.4230/LIPIcs.ICALP.2021.97
Citation
Details
Moseley, Benjamin and Zhang, Ruilong and Zhao, Shanjiawen
"Online Scheduling of Paralleizable Jobs in the Directed Acyclic Graphs and Speed-Up Curves Models"
SSRN Electronic Journal
, 2022
https://doi.org/10.2139/ssrn.4043347
Citation
Details
Samadian, A and Pruhs, K and Moseley, B and Im, S and Curtin, R
"Unconditional Coresets for Regularized Loss Minimization"
AISTATS
, 2020
Citation
Details
Silvio Lattanzi, Thomas Lavastida
"A Framework for Parallelizing Hierarchical Clustering Methods"
European Conference on Machine Learning
, 2019
Citation
Details
Singh, Shikha and Madaminov, Sergey and Bender, Michael A. and Ferdman, Michael and Johnson, Ryan and Moseley, Benjamin and Ngo, Hung and Nguyen, Dung and Olesen, Soeren and Stirewalt, Kurt and Washburn, Geoffrey
"A Scheduling Approach to Incremental Maintenance of Datalog Programs"
IPDPS
, 2020
10.1109/IPDPS47924.2020.00093
Citation
Details
Wang, Y and Moseley, B
"An Objective for Hierarchical Clustering in Euclidean Space and Its Connection to Bisecting K-means"
Proceedings of the AAAI Conference on Artificial Intelligence
, 2020
Citation
Details
Xu, Chenyang and Moseley, Benjamin
"Learning-Augmented Algorithms for Online Steiner Tree"
Proceedings of the AAAI Conference on Artificial Intelligence
, v.36
, 2022
https://doi.org/10.1609/aaai.v36i8.20854
Citation
Details
(Showing: 1 - 10 of 41)
(Showing: 1 - 41 of 41)
PROJECT OUTCOMES REPORT

Disclaimer
This Project Outcomes Report for the General Public is displayed verbatim as submitted by the
Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or
recommendations expressed in this Report are those of the PI and do not necessarily reflect the
views of the National Science Foundation; NSF has not approved or endorsed its content.
This project developed an algorithmic foundation for managing high-bandwidth memory (HBM). HBM is a new type of memory technology that has higher bandwidth than traditional DRAM, but similar latency. HBM's bandwidth is so high because it is placed directly onto the processor package (unlike DRAM). HBM thus augments the existing memory hierarchy by providing a memory that can be accessed with up to 5x higher bandwidth than DDR4 (today's DRAM technology) when feeding a CPU, and up to 20x higher bandwidth when feeding a GPU.
The challenge with modeling HBM is that it does not cleanly fit in the standard memory hierarchy. This project developed a foundational understanding of how to algorithmically design codes for HBM enhanced architectures. Specifically, the project developed new algorithms, models, and abstractions, as a result of a tightly coupled collaboration between researchers who study hardware issues, HPC challenges, and theoretical modeling and analysis. Among our key findings is the fact that automatic HBM management is indeed possible in multicore systems. Among the successes from this project were a series of papers that show the theoretical underpinning of how to manage high-bandwidth memory automatically, establish tight bounds for parallel paging, and desmonstrate algorithms for online parallel paging with optimal makespan. Our works have shown how to manage HBM automatically, as efficiently as a standard cache, and with provably good performance guarantees.
Last Modified: 02/16/2023
Modified by: Benjamin J Moseley
Please report errors in award information by writing to: awardsearch@nsf.gov.