Skip to feedback

Award Abstract # 1909528
AF: Small: Collaborative Research: An Investigation of Richer Conductance Measures for Real-World Graphs

NSF Org: CCF
Division of Computing and Communication Foundations
Recipient: PURDUE UNIVERSITY
Initial Amendment Date: June 19, 2019
Latest Amendment Date: June 19, 2019
Award Number: 1909528
Award Instrument: Standard Grant
Program Manager: Peter Brass
pbrass@nsf.gov
 (703)292-2182
CCF
 Division of Computing and Communication Foundations
CSE
 Directorate for Computer and Information Science and Engineering
Start Date: October 1, 2019
End Date: September 30, 2023 (Estimated)
Total Intended Award Amount: $250,000.00
Total Awarded Amount to Date: $250,000.00
Funds Obligated to Date: FY 2019 = $250,000.00
History of Investigator:
  • David Gleich (Principal Investigator)
Recipient Sponsored Research Office: Purdue University
2550 NORTHWESTERN AVE # 1100
WEST LAFAYETTE
IN  US  47906-1332
(765)494-1055
Sponsor Congressional District: 04
Primary Place of Performance: Purdue University
IN  US  47907-2114
Primary Place of Performance
Congressional District:
04
Unique Entity Identifier (UEI): YRXVL4JYCEF5
Parent UEI: YRXVL4JYCEF5
NSF Program(s): Algorithmic Foundations
Primary Program Source: 01001920DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 7923, 7933
Program Element Code(s): 779600
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.070

ABSTRACT

Over the past two decades, massive networks or graphs appear in the very fabric of society. For example, networks appear in the form of social networks such as people on Facebook, sharing networks such as Twitter, computer networks such as the routers of the Internet, and power systems. Understanding the structure of these real-world networks is a fundamental scientific challenge. A significant aspect to this structure is the existence of "communities", or tightly knit collections of objects in the network with a large number of connections within them; examples iwould include a large group of mutual friends in a social networks, or an echo-chamber in a sharing network. A common technique to analyze community structure, as well as other structural features, is the use of random walks. In this technique, one imagines a particle that randomly walks in the graph by simply moving from object to object by following a random connection at each step. Despite the simplicity of this technique, it forms the foundation for state-of-the-art community-detection and graph-sampling methods; however, although there is a rich and deep mathematical theory on random walks, there is a lack of understanding for the success of this technique. In particular, the current theory on random walks essentially uses measures of "bottlenecks" (called conductance), and shows that random walks are effective when there are no bottlenecks. But there is overwhelming empirical evidence that real-world graphs contain bottlenecks, yet random walks are effective for analyzing them. The main aim of this research is a scientific investigation of this phenomenon, with the hope of finding the right mathematical tools to explain this behavior. Given the central role that massive networks play in modern society, such studies play a fundamental role in scientific research.

It has been recognized in earlier work that the classic notion of conductance is too crude a lens to understand real-world graphs. The aim of this research is to design richer conductance measures to study the behavior of random walks, design provably robust algorithms to approximate these measures, and demonstrate the relevance of these measures for algorithmic problems in graph sampling. The starting point for the investigation is a "truncated" notion of conductance that ignores small sets, introduced in the discrete math literature to study volumes of convex bodies. The investigators believe this to be a more useful characterization of random walks on real-world graphs. This leads to a number of research challenges. The first challenge is to design efficient algorithms that approximate these richer conductance measures. The second challenge is to prove that existing empirical heuristics are exploiting these other conductance measures, to get performance better than that predicted by previous theory. The third challenge is to perform a detailed study of these measures on real-world graphs in order to empirically ground the theory. One of the by-products of this research will be a greater insight into the actual structure of real-world graphs, and this will likely inspire better models. The primary outcomes from this research will be in the form of theorems and algorithms, as well as papers describing them, that characterize the impact of richer conductance measures on the behavior of algorithms run on networks. The investigators also plan to release software to compute or approximate the new conductance measures proposed.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

(Showing: 1 - 10 of 17)
Ibrahim, Rania and Gleich, David F "Local hypergraph clustering using capacity releasing diffusion" PLOS ONE , v.15 , 2020 https://doi.org/10.1371/journal.pone.0243485 Citation Details
Benson, Austin R. and Veldt, Nate and Gleich, David F. "Fauci-Email: A JSON Digest of Anthony Fauci's Released Emails" Proceedings of the International AAAI Conference on Web and Social Media , v.16 , 2022 Citation Details
Bonato, Anthony and Eikmeier, Nicole and Gleich, David F. and Malik, Rehan "Centrality in dynamic competition networks" Complex Networks and Their Applications VIII , 2019 https://doi.org/10.1007/978-3-030-36683-4_9 Citation Details
Colley, Charles and Nassar, Huda and Gleich, David F. "Dominant Z-Eigenpairs of Tensor Kronecker Products Decouple" SIAM Journal on Matrix Analysis and Applications , v.44 , 2023 https://doi.org/10.1137/22M1502008 Citation Details
Eikmeier, Nicole and Gleich, David F "Classes of preferential attachment and triangle preferential attachment models with power-law spectra" Journal of Complex Networks , v.8 , 2020 https://doi.org/10.1093/comnet/cnz040 Citation Details
Fountoulakis, Kimon and Liu, Meng and Gleich, David F. and Mahoney, Michael W. "Flow-Based Algorithms for Improving Clusters: A Unifying Framework, Software, and Performance" SIAM Review , v.65 , 2023 https://doi.org/10.1137/20m1333055 Citation Details
Gan, Junhao and Gleich, David F. and Veldt, Nate and Wirth, Anthony and Zhang, Xin "Graph Clustering in All Parameter Regimes" 45th International Symposium on Mathematical Foundations of Computer Science (MFCS 2020) , 2020 https://doi.org/10.4230/LIPIcs.MFCS.2020.39 Citation Details
Huang, Yufan and Gleich, David F and Veldt, Nate "Densest Subhypergraph: Negative Supermodular Functions and Strongly Localized Methods" , 2024 https://doi.org/10.1145/3589334.3645624 Citation Details
Jones, Landon R. and Swihart, Robert K. and Gleich, David F. and Albers, Geriann and Johnson, Scott A. and Hudson, Cassie M. and Zollner, Patrick A. "Estimating statewide carrying capacity of bobcats (Lynx rufus) using improved maximum clique algorithms" Landscape Ecology , v.37 , 2022 https://doi.org/10.1007/s10980-022-01460-6 Citation Details
Liu, Meng and Dey, Tamal K. and Gleich, David F. "Topological structure of complex predictions" Nature Machine Intelligence , 2023 https://doi.org/10.1038/s42256-023-00749-8 Citation Details
Liu, Meng and Gleich, David F. "Strongly local p-norm-cut algorithms for semi-supervised learning and local graph clustering" Advances in Neural Information Processing Systems (NeurIPS) , 2020 Citation Details
(Showing: 1 - 10 of 17)

PROJECT OUTCOMES REPORT

Disclaimer

This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.

Networks of interactions are commonplace in our electronically mediated lives. We browse websites, we interact with friends, we message colleagues, we publicize research on social networks. These networks have rich and intricate structure. The focus on our project was to develop a new way to study this structure and address a hypothesized feature of this structure. Specifically, we wished to develop a size-resolved study of structure in these networks. This would permit us to understand what happens to groups in these networks as the groups get larger. 

 

Around a decade before our project started, a team of researchers looking at these networks identified a counter intuitive behavior in the structure of these networks as the groups get larger. To understand why their finding was counter intuitive, we need to consider what happens in physical space. Suppose we have a network or graph that describes a region of space where elements are connected if they are close by in space. The road network is a good example of this type of structure – roads connect nearby cities. We don’t a single, unique road directly connecting Fargo North Dakota to Miami, for example, although there is a sequence of roads that do make this journey possible. If we have a group of cities that are all nearby and “grow” the grow by including more cities nearby, then we can think of what happens to the surface area and physical area or volume of the group in the network. A fundamental result in geometry is that the interior grows faster than the exterior, e.g. as we grow a circle, or a square, this always happens. Consequently, if we look at the boundary to interior, we expect the boundary to grow slower. 

 

But that isn’t what was observed to happen with networks from our electrotonically mediated lives. Instead, what happened with the same ideas applied to these networks is that the groups boundary grew worse as the sets got bigger. To do this, the team a decade ago resorted to extensive computations to make sure this is likely what was going on, but there was no theoretical justification for their result. The worry was that maybe the algorithms just couldn’t find these groups that had smaller boundaries than their interiors. 

 

Through the research developed during this grant, we show that this is not the case and that the original conjecture is correct. To do this, we developed a new type of algorithm that can bound the tradeoff between boundary and interior for all sets in a graph as they get larger. This algorithm rests on an idea called mu-conductance that was developed in a different area but for a related purpose – that is, bounding the behavior of sets as they get bigger. 

 

We also discovered several related research findings along the way. These include new types of algorithms that produce optimal groups under specific types of conditions. We also developed a theory of regularized diffusions in graphs and hypergraphs that enables new types of solutions to problems of predicting group membership in a graph. That is, if one is given part of a group of nodes in a network, these other algorithms would help identify the rest of the group. 

 

The broader impacts of our work included training a PhD student. There are a variety of software packages developed that implement the algorithms developed in this proposal. They are freely available and support continued study of related problems. We also developed a number of videos on YouTube that detail the findings of the research in ways that will make them easier for others to learn about in the future. 

 


Last Modified: 01/24/2024
Modified by: David F Gleich

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page