Award Abstract # 2104489
CIF: Small: Coding-theoretic methods in discrepancy and energy optimization, with applications

NSF Org: CCF
Division of Computing and Communication Foundations
Recipient: UNIVERSITY OF MARYLAND, COLLEGE PARK
Initial Amendment Date: May 6, 2021
Latest Amendment Date: May 6, 2021
Award Number: 2104489
Award Instrument: Standard Grant
Program Manager: Phillip Regalia
pregalia@nsf.gov
 (703)292-2981
CCF
 Division of Computing and Communication Foundations
CSE
 Directorate for Computer and Information Science and Engineering
Start Date: July 1, 2021
End Date: June 30, 2024 (Estimated)
Total Intended Award Amount: $500,000.00
Total Awarded Amount to Date: $500,000.00
Funds Obligated to Date: FY 2021 = $500,000.00
History of Investigator:
  • Alexander Barg (Principal Investigator)
    abarg@umd.edu
Recipient Sponsored Research Office: University of Maryland, College Park
3112 LEE BUILDING
COLLEGE PARK
MD  US  20742-5100
(301)405-6269
Sponsor Congressional District: 04
Primary Place of Performance: University of Maryland, College Park
2361 A.V. Williams
College Park
MD  US  20742-1000
Primary Place of Performance
Congressional District:
04
Unique Entity Identifier (UEI): NPU8ULVAAS23
Parent UEI: NPU8ULVAAS23
NSF Program(s): Comm & Information Foundations
Primary Program Source: 01002122DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 7923, 7935
Program Element Code(s): 779700
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.070

ABSTRACT

The project studies properties of sequences of zeros and ones formed of n bits. Collections of such sequences, called codes, are used for representing data to be stored in computer memory or transmitted over an optical cable. In many applications in communications, statistics, and computer science it is beneficial to choose a code that is in some ways uniformly distributed over the set of all the possible binary sequences. These applications have led researchers to define a large group of problems in applied mathematics both on the theory side and in the domain of data processing procedures. The main topic of this project is investigation of uniformly distributed codes, including their construction, evaluation of the properties, a group of related geometric problems, as well and their uses in applied problems of algorithm design, computer vision, and economical representation of data. The project relies on ideas drawn from recent developments in computer science as well as certain classical methods in applied mathematics, and it aims at new characterizations and applications of uniformly distributed sets of binary sequences.

The theory of uniform distributions has seen ongoing development through most of the last century, motivated primarily by problems of numerical integration of multivariable functions. In the context of point sets on the surface of the sphere in n dimensions, approximation to the uniform distribution is quantified by the quadratic discrepancy of the point set, measured as the average number of points of the set in a region of the surface of the sphere. Spherical point sets with small quadratic discrepancy approach uniformly distributed collections of points on the sphere. This project is devoted to an extension of this theory to binary codes that approximate the uniform distribution on the Hamming space. Ways of advancing the theory of such codes were suggested in recent works of the investigator, and they rely on Fourier analysis on the Boolean cube, the theory of positive-definite kernels, linear programming, and other tools from coding theory. Applications of this work include estimating the error probability of decoding, derandomization of algorithms, and some variants of the compressed sensing problem.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Pathegama, Madhura and Barg, Alexander "Smoothing of Binary Codes, Uniform Distributions, and Applications" Entropy , v.25 , 2023 https://doi.org/10.3390/e25111515 Citation Details
Patra, A. and Barg, A. "Interior-point regenerating codes on graphs" IEEE International Symposium on Information Theory , 2022 Citation Details
Jain, Shubham P and Iosue, Joseph T and Barg, Alexander and Albert, Victor V "Quantum spherical codes" Nature Physics , v.20 , 2024 https://doi.org/10.1038/s41567-024-02496-y Citation Details
Barg, Alexander and Boyvalenkov, Peter and Stoyanova, Maya "Bounds for the sum of distances of spherical sets of small size" Discrete Mathematics , v.346 , 2023 https://doi.org/10.1016/j.disc.2023.113346 Citation Details
Barg, Alexander and Chen, Zitan and Tamo, Itzhak "A construction of maximally recoverable codes" Designs, Codes and Cryptography , v.90 , 2022 https://doi.org/10.1007/s10623-022-01020-8 Citation Details
Barg, Alexander and Glazyrin, Alexey and Kao, Wei-Jiun and Lai, Ching-Yi and Tseng, Pin-Chieh and Yu, Wei-Hsuan "On the size of maximal binary codes with 2, 3, and 4 distances" Combinatorial Theory , v.4 , 2024 https://doi.org/10.5070/C64163844 Citation Details
Barg, Alexander and Schwartz, Moshe and Yohananov, Lev "Storage Codes on Coset Graphs with Asymptotically Unit Rate" Combinatorica , 2024 https://doi.org/10.1007/s00493-024-00114-2 Citation Details
Barg, Alexander and Skriganov, Maxim "Bounds for discrepancies in the Hamming space" Journal of Complexity , v.65 , 2021 https://doi.org/10.1016/j.jco.2021.101552 Citation Details

PROJECT OUTCOMES REPORT

Disclaimer

This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.

In many problems of computer science and engineering, data is represented using binary bit strings. For the purposes of reliable communication or storage, these strings are often supplemented with parity checks, enabling the recovery of the data corrupted by noise. A range of problems in communication and cryptography further calls for the encoded vectors to be distributed uniformly in the space of all such vectors. This project studied properties and limitations of binary codes from the perspective of the uniform distribution.


One such group of problems relates to reliable transmission of the data over a public channel which at the same time should be protected from unauthorized actors seeking to acquire the transmitted information. If those actors observe a uniformly distributed data stream, they are unable to gain any knowledge of the protected information.


This project explored limitations of attainable rates of communication under the condition that the encoded data be close to the uniform distribution on the set of binary strings. Utilizing a range of measures of closeness, the project introduced new techniques to find such rates, and gave precise answers in a broad range of cases. The results established in the project rely on mathematical proofs that involve new tools within information and coding theory to complete the arguments. These results also connect the problem of uniform distributions with several previously unrelated methods in combinatorial mathematics and information theory, developing a deeper understanding of the methods and their applicability range.


The project has additionally explored applications of the methods developed in it in problems of computer science motivated by cryptographic applications. One of these problems, known as hashing, enters many cryptographic primitives underlying a wide range of secure communication applications such as financial transactions or data storage. The specific results obtained in the project design new tools to used to derive sharper guarantees for uniformity of the encoded data irrespective of the initial conditions for the source that generates it. They advance the state of the art in the problem of linear hashing and related tasks of applied cryptography.


The results of the project were broadly disseminated in the form of online preprints, published or submitted journal papers, and multiple talks at colloquia and conferences over the 3-year period. The project also supported Ph.D. research of a student who will graduate with a doctoral degree within the next academic year. Presentation of the results has led to new collaborations between the PI and his students and colleagues from other universities that will likely extend beyond the term of this project.


Last Modified: 08/31/2024
Modified by: Alexander Barg

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page