Skip to feedback

Award Abstract # 2317233
Collaborative Research: SaTC: CORE: Medium: Differentially Private SQL with flexible privacy modeling, machine-checked system design, and accuracy optimization

NSF Org: CNS
Division Of Computer and Network Systems
Recipient: THE RESEARCH FOUNDATION FOR THE STATE UNIVERSITY OF NEW YORK
Initial Amendment Date: January 2, 2024
Latest Amendment Date: January 2, 2024
Award Number: 2317233
Award Instrument: Continuing Grant
Program Manager: Karen Karavanic
kkaravan@nsf.gov
 (703)292-2594
CNS
 Division Of Computer and Network Systems
CSE
 Directorate for Computer and Information Science and Engineering
Start Date: May 1, 2024
End Date: April 30, 2028 (Estimated)
Total Intended Award Amount: $335,584.00
Total Awarded Amount to Date: $167,158.00
Funds Obligated to Date: FY 2024 = $167,158.00
History of Investigator:
  • Zeyu Ding (Principal Investigator)
    dding1@binghamton.edu
Recipient Sponsored Research Office: SUNY at Binghamton
4400 VESTAL PKWY E
BINGHAMTON
NY  US  13902
(607)777-6136
Sponsor Congressional District: 19
Primary Place of Performance: SUNY at Binghamton
4400 VESTAL PKWY E
BINGHAMTON
NY  US  13902-4400
Primary Place of Performance
Congressional District:
19
Unique Entity Identifier (UEI): NQMVAAQUFU53
Parent UEI: L9ZDVULCHCV3
NSF Program(s): Secure &Trustworthy Cyberspace
Primary Program Source: 01002425DB NSF RESEARCH & RELATED ACTIVIT
01002627DB NSF RESEARCH & RELATED ACTIVIT

01002728DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 7924, 025Z
Program Element Code(s): 806000
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.070

ABSTRACT

Companies and government agencies maintain large databases crucial to their operations. Such databases contain sensitive information about people's interactions with state and local agencies (e.g., tax filings, travel data) or interactions with companies (e.g., customer profiles and purchase histories, employee salary and tax data, and performance reviews). However, such databases also have immense value for analytics that can be used to improve internal operations, guide policy decisions, and provide aggregate information about society. "Formal Privacy" is a scientific field that studies how to inject noise into analyses to protect confidential information without adversely affecting the utility of the analyses. However, existing technology is difficult to apply and requires significant technical expertise. The goal, and broader significance and importance of this project are to democratize access to advanced formal privacy tools. The project's novelties are (1) a customizable privacy model for capturing different privacy concerns in a database and (2) automated tools that reason about how much noise must be injected into a data analysis to satisfy these confidentiality concerns without adversely affecting the analysis results.

Prior work used simple, pre-specified privacy models that severely limited the types of applications that can be supported and required significant technical expertise in the design of those systems to obtain accurate query answers. The project team develops a middleware application for SQL databases consisting of (1) automated tools for analyzing a database schema and interactively developing a privacy model of which data elements need the plausible deniability of differential privacy variations and (2) automated tools for reasoning about SQL queries and customize privacy-preserving query execution plans to the privacy model that is most appropriate for the data. The end result is an open-source, customizable, privacy-preserving database analytics system compatible with existing SQL databases.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page