
NSF Org: |
IIS Division of Information & Intelligent Systems |
Recipient: |
|
Initial Amendment Date: | January 30, 2006 |
Latest Amendment Date: | September 28, 2009 |
Award Number: | 0542881 |
Award Instrument: | Continuing Grant |
Program Manager: |
Maria Zemankova
IIS Division of Information & Intelligent Systems CSE Directorate for Computer and Information Science and Engineering |
Start Date: | March 1, 2006 |
End Date: | February 28, 2009 (Estimated) |
Total Intended Award Amount: | $0.00 |
Total Awarded Amount to Date: | $159,545.00 |
Funds Obligated to Date: |
FY 2007 = $92,456.00 FY 2008 = $172,000.00 |
History of Investigator: |
|
Recipient Sponsored Research Office: |
ONE CASTLE POINT ON HUDSON HOBOKEN NJ US 07030-5906 (201)216-8762 |
Sponsor Congressional District: |
|
Primary Place of Performance: |
ONE CASTLE POINT ON HUDSON HOBOKEN NJ US 07030-5906 |
Primary Place of
Performance Congressional District: |
|
Unique Entity Identifier (UEI): |
|
Parent UEI: |
|
NSF Program(s): |
Info Integration & Informatics, COLLABORATIVE SYSTEMS |
Primary Program Source: |
app-0107 01000809DB NSF RESEARCH & RELATED ACTIVIT 01000910DB NSF RESEARCH & RELATED ACTIVIT 01001011DB NSF RESEARCH & RELATED ACTIVIT |
Program Reference Code(s): |
|
Program Element Code(s): |
|
Award Agency Code: | 4900 |
Fund Agency Code: | 4900 |
Assistance Listing Number(s): | 47.070 |
ABSTRACT
Data quality problem is of great importance due to the emergence of large volumes of data. Many business and industrial applications critically rely on the quality of information stored in diverse databases and data warehouses. The goal of this research project is to develop a systematic methodology of data quality analysis and improvement to achieve robust decision making under imperfect information environments. The project develops a unified framework for data quality assessment and evaluation, deliveries practical solutions to improve data quality through information production and management, and disseminates research findings by maintaining a website to increase the awareness of information quality among academic and industrial professionals. The approach consists of developing Bayesian network models to capture inter-relationships between data quality metrics, applying statistical sampling schemes and data mining methods for data quality assessment, and generalizing statistical techniques for root cause identification and data quality improvement. The techniques are evaluated and validated using synthetic examples and real-life cases from telecommunication and information technologies (IT) industries. The outcomes of the project are expected to be generic and provide a concrete basis of data quality management that can be applied to different data-intensive applications. The project will have broad impacts on advanced theory and methodology of information quality management, enhanced decision making, and the creation of a workforce of data quality assurance researchers and practitioners. Knowledge gained and results obtained from this research project will be broadly disseminated via Internet (http://www.stevens.edu/engineering/seem/Research/projects/DataQuality.html), in conferences, workshops, and various levels of courses.
PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH
Note:
When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external
site maintained by the publisher. Some full text articles may not yet be available without a
charge during the embargo (administrative interval).
Some links on this page may take you to non-federal websites. Their policies may differ from
this site.
Please report errors in award information by writing to: awardsearch@nsf.gov.