
NSF Org: |
IIS Division of Information & Intelligent Systems |
Recipient: |
|
Initial Amendment Date: | August 18, 2018 |
Latest Amendment Date: | August 18, 2018 |
Award Number: | 1836565 |
Award Instrument: | Standard Grant |
Program Manager: |
Rebecca Hwa
IIS Division of Information & Intelligent Systems CSE Directorate for Computer and Information Science and Engineering |
Start Date: | September 1, 2018 |
End Date: | August 31, 2021 (Estimated) |
Total Intended Award Amount: | $98,522.00 |
Total Awarded Amount to Date: | $98,522.00 |
Funds Obligated to Date: |
|
History of Investigator: |
|
Recipient Sponsored Research Office: |
800 WEST CAMPBELL RD. RICHARDSON TX US 75080-3021 (972)883-2313 |
Sponsor Congressional District: |
|
Primary Place of Performance: |
800 W. Campbell Road Richardson TX US 75080-3021 |
Primary Place of
Performance Congressional District: |
|
Unique Entity Identifier (UEI): |
|
Parent UEI: |
|
NSF Program(s): | Robust Intelligence |
Primary Program Source: |
|
Program Reference Code(s): |
|
Program Element Code(s): |
|
Award Agency Code: | 4900 |
Fund Agency Code: | 4900 |
Assistance Listing Number(s): | 47.070 |
ABSTRACT
This project seeks to develop new algorithms and data structures for learning and planning in situations where the environment is represented with a set of relations between objects. Relational representations capture interactions between objects in a succinct and easily interpretable representation. Examples of domains that are well-suited to relational representations includes intelligent drones assisting soldiers, activities in a supply chain management, communication and friendship connections in a social network, and tracking individuals and activities in video. Most recent advances in machine learning and planning, such as so-called "deep neural networks", however, employ simple "flat" representations, where the state of the world is an uninterpreted string of bits. This project will make machine learning and planning methods easier to use and more robust by generalizing them so that they explicitly work with relational models and data. The methods, theory, and data resulting from this proposal will impact the scientific community in several positive ways and will be made publicly available through an appropriate website. The research will be disseminated through refereed journals and conference proceedings and made available to researchers. Code for the proposed algorithms and descriptions of new benchmark problems will also be made publicly available. The investigators will work on organizing workshops and tutorials based on the challenges and findings arising from this project.
Many special purpose solutions have been developed to address small parts of these problems, but there are no
general purpose tools that harness recent advances in machine learning to tackle this family of problems. This proposal seeks to develop such tools, drawing upon the investigators' prior experience in learning relational regression trees and experience in value function approximation for reinforcement learning. In addition, this project seeks to build a bridge between recent advances in deep learning, which generally has not been compatible with relational representations, and recent advances in relational learning.
This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH
Note:
When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external
site maintained by the publisher. Some full text articles may not yet be available without a
charge during the embargo (administrative interval).
Some links on this page may take you to non-federal websites. Their policies may differ from
this site.
PROJECT OUTCOMES REPORT
Disclaimer
This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.
The key outcomes of this project are two-fold -- (1) developing the first set of learnable algorithms for reinforcement learning in structured domains with interacting objects and (2) developing probabilistic inference techniques that exploit underlying symmetries to efficeintly answer probabilistic queries. The outcomes of the two directions are clearly related. The grand vision of the proposal is to solve relational POMDPs i.e., learning to act in stochastic, noisy, partially observable structured domains. To realize this goal, it is important that the agent reasons with partial information which is achieved by lifted probabilistic inference (the PI has co-edited the first book in this topic and the book was released by MIT Press in August 2021). Once the probabilistic inference is performed, the agent has to efffectively use these probabilistic inference results in selecting the best set of actions. This is achieved by developing the efficient relational reinforcement learning methods, which is the first outcome. Our extensive experiments demonstrate that advancement in a single research area such as deep networks is not sufficient to realize the grand vision. Instead, progress in multiple areas such as deep networks, reinforcement learning, symbolic representations, and probabilistic inference, to name a few, are necessary. It is crucial that these different progresses are integrated in an AI system and our project takes a step in that direction.
Collectively, the two outcomes of this project have allowed the group to take the first step towards realizing the grand vision of the proposal in particular and the AI community in general.
Last Modified: 09/15/2021
Modified by: Sriraam Natarajan
Please report errors in award information by writing to: awardsearch@nsf.gov.