Award Abstract # 1158659
CAREER: Real-Time Stochastic Optimization with Large Structured Strategy Sets and High-Volume Data Streams

NSF Org: CMMI
Division of Civil, Mechanical, and Manufacturing Innovation
Recipient: UNIVERSITY OF SOUTHERN CALIFORNIA
Initial Amendment Date: September 16, 2011
Latest Amendment Date: August 23, 2012
Award Number: 1158659
Award Instrument: Continuing Grant
Program Manager: Edwin Romeijn
CMMI
 Division of Civil, Mechanical, and Manufacturing Innovation
ENG
 Directorate for Engineering
Start Date: August 15, 2011
End Date: August 31, 2014 (Estimated)
Total Intended Award Amount: $275,425.00
Total Awarded Amount to Date: $275,425.00
Funds Obligated to Date: FY 2009 = $28,648.00
FY 2010 = $79,882.00

FY 2011 = $82,215.00

FY 2012 = $84,680.00
History of Investigator:
  • Paat Rusmevichientong (Principal Investigator)
    rusmevic@marshall.usc.edu
Recipient Sponsored Research Office: University of Southern California
3720 S FLOWER ST FL 3
LOS ANGELES
CA  US  90033
(213)740-7762
Sponsor Congressional District: 34
Primary Place of Performance: University of Southern California
CA  US  90089-1147
Primary Place of Performance
Congressional District:
33
Unique Entity Identifier (UEI): G88KLJR3KYT5
Parent UEI:
NSF Program(s): OPERATIONS RESEARCH
Primary Program Source: 01000910DB NSF RESEARCH & RELATED ACTIVIT
01001011DB NSF RESEARCH & RELATED ACTIVIT

01001112DB NSF RESEARCH & RELATED ACTIVIT

01001213DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 072E, 073E, 077E, 1045, 1187, 9147, 9148, MANU
Program Element Code(s): 551400
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.041

ABSTRACT

This Faculty Early Career Development (CAREER)research proposes to support the development of a modeling and algorithmic framework for solving large-scale real-time stochastic optimization problems. This research will focus on a setting where distributions of the underlying uncertainty are not known. The decision maker in this environment often faces a large number of alternatives and must make a decision in real-time based on high-volume data streams. By exploiting specific structures of each problem, this research will develop methodologies that combine real-time decision-making with approximation algorithms for solving complex stochastic optimization problems having large strategy sets. The research will investigate applications of these algorithms to problems in search-based advertising and supply chain management by working with industry partners. In addition, the outcome of the research will be integrated into an interactive teaching module or a case study.

The proposed research encompasses many problems in the current information-driven environment, including ad placement in search-based advertising services, inventory management for online retailers, and the multi-armed bandit problem. If successful, the research will result in a unifying framework for studying and analyzing this class of problems, along with a suite of algorithms applicable to these problems. The research will establish new techniques for extending the performance guarantee of approximation algorithms to reflect long-run average performance of the system, and provide new methodologies for improving the convergence rates by leveraging special structures within each problem. Extensive collaboration with companies in the industry and the integration of these relationships to enrich the classroom experience for students will be an important outcome of this project.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

(Showing: 1 - 10 of 19)
A. J. Mersereau and P. Rusmevichientong and J. N. Tsitsiklis "A Structured Multiarmed Bandit Problem and the Greedy Policy" IEEE Transactions on Automatic Control , v.54 , 2009 , p.2787-2802
A. J. Mersereau, P. Rusmevicheintong, and J. N. Tsitsiklis "A Structured Multiarmed Bandit Problem and the Greedy Policy" IEEE Transactions on Automatic Control , v.54 , 2009 , p.2787
Broder, Josef; Rusmevichientong, Paat "Dynamic Pricing Under a General Parametric Choice Model" OPERATIONS RESEARCH , v.60 , 2012 , p.965-980
G. Li and P. Rusmevichientong "A Greedy Algorithm for the Two-Level Nested Logit Model" Operations Research Letters , v.42 , 2014 , p.319-324
J. Broder and P. Rusmevichientong "Dynamic Pricing under a General Parametric Choice Model" Operations Research , v.60 , 2012 , p.965-980
N. Golrezaei and H. Nazerzadeh and P. Rusmevichientong "Real-Time Optimization of Personalized Assortments" Management Science , v.60 , 2014 , p.1532-1551
P. Rusmevicheintong and J. N. Tsitsiklis "Linearly Parameterized Bandits" Mathematics of Operations Research , v.35 , 2010 , p.395
P. Rusmevichientong and H. Topaloglu "Robust Assortment Optimization Under the Multinomial Logit Choice Model" Operations Research , v.60 , 2012 , p.865-882
P. Rusmevichientong and J. N. Tsitsiklis "Linearly Parameterized Bandits" Mathematics of Operations Research , v.35 , 2010 , p.395-411
P. Rusmevichientong and Z.-J. Max Shen and D. B. Shmoys "A PTAS for Capacitated Sum-of-Ratios Optimization" Operations Research Letters , v.37 , 2009 , p.230-238
P. Rusmevichientong and Z.-J. Max Shen and D. B. Shmoys "Dynamic Assortment Optimization with a Multinomial Logit Choice Model and Capacity Constraint" Operations Research , v.58 , 2010 , p.1666-1680
(Showing: 1 - 10 of 19)

PROJECT OUTCOMES REPORT

Disclaimer

This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.

The objective of the award is to develop effective algorithms for high-dimensional complex stochastic optimization problems involving large structured decision sets and high-volume data streams, with applications to revenue management, dynamic pricing, and resource allocations.  

 

Intellectual Merits:  During the lifetime of the award, we have developed many innovative algorithms and solution methods for the structured multiarmed bandit problems, dynamic assortment optimization, and the d-level nested logit model.  We will briefly describe each problem and discuss our contributions in each area.

 

Motivated by applications in online advertising, we consider the structured multiarmed bandit problem where the expected payoff of each decision is unknown a priori, and we only observe noisy samples of the reward.    We assume that the mean reward of each decision is a linear function of the decision’s attributes.  For example, if each decision corresponds to an advertisement to be shown to the user, the attributes of the ad might correspond to its content, popularity, or user ratings.    Our goal is to develop an algorithm for choosing a sequence of decisions that maximizes the total expected reward.  Since the mean reward of each decision is unknown, we need to simultaneously estimate the reward of the arms and choose the best decision based on the available information.  In this ``learning and earning’’ environment, we develop a method that achieves the best possible reward, and they have been applied to problems in online advertising and dynamic assortment optimization.  The results are published in IEEE Transaction on Automatic Control, Mathematics of Operations Research, and Operations Research.

 

In the dynamic assortment optimization, we consider the problem of real-time optimization of personalized assortments.  This problem is motivated by the availability of real-time data on customer characteristics, we consider the problem of personalizing the assortment of products to each arriving customer. Through actual sales data from an online retailer, we demonstrate that personalization based on each customer's location can lead to over 10% improvements in revenue, compared to a policy that treats all customers the same.  We propose a family of simple and effective algorithms, called Inventory-Balancing, for real-time personalized assortment optimization, that do not require any forecasting: An Inventory-Balancing algorithm, maintains a "discounted-revenue index" for each product, in which the (actual) revenue is multiplied by a (virtual) discount factor that depends on the fraction of the product's remaining inventory. Upon an arrival of each customer, based on the customer's type, the algorithm offers to her the assortment that maximizes the expected discounted revenue.   In addition to personalization, we have also considered the assortment packing problem in which a firm must decide, in advance, the release date of each product in a given collection over a selling season. This problem is motivated by retailers' frequent introduction of new items to refresh product lines and maintain their market share. Our formulation models the trade-offs among profit margins, preference weights, and limited life cycles.  These works result in 2 papers that have been published in Management Science.

 

Finally, we consider the assortment and price optimization problems under the d-level nested logit model.  We provide a new formulation of the d-level nested logit model using a tree of depth d. Our formulation enables us to study two canonical revenue management problems under this model: finding the optimal assortment assuming fixed prices, and determini...

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page