NSF Award Search: Award # 1618244

Award Abstract # 1618244

III: Small: Quantifying Multifaceted Perception Dynamics in Online Social Networks

NSF Org:	IIS Division of Information & Intelligent Systems
Recipient:	ILLINOIS INSTITUTE OF TECHNOLOGY
Initial Amendment Date:	July 27, 2016
Latest Amendment Date:	April 25, 2017
Award Number:	1618244
Award Instrument:	Standard Grant
Program Manager:	Sylvia Spengler sspengle@nsf.gov (703)292-7347 IIS Division of Information & Intelligent Systems CSE Directorate for Computer and Information Science and Engineering
Start Date:	September 1, 2016
End Date:	May 31, 2020 (Estimated)
Total Intended Award Amount:	$471,992.00
Total Awarded Amount to Date:	$471,992.00
Funds Obligated to Date:	FY 2016 = $471,992.00
History of Investigator:	Aron Culotta (Principal Investigator) aculotta@tulane.edu Jennifer Cutler (Former Co-Principal Investigator)
Recipient Sponsored Research Office:	Illinois Institute of Technology 10 W 35TH ST CHICAGO IL US 60616-3717 (312)567-3035
Sponsor Congressional District:	01
Primary Place of Performance:	Illinois Institute of Technology Chicago IL US 60616-3717
Primary Place of Performance Congressional District:	01
Unique Entity Identifier (UEI):	E2NDENMDUEG8
Parent UEI:
NSF Program(s):	Info Integration & Informatics
Primary Program Source:	01001617DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s):	7364, 7923
Program Element Code(s):	736400
Award Agency Code:	4900
Fund Agency Code:	4900
Assistance Listing Number(s):	47.070

ABSTRACT

Measuring public perceptions and how they change over time is a central problem in marketing, public health, and politics. Traditional measurement methods rely on surveys and focus groups, which can be costly and time-consuming. Online social networks offer an attractive alternative: real-time perceptions can be estimated from public, online activity and compared with an entity's communications to quantify how public messaging affects perception. While prior algorithmic approaches rely purely on text-based sentiment analysis, this project will develop novel methods based on the insight that an entity's online social connections are indicative of how they are perceived (e.g., "birds of a feather flock together"). Thus, rather than typical one-dimensional measures of sentiment, the project will instead investigate public perception with respect to multiple characteristics of an entity (e.g., is it seen as pro-environment, pro-health, etc.). A multi-faceted evaluation will be performed to study the phenomenon of "greenwashing," a deceptive marketing practice in which firms market their products or policies as more environmentally friendly than they truly are. This project has the potential to enhance consumer protection by exposing deceptive marketing practices.

The project will develop social network analysis algorithms to assess perception of an entity and also language processing algorithms to quantify the communications of an entity with respect to a perceptual attribute. The approaches to both problems rely on innovative algorithms to measure the strengths of the social and linguistic relations between public entities and exemplar accounts that typify the perceptual attribute of interest. A key advantage of the approach is its minimal requirement of human input, e.g., given only a single keyword like "environment," the approach identifies suitable exemplars and fits linguistic and perceptual models. The project will develop novel machine learning methods for domain adaptation, positive-unlabeled learning, and learning from label proportions in order to fit such models and ensure they are robust to omitted variable bias. The models will be evaluated using public Twitter and Facebook data to quantify the relationship between the perceptions and online communications of brands and other public entities, with a particular focus on identifying cases of greenwashing.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

(Showing: 1 - 10 of 11)

Show All

Aron Culotta and Jennifer Cutler "Mining brand perceptions from Twitter social networks" Marketing Science , v.35 , 2016 , p.343

Ehsan Ardehaly and Aron Culotta "Co-training for Demographic Classification Using Deep Learning from Label Proportions" ICDM ACUMEN Workshop , 2017

Ehsan Ardehaly and Aron Culotta "Learning from noisy label proportions for classifying online social data" Social Network Analysis and Mining , v.8 , 2018

Ehsan Ardehaly and Aron Culotta "Mining the Demographics of Political Sentiment from Twitter Using Learning from Label Proportions" ICDM , 2017

Jennifer Cutler and Aron Culotta "Using online social networks to measure consumers? brand perception" Applied Marketing Analytics , v.2 , 2017 , p.312

Jennifer Cutler and Aron Culotta "Using weak supervision to scale the development of machine learning models for social media-based marketing research" Applied Marketing Analytics , 2019

Shreesh Kumara Bhat and Aron Culotta "Identifying leading indicators of product recalls from online reviews using positive unlabeled learning and domain adaptation" International Conference on Weblogs and Social Media , 2017

Tung Nguyen, Li Zhang, Aron Culotta "Estimating Tie Strength in Follower Networks to Measure Brand Perceptions" ASONAM/FAB , 2019

Zhao Wang and Aron Culotta "Are Words Commensurate with Actions? Quantifying Commitment to A Cause from Online Public Messaging" IEEE International Conference on Data Mining ACUMEN Workshop , 2017

Zhao Wang and Aron Culotta "When do words matter? Understanding the impact of lexical choice on audience perception using individual treatment effect estimation" AAAI , 2019

Zhao Wang and Aron Culotta "When do words matter? Understanding the Impact of Lexical Choice on Audience Perception using Individual Treatment Effect Estimation" AAAI , 2019

(Showing: 1 - 10 of 11)

Show All

PROJECT OUTCOMES REPORT

Disclaimer

This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.

Measuring public perceptions and how they change over time is a central problem in marketing, public health, and politics. Traditional measurement methods rely on surveys and focus groups, which can be costly and time-consuming. Online social networks offer an attractive alternative: real-time perceptions can be estimated from public, online activity and compared with an entity's communications to quantify how public messaging affects perception. While prior algorithmic approaches rely purely on text-based sentiment analysis, the project developed novel methods based on the insight that an entity's online social connections are indicative of how they are perceived (e.g., "birds of a feather flock together"). Thus, rather than typical one-dimensional measures of sentiment, the project investigated public perception with respect to multiple characteristics of an entity (e.g., is it seen as pro-environment, pro-health, etc.). As a use case, this project examined "greenwashing," a deceptive marketing practice in which firms market their products or policies as more environmentally friendly than they truly are.

The project developed social network analysis algorithms to assess perception of an entity and also language processing algorithms to quantify the communications of an entity with respect to a perceptual attribute. The approaches to both problems rely on innovative algorithms to measure the strengths of the social and linguistic relations between public entities and exemplar accounts that typify the perceptual attribute of interest. A key advantage of the approach is its minimal requirement of human input --- e.g., given only a single keyword like "environment," the approach identifies suitable exemplars and fits linguistic and perceptual models.

There are a number of broader, technical contributions of this project that have general applicability to other areas of machine learning and text classification:

1. Classifier Robustness: While machine learning has made great strides recently, automated classification algorithms still make seemingly simple mistakes. Drawing on a long line of research in causal inference, this project has developed a number of novel classification methods that have increased the robustness of such methods on instances that differ somewhat from the original training instances. For example, if a human makes a minor edit to a sentence, the classifier should still be able to classify it correctly. The resulting methods are less susceptible to spurious correlations than existing approaches, not only increasing robustness but also improving trust in autonomous systems.

2. Alternative training methods for machine learning: Traditional machine learning requires many thousands, if not millions, of training examples, which can be expensive to maintain and can quickly become outdated. A key part of this project has been the development of weakly supervised learning methods that operate using cheaper, more readily available types of supervision. For example, the fact that 80% of a set of instances are positive is often easier to collect than labeling every single instance individually. The methods developed have been shown to achieve comparable accuracy using this sort of supervision as methods that require more expensive supervision, which expands the types of problems that can be solved with machine learning.

3. Combining social networks and text classification: Understanding human language is difficult to do without understanding the social context in which the communications occur. With online social networks, we can observe not only the language used, but also how two users are connected socially. By combining these perspectives, we have found that we can construct more accurate models of the intent and semantics of online communications.

Last Modified: 09/29/2020
Modified by: Aron Culotta

Please report errors in award information by writing to: awardsearch@nsf.gov.

Success

Error