Award Abstract # 2235006
Collaborative Research: EAGER: Cross-platform Election Advertising Transparency Initiative

NSF Org: SES
Division of Social and Economic Sciences
Recipient: WESLEYAN UNIVERSITY
Initial Amendment Date: August 7, 2022
Latest Amendment Date: August 7, 2022
Award Number: 2235006
Award Instrument: Standard Grant
Program Manager: Sara Kiesler
SES
 Division of Social and Economic Sciences
SBE
 Directorate for Social, Behavioral and Economic Sciences
Start Date: October 1, 2022
End Date: September 30, 2024 (Estimated)
Total Intended Award Amount: $260,201.00
Total Awarded Amount to Date: $260,201.00
Funds Obligated to Date: FY 2022 = $260,201.00
History of Investigator:
  • Erika Franklin Fowler (Principal Investigator)
    efowler@wesleyan.edu
  • Sebastian Zimmeck (Co-Principal Investigator)
Recipient Sponsored Research Office: Wesleyan University
237 HIGH ST
MIDDLETOWN
CT  US  06459-3208
(860)685-3683
Sponsor Congressional District: 01
Primary Place of Performance: Wesleyan University
237 HIGH STREET RM 409
MIDDLETOWN
CT  US  06459-3208
Primary Place of Performance
Congressional District:
01
Unique Entity Identifier (UEI): ZETJL6DKF963
Parent UEI:
NSF Program(s): Secure &Trustworthy Cyberspace,
Human Networks & Data Sci Infr
Primary Program Source: 01002223DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 7434, 025Z, 065Z, 7916, 107Z, 9102
Program Element Code(s): 806000, 130Y00
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.075

ABSTRACT

Nearly $2 billion was spent on digital advertising in the presidential general election in 2020, and online advertising is only expected to grow. Campaigns? rapid shift to digital, however, has caused headaches for those who track political advertising. The data provided by online platforms, such as Facebook and Google, come in different formats, contain different collections that are not immediately comparable, and lack metadata necessary to find all ads related to a specific campaign for office, all ads featuring a specific candidate, or all ads within either set that discuss a specific issue. The sheer volume of content requires the use of computational methods to extract the information necessary to answer even basic descriptive questions such as how much money was spent online in a particular race or across all federal races. This project provides the infrastructure necessary to answer critical questions about the role of digital advertising in American democracy, including the extent to which ?dark money? dominates campaigns, and the spread and reach of misinformation in campaigns.

The project team is building expandable infrastructure to acquire, process, integrate, label, and distribute digital election advertising data from two large online platforms. The result is a centralized repository that provides robust documentation of all procedures and code such that they can be modified to apply to other platforms and contexts for expansion. Cross-platform integration and standardization of digital election advertising data through human and state-of-the-art computational methods reduces costs to individual researchers, provides parallel procedures already in place for analyzing TV advertising (and thus maximizing comparability to existing data), provides linkage information to other data sources, and produces accessible data to benefit the community. Accomplishing this aim is a high risk-high reward endeavor. For instance, new approaches or methods are needed to capture and label relevant advertising by focus, especially when the true sponsor is unclear or disguised.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Fowler, Erika Franklin and Floyd, Breeze and Zhang, Meiqing and Kim, Yujin and Ridout, Travis N and Oleinikov, Pavel and Franz, Michael M "Election Advertising on Meta, Google, and Snapchat in 2024" The Forum , 2025 https://doi.org/10.1515/for-2025-2011 Citation Details
Franz, Michael M and Zhang, Meiqing and Ridout, Travis N and Oleinikov, Pavel and Yao, Jielu and Cakmak, Furkan and Fowler, Erika Franklin "Quantifying Data-Driven Campaigning Across Sponsors and Platforms" Media and Communication , v.12 , 2024 https://doi.org/10.17645/mac.8577 Citation Details
Ridout, Travis N and Neumann, Markus and Yao, Jielu and Baum, Laura and Franz, Michael M and Oleinikov, Pavel and Franklin_Fowler, Erika "Platform Convergence or Divergence? Comparing Political Ad Content Across Digital and Social Media Platforms" Social Science Computer Review , v.42 , 2024 https://doi.org/10.1177/08944393241258767 Citation Details
Zhang, Meiqing and Cakmak, Furkan and Neumann, Markus and Zimmeck, Sebastian and Oleinikov, Pavel and Yao, Jielu and Yu, Harry and Jacewicz, Aleks and Tassone, Isabella and Floyd, Breeze and Baum, Laura and Franz, Michael_M and Ridout, Travis_N and Fowler "Comparable 2022 General Election Advertising Datasets from Meta and Google" Scientific Data , v.12 , 2025 https://doi.org/10.1038/s41597-025-05228-w Citation Details

PROJECT OUTCOMES REPORT

Disclaimer

This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.

Overview

The Cross-Platform Election Advertising Transparency Initiative (hereafter CREATIVE) pilot infrastructure provides cross-platform integration and standardization of electoral digital advertising data for community use. Election advertising is a key link between candidates and the public. Candidates for federal office in 2022 spent an estimated $150 million on ads on two key platforms:  Meta and Google.  The rapid shift in election spending to digital, however, has caused headaches for those who track and study advertising. Transparency in digital ad spending and content is difficult to obtain and often illusory in part because campaign finance laws require minimal disclosure for many ad sponsors -- and no disclosure for some -- meaning that content tracking across media is the only viable way of uncovering some advertising activity.  The Meta and Google ad libraries are important steps in providing more transparency; however, their data come in different formats with different universes, are not user-friendly for research purposes, lack basic metadata that would enable one to find all ads related to a specific campaign for office, and often require the use of novel computational techniques to extract large-scale data. 

CREATIVE enables easy-to-access, shared baseline data for the 2022 midterm general election period, code and information critical to answering vital research questions about elections and democracy, including the role of `dark money' spending and the scope and scale of political misinformation. In addition, CREATIVE infrastructure reduces costs to individual researchers, provides parallel procedures already in place for analyzing TV advertising to maximize comparability to existing data, provides linkage information to other data sources, and has produced cumulative data to benefit the community. The result is a shared knowledge base with the potential for transformative science. Our specific aims included 1) acquiring digital advertising across platforms (including content, spending, impression and sponsor names) along with candidate and sponsor data for the 2022 federal, general election period and 2) processing and integrating data through human and computational methods to produce validated labels for analysis and distribution. 

 

Intellectual Merit

Scholars seeking to analyze election advertising data frequently perform similar tasks to gather and analyze digital advertising activity, and because of the demands of grappling with incomparable universes across platforms that also lack uniform identification for ease of integration, no one has attempted to systematically understand the universe of federal election activity across platforms. Integrated data on digital ads is essential for a complete picture of how candidates, parties, and outside groups appeal to voters and what the effects of these appeals are. Information on how the volume and content of online activity compares within and across platforms -- along with how it compares with television -- is also required to answer long-standing questions about advertising effects, and to understand how online and social media advertising affect democracy, political polarization and inequality. Current piecemeal efforts by individual scholars to analyze digital advertising end up repeating expensive and time-consuming tasks that exacerbate inequalities in who among the research community is able to produce scientific knowledge given that data acquisition, processing and integration is tedious, computing costs are high, and most social scientists lack the necessary computational skills and expertise to address the problem in a comprehensive fashion.

 

Broader Impacts

As spending on U.S. election advertising continues to increase year after year -- and the number of ads on digital and social media platforms along with the number of sponsors of ad activity explodes -- the ability to analyze ad spending and content in a systematic fashion is crucial. CREATIVE infrastructure provides community data to help accelerate knowledge and add to our understanding of how candidates, parties and groups tailor and distribute their advertising online. Our datasets can be easily connected via linking identifiers with other government and social science data sources that enable scholars to answer questions about the role of `dark money,' the spread and reach of information in campaigns, including disinformation, among other questions. Furthermore, our validation, labeling and standardization of ad data across platforms provides high-quality, labeled textual and audiovisual data along with human validation that is accessible to researchers and may facilitate work in other fields, including in AI.


Last Modified: 01/28/2025
Modified by: Erika Franklin Fowler

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page