
NSF Org: |
SES Division of Social and Economic Sciences |
Recipient: |
|
Initial Amendment Date: | August 7, 2022 |
Latest Amendment Date: | August 7, 2022 |
Award Number: | 2235006 |
Award Instrument: | Standard Grant |
Program Manager: |
Sara Kiesler
SES Division of Social and Economic Sciences SBE Directorate for Social, Behavioral and Economic Sciences |
Start Date: | October 1, 2022 |
End Date: | September 30, 2024 (Estimated) |
Total Intended Award Amount: | $260,201.00 |
Total Awarded Amount to Date: | $260,201.00 |
Funds Obligated to Date: |
|
History of Investigator: |
|
Recipient Sponsored Research Office: |
237 HIGH ST MIDDLETOWN CT US 06459-3208 (860)685-3683 |
Sponsor Congressional District: |
|
Primary Place of Performance: |
237 HIGH STREET RM 409 MIDDLETOWN CT US 06459-3208 |
Primary Place of
Performance Congressional District: |
|
Unique Entity Identifier (UEI): |
|
Parent UEI: |
|
NSF Program(s): |
Secure &Trustworthy Cyberspace, Human Networks & Data Sci Infr |
Primary Program Source: |
|
Program Reference Code(s): |
|
Program Element Code(s): |
|
Award Agency Code: | 4900 |
Fund Agency Code: | 4900 |
Assistance Listing Number(s): | 47.075 |
ABSTRACT
Nearly $2 billion was spent on digital advertising in the presidential general election in 2020, and online advertising is only expected to grow. Campaigns? rapid shift to digital, however, has caused headaches for those who track political advertising. The data provided by online platforms, such as Facebook and Google, come in different formats, contain different collections that are not immediately comparable, and lack metadata necessary to find all ads related to a specific campaign for office, all ads featuring a specific candidate, or all ads within either set that discuss a specific issue. The sheer volume of content requires the use of computational methods to extract the information necessary to answer even basic descriptive questions such as how much money was spent online in a particular race or across all federal races. This project provides the infrastructure necessary to answer critical questions about the role of digital advertising in American democracy, including the extent to which ?dark money? dominates campaigns, and the spread and reach of misinformation in campaigns.
The project team is building expandable infrastructure to acquire, process, integrate, label, and distribute digital election advertising data from two large online platforms. The result is a centralized repository that provides robust documentation of all procedures and code such that they can be modified to apply to other platforms and contexts for expansion. Cross-platform integration and standardization of digital election advertising data through human and state-of-the-art computational methods reduces costs to individual researchers, provides parallel procedures already in place for analyzing TV advertising (and thus maximizing comparability to existing data), provides linkage information to other data sources, and produces accessible data to benefit the community. Accomplishing this aim is a high risk-high reward endeavor. For instance, new approaches or methods are needed to capture and label relevant advertising by focus, especially when the true sponsor is unclear or disguised.
This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH
Note:
When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external
site maintained by the publisher. Some full text articles may not yet be available without a
charge during the embargo (administrative interval).
Some links on this page may take you to non-federal websites. Their policies may differ from
this site.
PROJECT OUTCOMES REPORT
Disclaimer
This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.
Overview
The Cross-Platform Election Advertising Transparency Initiative (hereafter CREATIVE) pilot infrastructure provides cross-platform integration and standardization of electoral digital advertising data for community use. Election advertising is a key link between candidates and the public. Candidates for federal office in 2022 spent an estimated $150 million on ads on two key platforms: Meta and Google. The rapid shift in election spending to digital, however, has caused headaches for those who track and study advertising. Transparency in digital ad spending and content is difficult to obtain and often illusory in part because campaign finance laws require minimal disclosure for many ad sponsors -- and no disclosure for some -- meaning that content tracking across media is the only viable way of uncovering some advertising activity. The Meta and Google ad libraries are important steps in providing more transparency; however, their data come in different formats with different universes, are not user-friendly for research purposes, lack basic metadata that would enable one to find all ads related to a specific campaign for office, and often require the use of novel computational techniques to extract large-scale data.
CREATIVE enables easy-to-access, shared baseline data for the 2022 midterm general election period, code and information critical to answering vital research questions about elections and democracy, including the role of `dark money' spending and the scope and scale of political misinformation. In addition, CREATIVE infrastructure reduces costs to individual researchers, provides parallel procedures already in place for analyzing TV advertising to maximize comparability to existing data, provides linkage information to other data sources, and has produced cumulative data to benefit the community. The result is a shared knowledge base with the potential for transformative science. Our specific aims included 1) acquiring digital advertising across platforms (including content, spending, impression and sponsor names) along with candidate and sponsor data for the 2022 federal, general election period and 2) processing and integrating data through human and computational methods to produce validated labels for analysis and distribution.
Intellectual Merit
Scholars seeking to analyze election advertising data frequently perform similar tasks to gather and analyze digital advertising activity, and because of the demands of grappling with incomparable universes across platforms that also lack uniform identification for ease of integration, no one has attempted to systematically understand the universe of federal election activity across platforms. Integrated data on digital ads is essential for a complete picture of how candidates, parties, and outside groups appeal to voters and what the effects of these appeals are. Information on how the volume and content of online activity compares within and across platforms -- along with how it compares with television -- is also required to answer long-standing questions about advertising effects, and to understand how online and social media advertising affect democracy, political polarization and inequality. Current piecemeal efforts by individual scholars to analyze digital advertising end up repeating expensive and time-consuming tasks that exacerbate inequalities in who among the research community is able to produce scientific knowledge given that data acquisition, processing and integration is tedious, computing costs are high, and most social scientists lack the necessary computational skills and expertise to address the problem in a comprehensive fashion.
Broader Impacts
As spending on U.S. election advertising continues to increase year after year -- and the number of ads on digital and social media platforms along with the number of sponsors of ad activity explodes -- the ability to analyze ad spending and content in a systematic fashion is crucial. CREATIVE infrastructure provides community data to help accelerate knowledge and add to our understanding of how candidates, parties and groups tailor and distribute their advertising online. Our datasets can be easily connected via linking identifiers with other government and social science data sources that enable scholars to answer questions about the role of `dark money,' the spread and reach of information in campaigns, including disinformation, among other questions. Furthermore, our validation, labeling and standardization of ad data across platforms provides high-quality, labeled textual and audiovisual data along with human validation that is accessible to researchers and may facilitate work in other fields, including in AI.
Last Modified: 01/28/2025
Modified by: Erika Franklin Fowler
Please report errors in award information by writing to: awardsearch@nsf.gov.