Award Abstract # 1831481
RIDIR: Collaborative Research: Integrated Communication Database and Computational Tools

NSF Org: SMA
SBE Office of Multidisciplinary Activities
Recipient: THE LELAND STANFORD JUNIOR UNIVERSITY
Initial Amendment Date: September 5, 2018
Latest Amendment Date: September 5, 2018
Award Number: 1831481
Award Instrument: Standard Grant
Program Manager: Sara Kiesler
skiesler@nsf.gov
 (703)292-8643
SMA
 SBE Office of Multidisciplinary Activities
SBE
 Directorate for Social, Behavioral and Economic Sciences
Start Date: September 15, 2018
End Date: August 31, 2022 (Estimated)
Total Intended Award Amount: $209,486.00
Total Awarded Amount to Date: $209,486.00
Funds Obligated to Date: FY 2018 = $209,486.00
History of Investigator:
  • Jennifer Pan (Principal Investigator)
    jp1@stanford.edu
Recipient Sponsored Research Office: Stanford University
450 JANE STANFORD WAY
STANFORD
CA  US  94305-2004
(650)723-2300
Sponsor Congressional District: 16
Primary Place of Performance: Stanford University
CA  US  94305-2004
Primary Place of Performance
Congressional District:
16
Unique Entity Identifier (UEI): HJD6G4D6TJY5
Parent UEI:
NSF Program(s): Secure &Trustworthy Cyberspace
Primary Program Source: 01001819DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 025Z, 026Z, 062Z, 065Z, 7433, 7434, 8083, 9178, 9179
Program Element Code(s): 806000
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.075

ABSTRACT

This project will develop an integrated research framework for sociotechnical cybersecurity research and broader investigations of information provenance by behavioral, information, and computer scientists. Currently, researchers are mainly limited to natural language processing of large bodies of online text. This project will make it possible to analyze larger information worlds, including those from such countries as China, and the flow of information, including video and audio information, in newspapers, TV, and online sources. The project addresses a core goal of cybersecurity research, which is to understand the provenance, flow, and termination of information warfare, and censorship.

The project is aimed at constructing an integrated and unified information database that combines mass communication data from TV and print sources from six locations, with data from two popular online communication platforms. The project will generate a variety of metadata and time series data on topics, actors, events, and sentiments presented in communications by automated multimodal content analysis using text, image, video, and audio. Variables will be linked to identify trajectories of information flow between communication channels through multiple platforms. It will develop a new class of computational models and algorithms that can automatically analyze both verbal and nonverbal communications data by machine learning, computer vision, deep learning, and natural language processing. This project will allow researchers across the computational and social sciences to access the metadata and time series data through a search interface for qualitative research, a statistical package for quantitative research, and various visualization tools. This project will therefore link previously untapped data sources using cutting-edge computational methods to enable scholars to conduct systematic research on large-scale patterns in the emerging information and communication ecosystem.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Lu, Yingdan and Pan, Jennifer "Capturing Clicks: How the Chinese Government Uses Clickbait to Compete for Visibility" Political Communication , 2020 10.1080/10584609.2020.1765914 Citation Details
Lu, Yingdan and Pan, Jennifer "The Pervasive Presence of Chinese Government Content on Douyin Trending Videos" Computational communication research , v.4 , 2022 https://doi.org/10.5117/CCR2022.2.002.LU Citation Details
Lu, Yingdan and Schaefer, Jack and Park, Kunwoo and Joo, Jungseock and Pan, Jennifer "How Information Flows from the World to China" The International Journal of Press/Politics , 2022 https://doi.org/10.1177/19401612221117470 Citation Details
Muise, Daniel and Lu, Yingdan and Pan, Jennifer and Reeves, Byron "Selectively localized: Temporal and visual structure of smartphone screen activity across media environments" Mobile Media & Communication , v.10 , 2022 https://doi.org/10.1177/20501579221080333 Citation Details
Zhang, Han and Pan, Jennifer "CASM: A Deep-Learning Approach for Identifying Collective Action Events with Text and Image Data from Social Media" Sociological Methodology , v.49 , 2019 10.1177/0081175019860244 Citation Details

PROJECT OUTCOMES REPORT

Disclaimer

This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.

The main goal of this project was to develop integrated and unified news databases and identify trajectories of inter-media flow through multiple media platforms. We achieved this objective, which in turn allowed us to systematically study, for the first time, the flow of information from the global media ecosystem into China. We develop a semi-automated system that combines deep learning and human annotation to find co-occurring content between global and Chinese social media platforms. We find that approximately one-fourth of content with relevance for China that gains widespread global public attention makes it way to Chinese social media. Chinese state-controlled media and commercialized domestic media play a dominant role in facilitating these inflows of information. However, Weibo users without traditional media or government affiliations are also an important mechanism for transmitting information into China. These results imply that while online censorship combined with control over traditional media provide substantial leeway for the Chinese government to set the agenda, social media provides opportunities for non-institutional actors to influence the information environment. Methodologically, the system we develop offers a new approach for the quantitative analysis of cross- platform and cross-lingual communication.


Additional outcomes generated during this project include the development of a system that uses deep learning on text and images to offline collective action using social media data, advances in knowledge of how the Chinese government uses social media for political propaganda and influence, and how people in very different contexts (US, China, Myanmar) use smartphones to consume media.

 

 

 


Last Modified: 11/09/2022
Modified by: Jennifer Pan

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page