Award Abstract # 1827993
Standard Grant: Productive Ambiguity in Classification

NSF Org: SES
Division of Social and Economic Sciences
Recipient: ARIZONA STATE UNIVERSITY
Initial Amendment Date: August 10, 2018
Latest Amendment Date: June 17, 2019
Award Number: 1827993
Award Instrument: Standard Grant
Program Manager: Frederick Kronz
SES
 Division of Social and Economic Sciences
SBE
 Directorate for Social, Behavioral and Economic Sciences
Start Date: August 1, 2018
End Date: July 31, 2021 (Estimated)
Total Intended Award Amount: $158,162.00
Total Awarded Amount to Date: $158,162.00
Funds Obligated to Date: FY 2018 = $158,162.00
History of Investigator:
  • Beckett Sterner (Principal Investigator)
    beckett.sterner@asu.edu
  • Manfred Laubichler (Co-Principal Investigator)
  • Nico Franz (Co-Principal Investigator)
  • Joeri Witteveen (Co-Principal Investigator)
  • Liz Lerman (Co-Principal Investigator)
  • Joeri Witteveen (Former Co-Principal Investigator)
Recipient Sponsored Research Office: Arizona State University
660 S MILL AVENUE STE 204
TEMPE
AZ  US  85281-3670
(480)965-5479
Sponsor Congressional District: 04
Primary Place of Performance: Arizona State University
P.O. Box 876011
Tempe
AZ  US  85287-6011
Primary Place of Performance
Congressional District:
04
Unique Entity Identifier (UEI): NTLHJXM55KZ6
Parent UEI:
NSF Program(s): IIS Special Projects,
Information Technology Researc,
STS-Sci, Tech & Society
Primary Program Source: 01001819DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 1353, 075Z
Program Element Code(s): 748400, 164000, 760300
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.075

ABSTRACT

This is a project in the history and philosophy of biology that has very substantial ramifications for data-intensive science. The PI will investigate how the history of taxonomy can shed new light on the value of ambiguity for science in the domain of data-intensive science. The project focuses on detecting trade-offs in the value of ambiguity for scientific language as a function of changing social contexts. Accurate disambiguation relies on a shared background of knowledge and abilities, which may prove inadequate as concepts spread into new contexts or a community grows larger and more heterogeneous. The project will fund graduate and undergraduate research assistants to analyze a text corpus drawn from two centuries of history in biological taxonomy. It will also support public events and the creation of educational materials addressing the theme of productive ambiguity in naming and classification.

This project will implement an integrative conceptual framework enabling empirical investigation of ambiguity in linguistic settings. It will use an information-theoretic framework from cognitive pragmatics to quantify ambiguity in a way that is open-ended enough to accommodate a wide range of phenomena shaping human language and communication while also reflecting the specific constraints required by computers. It will provide novel tools for tracking changes in language at the level of populations rather than individuals while remaining sensitive to underlying social institutions and individual differences. Results from this project will open new perspectives on the history of types and subspecies in systematics, and it will inform contemporary debates about the virtues of maximal determinacy in the computational representation of human language and meaning.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Franz, Nico and Gilbert, Edward and Sterner, Beckett "Distributed, but Global in Reach: Outline of a de-centralized paradigm for biodiversity data intelligence" Biodiversity Information Science and Standards , v.3 , 2019 https://doi.org/10.3897/biss.3.37749 Citation Details
Franz, Nico and Sterner, Beckett and Upham, Nathan and Cortés Hernández, Kevin "Redesigning the Trading Zone between Systematics and Conservation: Insights from Malagasy mouse lemur classifications, 1982 to present" Biodiversity Information Science and Standards , v.4 , 2020 https://doi.org/10.3897/biss.4.59234 Citation Details
Sen, Atriya and Sterner, Beckett and Franz, Nico and Powel, Caleb and Upham, Nathan "Combining Machine Learning & Reasoning for Biodiversity Data Intelligence" Proceedings of the AAAI Conference on Artificial Intelligence , v.35 , 2021 https://doi.org/10.1609/aaai.v35i17.17750 Citation Details
Sterner, Beckett and Elliott, Steve and Upham, Nate and Franz, Nico "Bats, objectivity, and viral spillover risk" History and Philosophy of the Life Sciences , v.43 , 2021 https://doi.org/10.1007/s40656-021-00366-x Citation Details
Sterner, Beckett and Upham, Nathan and Gupta, Prashant and Powell, Caleb and Franz, Nico "Wanted: Standards for FAIR taxonomic concept representations and relationships " Biodiversity Information Science and Standards , v.5 , 2021 https://doi.org/10.3897/biss.5.75587 Citation Details
Sterner, Beckett and Witteveen, Joeri and Franz, Nico "Coordinating dissent as an alternative to consensus classification: insights from systematics for bio-ontologies" History and Philosophy of the Life Sciences , v.42 , 2020 10.1007/s40656-020-0300-z Citation Details
Sterner, Beckett W. and Gilbert, Edward E. and Franz, Nico M. "Decentralized but Globally Coordinated Biodiversity Data" Frontiers in Big Data , v.3 , 2020 https://doi.org/10.3389/fdata.2020.519133 Citation Details

PROJECT OUTCOMES REPORT

Disclaimer

This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.

This project brought together an interdisciplinary team of researchers in history and philosophy of science, computational humanities, biology, and the arts. The central topic of the project was how ambiguity can be productive in scientific research, focusing specifically on ambiguity in taxonomic classifications. The project investigated how the history of taxonomy can shed new light on the value of ambiguity for science in the domain of data-intensive science, complementing existing research on figurative language in science. The project focused in particular on detecting trade-offs in the value of ambiguity for scientific language as a function of changing social contexts. The broader theme of productive ambiguity in the setting of computational data processing has contemporary relevance far beyond current challenges in biology: the idea that we should eliminate ambiguity wherever possible remains entrenched in many efforts to develop controlled vocabularies or ontologies for the Semantic Web today, including in the digital humanities and library sciences.

Despite disruptions to the team’s ability to collaborate during the Covid-19 pandemic, the project achieved major outcomes in each of main areas listed above: it synthesized prior interdisciplinary knowledge about productive ambiguity to demonstrate and evaluate its significance for science, created innovative computational methods for measuring ambiguity in scientific texts, and created new courses and teaching materials engaging ambiguity as a theme at the intersection of art and science. The project has supported eight peer-reviewed publications in history and philosophy of science, artificial intelligence, and biodiversity data science. Several more manuscripts are currently under peer review at journals in the fields of knowledge representation and scientometrics. Results were also disseminated through presentations at leading international conferences in the fields of philosophy of science and biodiversity data science. PI Sterner is continuing to develop methods and empirical results created during the project through new and ongoing collaborations.

Broader impacts for the project were achieved through research training of students, a new course offered by PI Sterner and Co-PI Lerman, collaboration on a new dance composition addressing biodiversity loss led by Co-PI Lerman, and contributions to improving the reusability of scientific articles in ecology and evolution. The project supported mentorship and training of 10 undergraduate and graduate students in diverse fields such as philosophy of science, computer science, and sustainability. Sterner and Lerman collaborated on designing and teaching a new course, Creative Tools for Saving Biodiversity, that introduced undergraduate and graduate students in the arts and sciences to techniques for creative thinking and research focused on addressing wicked problems such as global biodiversity loss. Sterner and Co-PI Franz also collaborated with Lerman on a new dance composition she is developing, “Wicked Bodies,” which engages themes related to the extinction of biological species and how different sorts of knowledge figure into a sustainable relationship between people and nature. The composition was originally scheduled to be performed during the project but has been delayed due to the Covid-19 pandemic. A further outcome of project research was to apply a computational method developed for author name disambiguation on large collections of scientific texts in order to correctly identify authors as people across publications where names are duplicated or vary. In a manuscript now under review, the project reports accuracy of over 90% for disambiguating authors on a collection of more than 600,000 scientific articles that have not previously been analyzed. These results will be made open access online and Sterner is pursuing new collaborations to make them available through key repositories such as WikiData.


Last Modified: 11/24/2021
Modified by: Beckett Sterner

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page