
NSF Org: |
SES Division of Social and Economic Sciences |
Recipient: |
|
Initial Amendment Date: | August 10, 2018 |
Latest Amendment Date: | June 17, 2019 |
Award Number: | 1827993 |
Award Instrument: | Standard Grant |
Program Manager: |
Frederick Kronz
SES Division of Social and Economic Sciences SBE Directorate for Social, Behavioral and Economic Sciences |
Start Date: | August 1, 2018 |
End Date: | July 31, 2021 (Estimated) |
Total Intended Award Amount: | $158,162.00 |
Total Awarded Amount to Date: | $158,162.00 |
Funds Obligated to Date: |
|
History of Investigator: |
|
Recipient Sponsored Research Office: |
660 S MILL AVENUE STE 204 TEMPE AZ US 85281-3670 (480)965-5479 |
Sponsor Congressional District: |
|
Primary Place of Performance: |
P.O. Box 876011 Tempe AZ US 85287-6011 |
Primary Place of
Performance Congressional District: |
|
Unique Entity Identifier (UEI): |
|
Parent UEI: |
|
NSF Program(s): |
IIS Special Projects, Information Technology Researc, STS-Sci, Tech & Society |
Primary Program Source: |
|
Program Reference Code(s): |
|
Program Element Code(s): |
|
Award Agency Code: | 4900 |
Fund Agency Code: | 4900 |
Assistance Listing Number(s): | 47.075 |
ABSTRACT
This is a project in the history and philosophy of biology that has very substantial ramifications for data-intensive science. The PI will investigate how the history of taxonomy can shed new light on the value of ambiguity for science in the domain of data-intensive science. The project focuses on detecting trade-offs in the value of ambiguity for scientific language as a function of changing social contexts. Accurate disambiguation relies on a shared background of knowledge and abilities, which may prove inadequate as concepts spread into new contexts or a community grows larger and more heterogeneous. The project will fund graduate and undergraduate research assistants to analyze a text corpus drawn from two centuries of history in biological taxonomy. It will also support public events and the creation of educational materials addressing the theme of productive ambiguity in naming and classification.
This project will implement an integrative conceptual framework enabling empirical investigation of ambiguity in linguistic settings. It will use an information-theoretic framework from cognitive pragmatics to quantify ambiguity in a way that is open-ended enough to accommodate a wide range of phenomena shaping human language and communication while also reflecting the specific constraints required by computers. It will provide novel tools for tracking changes in language at the level of populations rather than individuals while remaining sensitive to underlying social institutions and individual differences. Results from this project will open new perspectives on the history of types and subspecies in systematics, and it will inform contemporary debates about the virtues of maximal determinacy in the computational representation of human language and meaning.
This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH
Note:
When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external
site maintained by the publisher. Some full text articles may not yet be available without a
charge during the embargo (administrative interval).
Some links on this page may take you to non-federal websites. Their policies may differ from
this site.
PROJECT OUTCOMES REPORT
Disclaimer
This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.
This project brought together an interdisciplinary team of researchers in history and philosophy of science, computational humanities, biology, and the arts. The central topic of the project was how ambiguity can be productive in scientific research, focusing specifically on ambiguity in taxonomic classifications. The project investigated how the history of taxonomy can shed new light on the value of ambiguity for science in the domain of data-intensive science, complementing existing research on figurative language in science. The project focused in particular on detecting trade-offs in the value of ambiguity for scientific language as a function of changing social contexts. The broader theme of productive ambiguity in the setting of computational data processing has contemporary relevance far beyond current challenges in biology: the idea that we should eliminate ambiguity wherever possible remains entrenched in many efforts to develop controlled vocabularies or ontologies for the Semantic Web today, including in the digital humanities and library sciences.
Despite disruptions to the team’s ability to collaborate during the Covid-19 pandemic, the project achieved major outcomes in each of main areas listed above: it synthesized prior interdisciplinary knowledge about productive ambiguity to demonstrate and evaluate its significance for science, created innovative computational methods for measuring ambiguity in scientific texts, and created new courses and teaching materials engaging ambiguity as a theme at the intersection of art and science. The project has supported eight peer-reviewed publications in history and philosophy of science, artificial intelligence, and biodiversity data science. Several more manuscripts are currently under peer review at journals in the fields of knowledge representation and scientometrics. Results were also disseminated through presentations at leading international conferences in the fields of philosophy of science and biodiversity data science. PI Sterner is continuing to develop methods and empirical results created during the project through new and ongoing collaborations.
Broader impacts for the project were achieved through research training of students, a new course offered by PI Sterner and Co-PI Lerman, collaboration on a new dance composition addressing biodiversity loss led by Co-PI Lerman, and contributions to improving the reusability of scientific articles in ecology and evolution. The project supported mentorship and training of 10 undergraduate and graduate students in diverse fields such as philosophy of science, computer science, and sustainability. Sterner and Lerman collaborated on designing and teaching a new course, Creative Tools for Saving Biodiversity, that introduced undergraduate and graduate students in the arts and sciences to techniques for creative thinking and research focused on addressing wicked problems such as global biodiversity loss. Sterner and Co-PI Franz also collaborated with Lerman on a new dance composition she is developing, “Wicked Bodies,” which engages themes related to the extinction of biological species and how different sorts of knowledge figure into a sustainable relationship between people and nature. The composition was originally scheduled to be performed during the project but has been delayed due to the Covid-19 pandemic. A further outcome of project research was to apply a computational method developed for author name disambiguation on large collections of scientific texts in order to correctly identify authors as people across publications where names are duplicated or vary. In a manuscript now under review, the project reports accuracy of over 90% for disambiguating authors on a collection of more than 600,000 scientific articles that have not previously been analyzed. These results will be made open access online and Sterner is pursuing new collaborations to make them available through key repositories such as WikiData.
Last Modified: 11/24/2021
Modified by: Beckett Sterner
Please report errors in award information by writing to: awardsearch@nsf.gov.