
NSF Org: |
OAC Office of Advanced Cyberinfrastructure (OAC) |
Recipient: |
|
Initial Amendment Date: | August 27, 2007 |
Latest Amendment Date: | August 27, 2007 |
Award Number: | 0721667 |
Award Instrument: | Standard Grant |
Program Manager: |
Kevin Thompson
kthompso@nsf.gov (703)292-4220 OAC Office of Advanced Cyberinfrastructure (OAC) CSE Directorate for Computer and Information Science and Engineering |
Start Date: | August 15, 2007 |
End Date: | July 31, 2011 (Estimated) |
Total Intended Award Amount: | $212,000.00 |
Total Awarded Amount to Date: | $212,000.00 |
Funds Obligated to Date: |
|
History of Investigator: |
|
Recipient Sponsored Research Office: |
600 FORBES AVENUE PITTSBURGH PA US 15282 (412)396-1537 |
Sponsor Congressional District: |
|
Primary Place of Performance: |
600 FORBES AVENUE PITTSBURGH PA US 15282 |
Primary Place of
Performance Congressional District: |
|
Unique Entity Identifier (UEI): |
|
Parent UEI: |
|
NSF Program(s): | SOFTWARE DEVELOPEMENT FOR CI |
Primary Program Source: |
|
Program Reference Code(s): |
|
Program Element Code(s): |
|
Award Agency Code: | 4900 |
Fund Agency Code: | 4900 |
Assistance Listing Number(s): | 47.070 |
ABSTRACT
A Modular Software Framework for Evaluation, Testing, and Cross-Fertilization of Authorship Attribution Techniques
PI: Patrick M. Juola, PhD, Duquesne University
Documents do not speak only of their contents; to a trained eye, they can also say much about their author. The field of ``authorship attribution'' in humanities scholarship has been attending to this for centuries, trying to determine how and to what accuracy the author of a document can be determined. Recent developments in corpus linguistics have shown it to be possible to make these determinations automatically by ``non-traditional'' methods, essentially statistical investigations of the words, phrases, layout, and other features of the document.
Unfortunately, the current state-of-the-art is a confused collection of proposed methods, with little guidance about which methods work, why they work, and under what conditions they work best. We are addressing this by developing a modular software framework (using a theoretical model proposed by Juola[23]) to perform this task in a modular design that permits easy swapping of functional components in cross-combination.
By applying a rigorous testing method to the resulting set of (novel) combinations, the project is establishing accuracy benchmarks for various techniques (under the various testing conditions), finding new combinations resulting in improved techniques, and creating "best practices."
PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH
Note:
When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external
site maintained by the publisher. Some full text articles may not yet be available without a
charge during the embargo (administrative interval).
Some links on this page may take you to non-federal websites. Their policies may differ from
this site.
Please report errors in award information by writing to: awardsearch@nsf.gov.