text-only page produced automatically by Usablenet Assistive Skip all navigation and go to page content Skip top navigation and go to directorate navigation Skip top navigation and go to page navigation
National Science Foundation
Awards
design element
Search Awards
Recent Awards
Presidential and Honorary Awards
About Awards
Grant Policy Manual
Grant General Conditions
Cooperative Agreement Conditions
Special Conditions
Federal Demonstration Partnership
Policy Office Website



Award Abstract #1127567

SBIR Phase II: Software to Automate the Detection of Websites that are Fraudulent or Otherwise Harmful to Consumers

NSF Org: IIP
Division of Industrial Innovation and Partnerships
divider line
Initial Amendment Date: August 29, 2011
divider line
Latest Amendment Date: September 25, 2012
divider line
Award Number: 1127567
divider line
Award Instrument: Standard Grant
divider line
Program Manager: Glenn H. Larsen
IIP Division of Industrial Innovation and Partnerships
ENG Directorate for Engineering
divider line
Start Date: September 1, 2011
divider line
End Date: February 28, 2014 (Estimated)
divider line
Awarded Amount to Date: $600,000.00
divider line
Investigator(s): Michael Lai fastlane@sitejabber.com (Principal Investigator)
divider line
Sponsor: GGL Projects, Inc.
3150 18th Street
San Francisco, CA 94110-2076 (415)894-5806
divider line
NSF Program(s): STTR PHASE II,
SMALL BUSINESS PHASE II
divider line
Program Reference Code(s): 169E, 5373, 8032
divider line
Program Element Code(s): 1591, 5373

ABSTRACT

This Small Business Innovation Research (SBIR) Phase II project will develop software to automatically detect a broad spectrum of websites that are fraudulent or otherwise harmful to consumers. Much work has been done on specific software capable of detecting websites hosting malware or engaged in phishing. However, software does not yet exist which can detect a broader array of harmful websites, including those selling counterfeits, selling illegal drugs, and hosting weight-loss scams, to name just a few. The challenge in doing this involves selecting the right features of fraudulent sites which in isolation or combination are good indictors of a site's harmfulness. Using these features, a machine learning classifier can be trained using data on known harmful websites. Unknown websites can then be run through the classifier to evaluate their potential for harm. Additional challenges involve gathering sufficient data to properly train the classifier, making the classifier general enough to detect a range of harmful sites while still maintaining accuracy, and updating the classifier in real-time such that it can improve with ongoing human feedback and additional data.

The principal impact of this project is the protection of consumers from online fraud. Today, consumers lack reliable resources to evaluate unfamiliar websites. Most use familiar sites like Amazon or take a gamble on Google search results. These gambles frequently result in fraud. It is believed that there are now over 250 million websites and $100 billion lost yearly to online fraud. While the statistics cover many types of fraud, examples of risky sites include online counterfeiters, pharmacies, and retailers. The software developed in this project will greatly improve transparency around websites and protect millions from fraud. The technical achievements in this project involve the use of a vector space model in converting non-discrete features of fraudulent sites into useful data that can be inputted into a machine learning classifier. Additionally, this technology will include innovative feature choices, access to high-quality data, and the creation of a general classifier capable of improving itself in real-time and detecting a broad array of heretofore undetectable fraudulent sites.

 

Please report errors in award information by writing to: awardsearch@nsf.gov.

 

 

Print this page
Back to Top of page
  RESEARCH AREAS   FUNDING   AWARDS   DOCUMENT LIBRARY   NEWS   ABOUT NSF  
Website Policies  |  Budget and Performance  |  Inspector General  |  Privacy  |  FOIA  |  No FEAR Act  |  USA.gov
Accessibility  |  Plain Language  |  Contact
National Science Foundation Logo
National Science Foundation, 2415 Eisenhower Avenue, Alexandria, Virginia 22314, USA
Tel: (703) 292-5111, FIRS: (800) 877-8339 | TDD: (800) 281-8749
  Text Only Version