Award Abstract # 2046972
CAREER: Sign-to-Speech: An Edge-IoT Platform and Software Library for Real Time Sign Language Recognition

NSF Org: IIS
Division of Information & Intelligent Systems
Recipient: THE PENNSYLVANIA STATE UNIVERSITY
Initial Amendment Date: March 8, 2021
Latest Amendment Date: May 16, 2023
Award Number: 2046972
Award Instrument: Continuing Grant
Program Manager: Dan Cosley
dcosley@nsf.gov
 (703)292-8832
IIS
 Division of Information & Intelligent Systems
CSE
 Directorate for Computer and Information Science and Engineering
Start Date: March 1, 2021
End Date: February 28, 2026 (Estimated)
Total Intended Award Amount: $500,000.00
Total Awarded Amount to Date: $506,000.00
Funds Obligated to Date: FY 2021 = $188,434.00
FY 2022 = $311,566.00

FY 2023 = $6,000.00
History of Investigator:
  • Mahanth Gowda (Principal Investigator)
Recipient Sponsored Research Office: Pennsylvania State Univ University Park
201 OLD MAIN
UNIVERSITY PARK
PA  US  16802-1503
(814)865-1372
Sponsor Congressional District: 15
Primary Place of Performance: Pennsylvania State Univ University Park
110 Technology Center Building
University Park
PA  US  16802-1503
Primary Place of Performance
Congressional District:
15
Unique Entity Identifier (UEI): NPM2J7MSCF61
Parent UEI:
NSF Program(s): HCC-Human-Centered Computing
Primary Program Source: 01002223DB NSF RESEARCH & RELATED ACTIVIT
01002122DB NSF RESEARCH & RELATED ACTIVIT

01002324DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 9251, 7367, 1045
Program Element Code(s): 736700
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.070

ABSTRACT

This project will advance the state-of-the-art in cross-disciplinary areas including motion signal processing, machine learning (ML), sign language modeling, and real-time ML with dynamic device/edge partitioning, to develop new technology for automatic Sign Language Recognition (SLR) and translation to spoken language that enables more seamless communication between deaf and hearing people. The technology will incorporate wearable devices (such as a smartwatch, smart ring, and earphones) that are gaining in popularity, and will have broad impact through its introduction in deaf communities along with a sign language equivalent of voice assistants such as Amazon Alexa. The project will establish a pipeline of collaboration with deaf students, as well as courses based on SLR technology that will be disseminated through MOOC platforms such as Coursera. Additional impact will derive from workshops on wearable computing that will be conducted at the K-12 level, and a "sign-to-speech" library that will be publicly released for extensibility of the new technology to multiple sign languages.

To achieve its goals this research will include three thrusts: Development of ML models with efficient training that can perform accurate SLR by fusing multimodal input data from wearable devices that capture body motion and facial expressions; Implementation of efficient ML models by means of optimal partitioning between end-device and edge resources to achieve the best tradeoff in real time performance and SLR accuracy; Design of systematic user studies with fluent sign language users both for generating training data for ML models as well as for validation of accuracy, usability, and acceptability of the technology within the deaf community.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Liu, Yilin and Zhang, Shijia and Gowda, Mahanth "A Practical System for 3D Hand Pose Tracking using EMG Wearables with Applications to Prosthetics and User Interfaces" IEEE Internet of Things Journal , 2022 https://doi.org/10.1109/JIOT.2022.3223600 Citation Details
Liu, Yilin and Zhang, Shijia and Gowda, Mahanth and Nelakuditi, Srihari "Leveraging the Properties of mmWave Signals for 3D Finger Motion Tracking for Interactive IoT Applications" Proceedings of the ACM on Measurement and Analysis of Computing Systems , v.6 , 2022 https://doi.org/10.1145/3570613 Citation Details
Zhang, Shijia and Liu, Yilin and Gowda, Mahanth "Let's Grab a Drink: Teacher-Student Learning for Fluid Intake Monitoring using Smart Earphones" 2022 IEEE/ACM Seventh International Conference on Internet-of-Things Design and Implementation (IoTDI) , 2022 https://doi.org/10.1109/IoTDI54339.2022.00014 Citation Details
Zhang, Shijia and Lu, Taiting and Zhou, Hao and Liu, Yilin and Liu, Runze and Gowda, Mahanth "I am an Earphone and I can Hear my Users Face: Facial Landmark Tracking using Smart Earphones" ACM Transactions on Internet of Things , 2023 https://doi.org/10.1145/3614438 Citation Details
Zhou, Hao and Lu, Taiting and DeHaan, Kenneth and Gowda, Mahanth "ASLRing: American Sign Language Recognition with Meta-Learning on Wearables" , 2024 https://doi.org/10.1109/IoTDI61053.2024.00022 Citation Details
Zhou, Hao and Lu, Taiting and Liu, Yilin and Zhang, Shijia and Gowda, Mahanth "Learning on the Rings: Self-Supervised 3D Finger Motion Tracking Using Wearable Sensors" Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies , v.6 , 2022 https://doi.org/10.1145/3534587 Citation Details
Zhou, Hao and Lu, Taiting and Liu, Yilin and Zhang, Shijia and Liu, Runze and Gowda, Mahanth "One Ring to Rule Them All: An Open Source Smartring Platform for Finger Motion Analytics and Healthcare Applications" The ACM/IEEE International Conference on Internet of Things Design and Implementation (IoTDI) , 2023 https://doi.org/10.1145/3576842.3582382 Citation Details
Zhou, Hao and Lu, Taiting and Mckinnie, Kristina and Palagano, Joseph and Dehaan, Kenneth and Gowda, Mahanth "SignQuery: A Natural User Interface and Search Engine for Sign Languages with Wearable Sensors" The 29th Annual International Conference On Mobile Computing And Networking , 2023 https://doi.org/10.1145/3570361.3613286 Citation Details

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page