American Sign Language Alphabet Recognition with YOLOv5 Enhanced by MediaPipe Hands

Zijian WANG, Ziwei LIU, Zongxi LI*, Yufeng JIANG, Fengheng LI

*Corresponding author for this work

Research output: Book Chapters | Papers in Conference ProceedingsConference paper (refereed)Researchpeer-review

Abstract

Sign language recognition (SLR) aims to break the communication barrier between hearing-impaired individuals and others, and is beneficial in building an inclusive and caring society. This paper aims to achieve real-Time static sign language interpretation from images employing deep learning methods. Conventionally, a pipeline approach is used to locate the position of the hand gesture in an image and then classify the gesture. You Only Look Once version 5 (YOLOv5) is a suitable model that captures the hand gesture with bounding boxes and predicts the gesture with a convolution-based classifier. However, in practice, SLR performance may be significantly affected when gestures are partially occluded, or the background environment is complicated. Therefore, this paper proposes a method using MediaPipe Hands to enhance hand features, enabling YOLOv5 to locate the hand position more precisely. In addition, MediaPipe Hands can predict finger position even when the hand is partly obscured, providing essential features in the classification stage. In experiments, MediaPipe Hands improves the success rate of hand localization in complex environments and increase hand gesture classification accuracy. Compared to the baseline model, the model employing features enhanced by MediaPipe Hands outperformed those without MediaPipe Hands in static sign language recognition (SSLR1). Moreover, our method was tested in real-life scenarios by implementing a web service application and demonstrated improved real-Time recognition performance.

Original languageEnglish
Title of host publicationProceedings of the 2023 International Conference on Instrumentation, Control, and Automation, ICA 2023
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages103-108
Number of pages6
ISBN (Electronic)9798350301274
ISBN (Print)9798350301281
DOIs
Publication statusPublished - 2023
Externally publishedYes
Event8th International Conference on Instrumentation, Control, and Automation, ICA 2023 - Jakarta, Indonesia
Duration: 9 Aug 202311 Aug 2023

Conference

Conference8th International Conference on Instrumentation, Control, and Automation, ICA 2023
Country/TerritoryIndonesia
CityJakarta
Period9/08/2311/08/23

Bibliographical note

Publisher Copyright:
© 2023 IEEE.

Keywords

  • Image Classification
  • MediaPipe
  • Sign Language
  • Static Sign Language Recognition
  • YOLOv5

Fingerprint

Dive into the research topics of 'American Sign Language Alphabet Recognition with YOLOv5 Enhanced by MediaPipe Hands'. Together they form a unique fingerprint.

Cite this