Abstract
As a promising way, controlling smart devices through gestures offers the benefits of non-contact interaction, efficiency and convenience. Previous researches on acoustic-based gesture recognition have mostly focused on near-field gestures within 1 meter and for a single user only. However, such a nearfield sensing scheme is inadequate to meet the growing demands for multi-person human-computer interaction in far-field spaces. In this paper, we present a novel acoustic-based room-scale gesture recognition system that is capable of recognizing gestures simultaneously performed by multi-user. Our approach achieves far-field sensing by examining the relationship between acoustic signal frame length and sensing range, and overcoming a series of practical challenges incurred by far-field sensing. To simultaneously detect and distinguish gestures of multiple persons, we divide the sensing area into multiple beamforming sub-scanning areas and apply binary search to detect multiple users, which allows for an efficient scanning process and facilitates real-time detection. Finally, we conduct a data augmentation scheme to enlarge the training data and apply a lightweight deep learning framework to classify different gestures. Extensive experiments confirm that our system enables multi-user gesture detection and can recognize nine gestures at a distance up to 7 meters.
Original language | English |
---|---|
Title of host publication | 2023 20th Annual IEEE International Conference on Sensing, Communication, and Networking (SECON) |
Publisher | IEEE |
Pages | 231-239 |
Number of pages | 9 |
ISBN (Electronic) | 9798350300529 |
ISBN (Print) | 9798350300536 |
DOIs | |
Publication status | Published - 2023 |
Event | 2023 20th Annual IEEE International Conference on Sensing, Communication, and Networking (SECON) - Madrid, Spain Duration: 11 Sept 2023 → 14 Sept 2023 |
Conference
Conference | 2023 20th Annual IEEE International Conference on Sensing, Communication, and Networking (SECON) |
---|---|
Period | 11/09/23 → 14/09/23 |
Bibliographical note
Publisher Copyright:© 2023 IEEE.
Funding
This work was supported by the National Nature Science Foundation of China under Grant 62102139, the Nature Science Foundation of Hunan Province of China under Grant 2022JJ30168 and 2023JJ20015, and the Fundamental Research Funds for the Central Universities under Grant 531118010612. This research has benefited from financial support of the Hong Kong Institute of Business Studies, Faculty of Business, Lingnan University, Hong Kong Special Administrative Region, China.
Keywords
- acoustic
- beamforming
- channel impulse response
- far-field sensing
- gesture recognition