GINA: Group Gender Identification Using Privacy-Sensitive Audio Data

Jiaxing SHEN, Oren LEDERMAN, Jiannong CAO, Florian BERG, Shaojie TANG, Alex Sandy PENTLAND

Research output: Book Chapters | Papers in Conference ProceedingsConference paper (refereed)Researchpeer-review

13 Citations (Scopus)

Abstract

Group gender is essential in understanding social interaction and group dynamics. With the increasing privacy concerns of studying face-to-face communication in natural settings, many participants are not open to raw audio recording. Existing voice-based gender identification methods rely on acoustic characteristics caused by physiological differences and phonetic differences. However, these methods might become ineffective with privacy-sensitive audio for two main reasons. First, compared to raw audio, privacy-sensitive audio contains significantly fewer acoustic features. Moreover, natural settings generate various uncertainties in the audio data. In this paper, we make the first attempt to identify group gender using privacy-sensitive audio. Instead of extracting acoustic features from privacy-sensitive audio, we focus on conversational features including turn-taking behaviors and interruption patterns. However, conversational behaviors are unstable in gender identification as human behaviors are affected by many factors like emotion and environment. We utilize ensemble feature selection and a two-stage classification to improve the effectiveness and robustness of our approach. Ensemble feature selection could reduce the risk of choosing an unstable subset of features by aggregating the outputs of multiple feature selectors. In the first stage, we infer the gender composition (mixed-gender or same-gender) of a group which is used as an additional input feature for identifying group gender in the second stage. The estimated gender composition significantly improves the performance as it could partially account for the dynamics in conversational behaviors. According to the experimental evaluation of 100 people in 273 meetings, the proposed method outperforms baseline approaches and achieves an F1-score of 0.77 using linear SVM.

Original languageEnglish
Title of host publication2018 IEEE International Conference on Data Mining (ICDM)
PublisherIEEE
Pages457-466
Number of pages10
ISBN (Electronic)9781538691588
DOIs
Publication statusPublished - 2018
Externally publishedYes
Event18th IEEE International Conference on Data Mining, ICDM 2018 - Singapore, Singapore
Duration: 17 Nov 201820 Nov 2018

Publication series

NameProceedings - IEEE International Conference on Data Mining, ICDM
ISSN (Print)1550-4786

Conference

Conference18th IEEE International Conference on Data Mining, ICDM 2018
Country/TerritorySingapore
CitySingapore
Period17/11/1820/11/18

Bibliographical note

The work is completed during the visit of the first author to MIT Media Lab. It was partially supported by the funding for Project of Strategic Importance provided by The Hong Kong Polytechnic University (Project Code: 1-ZE26). It was also supported by demonstration project on large data provided by The Hong Kong Polytechnic University (project account code: 9A5V) and NSFC Key Grant with Project No. 61332004.

Keywords

  • Gender detection
  • Group gender identification
  • Nonlinguistic audio analysis

Fingerprint

Dive into the research topics of 'GINA: Group Gender Identification Using Privacy-Sensitive Audio Data'. Together they form a unique fingerprint.

Cite this