This data set is colleted for our work (Speaker recognition system of flexible throat microphone using contrastive learning). A small-scale Flexible Throat Microphone (FTM) speech data set is collected for training and evaluation. As shown in Fig.1, a sensitive piezoelectric film vibration sensor is used as the FTM for recording speech. Google speech command data set that contains about 65,000 samples of 30 English words served as our reference in the process of recording FTM speech. We select the most common 20 words of the 30, and invite 20 speakers (10 males and 10 females) to pronounce them ten times. The 20 speakers are from different parts of China and speak with different accents, which increases the diversity of this data set. These signals are recorded to form the FTM data set with 4,000 data. And the corresponding CM data are recorded simultaneously to form parallel data pairs when recording FTM data. Each zip file is named after the speaker and contains FTM and CM speech files.
-
Notifications
You must be signed in to change notification settings - Fork 0
ChandlerBean/Speaker-recognition-system-of-flexible-throat-microphone-using-contrastive-learning
About
Speaker recognition for flexible throat microphone viacontrastive learning
Topics
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published