Improved voice-based biometrics using multi-channel transfer learning

dc.contributor.authorCherifi, Youssouf Ismail
dc.contributor.authorDahimene, Abdelhakim
dc.date.accessioned2021-11-02T13:31:57Z
dc.date.available2021-11-02T13:31:57Z
dc.date.issued2020
dc.description.abstractIdentifying the speaker has become more of an imperative thing to do in the modern age. Especially since most personal and professional appliances rely on voice commands or speech in general terms to operate. These systems need to discern the identity of the speaker rather than just the words that have been said to be both smart and safe. Especially if we consider the numerous advanced methods that have been developed to generate fake speech segments. The objective of this paper is to improve upon the existing voice-based biometrics to keep up with these synthesizers. The proposed method focuses on defining a novel and more speaker adapted features by implying artificial neural networks and transfer learning. The approach uses pre-trained networks to define a mapping from two complementary acoustic features to a speaker adapted phonetic features. The complementary acoustics features are paired to provide both information about how the speech segments are perceived (type 1 feature) and produced (type 2 feature). The approach was evaluated using both a small and large closed-speaker data set. Primary results are encouraging and confirm the usefulness of such an approach to extract speaker adapted features whether for classical machine learning algorithms or advanced neural structures such as LSTM or CNNen_US
dc.identifier.issn1646-3692
dc.identifier.urihttps://dspace.univ-boumerdes.dz/handle/123456789/7342
dc.language.isoenen_US
dc.publisherDigital Libraryen_US
dc.relation.ispartofseriesADIS International Journal on Computer Science and Information Systems / Vol. 15, N°. 1;pp. 99-113
dc.subjectSpeech Analysisen_US
dc.subjectTransfer Learningen_US
dc.subjectPattern Recognitionen_US
dc.subjectSpeaker Recognitionen_US
dc.subjectFeature Extractionen_US
dc.titleImproved voice-based biometrics using multi-channel transfer learningen_US
dc.typeArticleen_US

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
Youssouf Ismail Cherifi.pdf
Size:
689.46 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: