An extensible speaker identification sidekit in Python

Bibliographic Details
Published in 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 5095-5099
Main Authors Larcher, Anthony; Lee, Kong Aik; Meignier, Sylvain
Format Conference Proceeding, Journal Article
Language English
Published IEEE, 01.03.2016
ISSN 2379-190X
DOI 10.1109/ICASSP.2016.7472648

Abstract SIDEKIT is a new open-source Python toolkit that includes a large panel of state-of-the-art components and allows rapid prototyping of an end-to-end speaker recognition system. For each step, from front-end feature extraction, normalization, speech activity detection and modelling to scoring and visualization, SIDEKIT offers a wide range of standard algorithms and flexible interfaces. The use of a single efficient programming and scripting language (Python in this case) and the limited dependencies facilitate deployment for industrial applications and extension to include new algorithms as part of the whole tool-chain provided by SIDEKIT. The performance of SIDEKIT is demonstrated on two standard evaluation tasks, namely RSR2015 and NIST-SRE 2010.
Authors and affiliations
1. Larcher, Anthony (anthony.larcher@univ-lemans.fr), LIUM, Univ. du Maine, Le Mans, France
2. Lee, Kong Aik, Human Language Technol. Dept., ASTAR, Singapore, Singapore
3. Meignier, Sylvain, LIUM, Univ. du Maine, Le Mans, France
Discipline Engineering
EISBN 1479999881
9781479999880
EISSN 2379-190X
OpenAccessLink https://proxy.k.utb.cz/login?url=https://hal.science/hal-01433157
SubjectTerms Algorithms
Electronics
Feature extraction
Open source software
open-source
Panels
python
Source code
Speaker recognition
Speech
toolkit
Tutorials
Visualization
URI https://ieeexplore.ieee.org/document/7472648
https://www.proquest.com/docview/1825498818
https://hal.science/hal-01433157