An extensible speaker identification sidekit in Python
| Published in | 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 5095-5099 |
|---|---|
| Main Authors | Anthony Larcher, Kong Aik Lee, Sylvain Meignier |
| Format | Conference Proceeding, Journal Article |
| Language | English |
| Published | IEEE, 01.03.2016 |
| Subjects | Algorithms; Electronics; Feature extraction; Open source software; open-source; Panels; python; Source code; Speaker recognition; Speech; toolkit; Tutorials; Visualization |
| ISSN | 2379-190X |
| DOI | 10.1109/ICASSP.2016.7472648 |
| Abstract | SIDEKIT is a new open-source Python toolkit that includes a large panel of state-of-the-art components and allows rapid prototyping of an end-to-end speaker recognition system. For each step, from front-end feature extraction, normalization, speech activity detection, modelling, and scoring to visualization, SIDEKIT offers a wide range of standard algorithms and flexible interfaces. The use of a single efficient programming and scripting language (Python in this case), and the limited dependencies, facilitate deployment in industrial applications and extension to include new algorithms as part of the whole tool-chain provided by SIDEKIT. The performance of SIDEKIT is demonstrated on two standard evaluation tasks, namely RSR2015 and NIST-SRE 2010. |
|---|---|
| Author | Larcher, Anthony; Lee, Kong Aik; Meignier, Sylvain |
| Author_xml | 1. Anthony Larcher (anthony.larcher@univ-lemans.fr), LIUM, Univ. du Maine, Le Mans, France; 2. Kong Aik Lee, Human Language Technol. Dept., ASTAR, Singapore, Singapore; 3. Sylvain Meignier, LIUM, Univ. du Maine, Le Mans, France |
| ContentType | Conference Proceeding, Journal Article |
| Discipline | Engineering |
| EISBN | 1479999881 9781479999880 |
| EISSN | 2379-190X |
| EndPage | 5099 |
| ExternalDocumentID | oai:HAL:hal-01433157v1 7472648 |
| Genre | orig-research |
| IsDoiOpenAccess | false |
| IsOpenAccess | true |
| IsPeerReviewed | false |
| IsScholarly | true |
| Language | English |
| License | other-oa |
| OpenAccessLink | https://proxy.k.utb.cz/login?url=https://hal.science/hal-01433157 |
| PageCount | 5 |
| PublicationCentury | 2000 |
| PublicationDate | 20160301 |
| PublicationDateYYYYMMDD | 2016-03-01 |
| PublicationDate_xml | – month: 03 year: 2016 text: 20160301 day: 01 |
| PublicationDecade | 2010 |
| PublicationTitle | 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) |
| PublicationTitleAbbrev | ICASSP |
| PublicationYear | 2016 |
| Publisher | IEEE |
| Publisher_xml | – name: IEEE |
| StartPage | 5095 |
| SubjectTerms | Algorithms; Electronics; Feature extraction; Open source software; open-source; Panels; python; Source code; Speaker recognition; Speech; toolkit; Tutorials; Visualization |
| URI | https://ieeexplore.ieee.org/document/7472648, https://www.proquest.com/docview/1825498818, https://hal.science/hal-01433157 |
| UnpaywallVersion | submittedVersion |