ParsiNorm: A Persian Toolkit for Speech Processing Normalization
In general, speech processing models consist of a language model along with an acoustic model. Regardless of the language model's complexity and variants, three critical pre-processing steps are needed in language models: cleaning, normalization, and tokenization. Among mentioned steps, the nor...
Saved in:
| Published in | 2021 7th International Conference on Signal Processing and Intelligent Systems (ICSPIS) pp. 1 - 5 |
|---|---|
| Main Authors | , , , , , |
| Format | Conference Proceeding |
| Language | English |
| Published |
IEEE
29.12.2021
|
| Subjects | |
| Online Access | Get full text |
| DOI | 10.1109/ICSPIS54653.2021.9729392 |
Cover
| Abstract | In general, speech processing models consist of a language model along with an acoustic model. Regardless of the language model's complexity and variants, three critical pre-processing steps are needed in language models: cleaning, normalization, and tokenization. Among mentioned steps, the normalization step is so essential to format unification in pure textual applications. However, for embedded language models in speech processing modules, normalization is not limited to format unification. Moreover, it has to convert each readable symbol, number, etc., to how they are pronounced. To the best of our knowledge, there is no Persian normalization toolkits for embedded language models in speech processing modules, So in this paper, we propose an open-source normalization toolkit for text processing in speech applications. Briefly, we consider different readable Persian text like symbols (common currencies,#,@,URL, etc.), numbers (date, time, phone number, national code, etc.), and so on. Comparison with other available Persian textual normalization tools indicates the superiority of the proposed method in speech processing. Also, comparing the model's performance for one of the proposed functions (sentence separation) with other common natural language libraries such as HAZM and Parsivar indicates the proper performance of the proposed method. Besides, its evaluation of some Persian Wikipedia data confirms the proper performance of the proposed method. |
|---|---|
| AbstractList | In general, speech processing models consist of a language model along with an acoustic model. Regardless of the language model's complexity and variants, three critical pre-processing steps are needed in language models: cleaning, normalization, and tokenization. Among mentioned steps, the normalization step is so essential to format unification in pure textual applications. However, for embedded language models in speech processing modules, normalization is not limited to format unification. Moreover, it has to convert each readable symbol, number, etc., to how they are pronounced. To the best of our knowledge, there is no Persian normalization toolkits for embedded language models in speech processing modules, So in this paper, we propose an open-source normalization toolkit for text processing in speech applications. Briefly, we consider different readable Persian text like symbols (common currencies,#,@,URL, etc.), numbers (date, time, phone number, national code, etc.), and so on. Comparison with other available Persian textual normalization tools indicates the superiority of the proposed method in speech processing. Also, comparing the model's performance for one of the proposed functions (sentence separation) with other common natural language libraries such as HAZM and Parsivar indicates the proper performance of the proposed method. Besides, its evaluation of some Persian Wikipedia data confirms the proper performance of the proposed method. |
| Author | Razavi, Seyedeh Fatemeh Dehsorkh, Sajjad Abdi Asheri, Hadi Oji, Romina Hosseini, Reshad Hariri, Alireza |
| Author_xml | – sequence: 1 givenname: Romina surname: Oji fullname: Oji, Romina email: romina.oji@ut.ac.ir organization: School of ECE, College of Engineering, University of Tehran,Tehran,Iran – sequence: 2 givenname: Seyedeh Fatemeh surname: Razavi fullname: Razavi, Seyedeh Fatemeh email: razavi_f@ut.ac.ir organization: School of ECE, College of Engineering, University of Tehran,Tehran,Iran – sequence: 3 givenname: Sajjad Abdi surname: Dehsorkh fullname: Dehsorkh, Sajjad Abdi email: sadjad.abdi@ut.ac.ir organization: School of ECE, College of Engineering, University of Tehran,Tehran,Iran – sequence: 4 givenname: Alireza surname: Hariri fullname: Hariri, Alireza email: alireza.hariri@ut.ac.ir organization: School of ECE, College of Engineering, University of Tehran,Tehran,Iran – sequence: 5 givenname: Hadi surname: Asheri fullname: Asheri, Hadi email: hadi.asheri@ut.ac.ir organization: School of ECE, College of Engineering, University of Tehran,Tehran,Iran – sequence: 6 givenname: Reshad surname: Hosseini fullname: Hosseini, Reshad email: reshad.hosseini@ut.ac.ir organization: School of ECE, College of Engineering, University of Tehran,Tehran,Iran |
| BookMark | eNotj8tKxDAYhSM4Cx3nCdzkBVrz5x5XDsVLYXAK7X5I0lSDnWZIu9Gnd8Q5m8MHhw_OLbqe0hQQwkBKAGIe6qpt6lZwKVhJCYXSKGqYoVdoY5QGKQUnhml-g54am-f4nvLxEW9xE85gJ9ylNH7FBQ8p4_YUgv_ETU4-zHOcPvDf2o7xxy4xTXdoNdhxDptLr1H38txVb8Vu_1pX210RAfRSwEDOUU65oCTlmnAGxmhtemOF8b30lnvpuSCeg3OSOtCgaS-8g8AVW6P7f20MIRxOOR5t_j5cbrFf6_JHLg |
| ContentType | Conference Proceeding |
| DBID | 6IE 6IL CBEJK RIE RIL |
| DOI | 10.1109/ICSPIS54653.2021.9729392 |
| DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Xplore POP ALL IEEE Xplore All Conference Proceedings IEEE Electronic Library (IEL) IEEE Proceedings Order Plans (POP All) 1998-Present |
| DatabaseTitleList | |
| Database_xml | – sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher |
| DeliveryMethod | fulltext_linktorsrc |
| EISBN | 9781665409384 166540938X |
| EndPage | 5 |
| ExternalDocumentID | 9729392 |
| Genre | orig-research |
| GroupedDBID | 6IE 6IL CBEJK RIE RIL |
| ID | FETCH-LOGICAL-i118t-1f00007b7be76248043199889d9a59cd6ca4c6c450c41bb62b18182d5cb1e473 |
| IEDL.DBID | RIE |
| IngestDate | Thu Jun 29 18:37:35 EDT 2023 |
| IsPeerReviewed | false |
| IsScholarly | false |
| Language | English |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-i118t-1f00007b7be76248043199889d9a59cd6ca4c6c450c41bb62b18182d5cb1e473 |
| PageCount | 5 |
| ParticipantIDs | ieee_primary_9729392 |
| PublicationCentury | 2000 |
| PublicationDate | 2021-Dec.-29 |
| PublicationDateYYYYMMDD | 2021-12-29 |
| PublicationDate_xml | – month: 12 year: 2021 text: 2021-Dec.-29 day: 29 |
| PublicationDecade | 2020 |
| PublicationTitle | 2021 7th International Conference on Signal Processing and Intelligent Systems (ICSPIS) |
| PublicationTitleAbbrev | ICSPIS |
| PublicationYear | 2021 |
| Publisher | IEEE |
| Publisher_xml | – name: IEEE |
| Score | 1.8050119 |
| Snippet | In general, speech processing models consist of a language model along with an acoustic model. Regardless of the language model's complexity and variants,... |
| SourceID | ieee |
| SourceType | Publisher |
| StartPage | 1 |
| SubjectTerms | Encyclopedias Internet Libraries Natural languages normalization Online services pre-processing Signal processing speech processing Tokenization |
| Title | ParsiNorm: A Persian Toolkit for Speech Processing Normalization |
| URI | https://ieeexplore.ieee.org/document/9729392 |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3Pa8IwFA7O007b0LHf5LDjWm1M02anDZnoQBF04E2Sl1cmipVRL_vr91KrY2OH3UpI0yaP9L2v-b73GLuPjJHGSdppICCQmUsCrdAFCoxKMdMCy3I-w5Hqv8nXWTyrsYeDFgYRS_IZhv6yPMt3OWz9r7KWpkiQ_PkRO0pStdNq7ck5bd0adCfjwcQX9-4Q7hNRWHX_UTeldBu9EzbcP3DHFlmG28KG8PkrF-N_3-iUNb8Fenx8cD1nrIbrBnsaE0hdjCgIfeTP3FPbyfR8muer5aLgFJzyyQYR3nklDqA7ue9tVpUWs8mmvZdptx9UBRKCBeGCIoiy8iTSJhbpmyZTnyiH4FOqnTaxBkcLLkGBjNsgI2uVsOTPU-FisBHKpHPO6ut8jReMA-3rTuxSGsRKzITVtIIaICOsLY2CS9bwk59vdikw5tW8r_5uvmbH3gCe9SH0DasXH1u8Jd9d2LvSaF8in5r_ |
| linkProvider | IEEE |
| linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1NT8JAEJ0gHvSkBozf7sGjLbRsS9eThkhAgZCACTfSnZ1GAqHElIu_3tlSMBoP3pqmu-3uZDvzdt-bAbjz4ljGRvJKQx8dmZimo0IyTohxGFGifMrL-fQHYedNvkyCSQnud1oYIsrJZ-Tay_ws36S4tltlNcWRIPvzPdgPpJTBRq21pefUVa3bGg27I1veu8HIz_fcosGPyim542gfQX_7yg1fZO6uM-3i569sjP_9pmOofkv0xHDnfE6gRMsKPA4Zps4GHIY-iCdhye1sfDFO08V8lgkOT8VoRYTvopAHcEthn44XhRqzCuP287jVcYoSCc6MkUHmeEl-FqmbmvivJiObKocBVKSMigOFhqdcYogyqKP0tA59zR498k2A2iPZbJxCeZku6QwE8spuBCbiTrSkxNeKZ1AhJoy2ZRziOVTs4KerTRKMaTHui79v38JBZ9zvTXvdweslHFpjWA6Ir66gnH2s6Zo9eaZvcgN-AXKvnkw |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2021+7th+International+Conference+on+Signal+Processing+and+Intelligent+Systems+%28ICSPIS%29&rft.atitle=ParsiNorm%3A+A+Persian+Toolkit+for+Speech+Processing+Normalization&rft.au=Oji%2C+Romina&rft.au=Razavi%2C+Seyedeh+Fatemeh&rft.au=Dehsorkh%2C+Sajjad+Abdi&rft.au=Hariri%2C+Alireza&rft.date=2021-12-29&rft.pub=IEEE&rft.spage=1&rft.epage=5&rft_id=info:doi/10.1109%2FICSPIS54653.2021.9729392&rft.externalDocID=9729392 |