ParsiNorm: A Persian Toolkit for Speech Processing Normalization

In general, speech processing models consist of a language model along with an acoustic model. Regardless of the language model's complexity and variants, three critical pre-processing steps are needed in language models: cleaning, normalization, and tokenization. Among mentioned steps, the nor...

Full description

Saved in:
Bibliographic Details
Published in2021 7th International Conference on Signal Processing and Intelligent Systems (ICSPIS) pp. 1 - 5
Main Authors Oji, Romina, Razavi, Seyedeh Fatemeh, Dehsorkh, Sajjad Abdi, Hariri, Alireza, Asheri, Hadi, Hosseini, Reshad
Format Conference Proceeding
LanguageEnglish
Published IEEE 29.12.2021
Subjects
Online AccessGet full text
DOI10.1109/ICSPIS54653.2021.9729392

Cover

Abstract In general, speech processing models consist of a language model along with an acoustic model. Regardless of the language model's complexity and variants, three critical pre-processing steps are needed in language models: cleaning, normalization, and tokenization. Among mentioned steps, the normalization step is so essential to format unification in pure textual applications. However, for embedded language models in speech processing modules, normalization is not limited to format unification. Moreover, it has to convert each readable symbol, number, etc., to how they are pronounced. To the best of our knowledge, there is no Persian normalization toolkits for embedded language models in speech processing modules, So in this paper, we propose an open-source normalization toolkit for text processing in speech applications. Briefly, we consider different readable Persian text like symbols (common currencies,#,@,URL, etc.), numbers (date, time, phone number, national code, etc.), and so on. Comparison with other available Persian textual normalization tools indicates the superiority of the proposed method in speech processing. Also, comparing the model's performance for one of the proposed functions (sentence separation) with other common natural language libraries such as HAZM and Parsivar indicates the proper performance of the proposed method. Besides, its evaluation of some Persian Wikipedia data confirms the proper performance of the proposed method.
AbstractList In general, speech processing models consist of a language model along with an acoustic model. Regardless of the language model's complexity and variants, three critical pre-processing steps are needed in language models: cleaning, normalization, and tokenization. Among mentioned steps, the normalization step is so essential to format unification in pure textual applications. However, for embedded language models in speech processing modules, normalization is not limited to format unification. Moreover, it has to convert each readable symbol, number, etc., to how they are pronounced. To the best of our knowledge, there is no Persian normalization toolkits for embedded language models in speech processing modules, So in this paper, we propose an open-source normalization toolkit for text processing in speech applications. Briefly, we consider different readable Persian text like symbols (common currencies,#,@,URL, etc.), numbers (date, time, phone number, national code, etc.), and so on. Comparison with other available Persian textual normalization tools indicates the superiority of the proposed method in speech processing. Also, comparing the model's performance for one of the proposed functions (sentence separation) with other common natural language libraries such as HAZM and Parsivar indicates the proper performance of the proposed method. Besides, its evaluation of some Persian Wikipedia data confirms the proper performance of the proposed method.
Author Razavi, Seyedeh Fatemeh
Dehsorkh, Sajjad Abdi
Asheri, Hadi
Oji, Romina
Hosseini, Reshad
Hariri, Alireza
Author_xml – sequence: 1
  givenname: Romina
  surname: Oji
  fullname: Oji, Romina
  email: romina.oji@ut.ac.ir
  organization: School of ECE, College of Engineering, University of Tehran,Tehran,Iran
– sequence: 2
  givenname: Seyedeh Fatemeh
  surname: Razavi
  fullname: Razavi, Seyedeh Fatemeh
  email: razavi_f@ut.ac.ir
  organization: School of ECE, College of Engineering, University of Tehran,Tehran,Iran
– sequence: 3
  givenname: Sajjad Abdi
  surname: Dehsorkh
  fullname: Dehsorkh, Sajjad Abdi
  email: sadjad.abdi@ut.ac.ir
  organization: School of ECE, College of Engineering, University of Tehran,Tehran,Iran
– sequence: 4
  givenname: Alireza
  surname: Hariri
  fullname: Hariri, Alireza
  email: alireza.hariri@ut.ac.ir
  organization: School of ECE, College of Engineering, University of Tehran,Tehran,Iran
– sequence: 5
  givenname: Hadi
  surname: Asheri
  fullname: Asheri, Hadi
  email: hadi.asheri@ut.ac.ir
  organization: School of ECE, College of Engineering, University of Tehran,Tehran,Iran
– sequence: 6
  givenname: Reshad
  surname: Hosseini
  fullname: Hosseini, Reshad
  email: reshad.hosseini@ut.ac.ir
  organization: School of ECE, College of Engineering, University of Tehran,Tehran,Iran
BookMark eNotj8tKxDAYhSM4Cx3nCdzkBVrz5x5XDsVLYXAK7X5I0lSDnWZIu9Gnd8Q5m8MHhw_OLbqe0hQQwkBKAGIe6qpt6lZwKVhJCYXSKGqYoVdoY5QGKQUnhml-g54am-f4nvLxEW9xE85gJ9ylNH7FBQ8p4_YUgv_ETU4-zHOcPvDf2o7xxy4xTXdoNdhxDptLr1H38txVb8Vu_1pX210RAfRSwEDOUU65oCTlmnAGxmhtemOF8b30lnvpuSCeg3OSOtCgaS-8g8AVW6P7f20MIRxOOR5t_j5cbrFf6_JHLg
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/ICSPIS54653.2021.9729392
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
EISBN 9781665409384
166540938X
EndPage 5
ExternalDocumentID 9729392
Genre orig-research
GroupedDBID 6IE
6IL
CBEJK
RIE
RIL
ID FETCH-LOGICAL-i118t-1f00007b7be76248043199889d9a59cd6ca4c6c450c41bb62b18182d5cb1e473
IEDL.DBID RIE
IngestDate Thu Jun 29 18:37:35 EDT 2023
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i118t-1f00007b7be76248043199889d9a59cd6ca4c6c450c41bb62b18182d5cb1e473
PageCount 5
ParticipantIDs ieee_primary_9729392
PublicationCentury 2000
PublicationDate 2021-Dec.-29
PublicationDateYYYYMMDD 2021-12-29
PublicationDate_xml – month: 12
  year: 2021
  text: 2021-Dec.-29
  day: 29
PublicationDecade 2020
PublicationTitle 2021 7th International Conference on Signal Processing and Intelligent Systems (ICSPIS)
PublicationTitleAbbrev ICSPIS
PublicationYear 2021
Publisher IEEE
Publisher_xml – name: IEEE
Score 1.8050119
Snippet In general, speech processing models consist of a language model along with an acoustic model. Regardless of the language model's complexity and variants,...
SourceID ieee
SourceType Publisher
StartPage 1
SubjectTerms Encyclopedias
Internet
Libraries
Natural languages
normalization
Online services
pre-processing
Signal processing
speech processing
Tokenization
Title ParsiNorm: A Persian Toolkit for Speech Processing Normalization
URI https://ieeexplore.ieee.org/document/9729392
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3Pa8IwFA7O007b0LHf5LDjWm1M02anDZnoQBF04E2Sl1cmipVRL_vr91KrY2OH3UpI0yaP9L2v-b73GLuPjJHGSdppICCQmUsCrdAFCoxKMdMCy3I-w5Hqv8nXWTyrsYeDFgYRS_IZhv6yPMt3OWz9r7KWpkiQ_PkRO0pStdNq7ck5bd0adCfjwcQX9-4Q7hNRWHX_UTeldBu9EzbcP3DHFlmG28KG8PkrF-N_3-iUNb8Fenx8cD1nrIbrBnsaE0hdjCgIfeTP3FPbyfR8muer5aLgFJzyyQYR3nklDqA7ue9tVpUWs8mmvZdptx9UBRKCBeGCIoiy8iTSJhbpmyZTnyiH4FOqnTaxBkcLLkGBjNsgI2uVsOTPU-FisBHKpHPO6ut8jReMA-3rTuxSGsRKzITVtIIaICOsLY2CS9bwk59vdikw5tW8r_5uvmbH3gCe9SH0DasXH1u8Jd9d2LvSaF8in5r_
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1NT8JAEJ0gHvSkBozf7sGjLbRsS9eThkhAgZCACTfSnZ1GAqHElIu_3tlSMBoP3pqmu-3uZDvzdt-bAbjz4ljGRvJKQx8dmZimo0IyTohxGFGifMrL-fQHYedNvkyCSQnud1oYIsrJZ-Tay_ws36S4tltlNcWRIPvzPdgPpJTBRq21pefUVa3bGg27I1veu8HIz_fcosGPyim542gfQX_7yg1fZO6uM-3i569sjP_9pmOofkv0xHDnfE6gRMsKPA4Zps4GHIY-iCdhye1sfDFO08V8lgkOT8VoRYTvopAHcEthn44XhRqzCuP287jVcYoSCc6MkUHmeEl-FqmbmvivJiObKocBVKSMigOFhqdcYogyqKP0tA59zR498k2A2iPZbJxCeZku6QwE8spuBCbiTrSkxNeKZ1AhJoy2ZRziOVTs4KerTRKMaTHui79v38JBZ9zvTXvdweslHFpjWA6Ir66gnH2s6Zo9eaZvcgN-AXKvnkw
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2021+7th+International+Conference+on+Signal+Processing+and+Intelligent+Systems+%28ICSPIS%29&rft.atitle=ParsiNorm%3A+A+Persian+Toolkit+for+Speech+Processing+Normalization&rft.au=Oji%2C+Romina&rft.au=Razavi%2C+Seyedeh+Fatemeh&rft.au=Dehsorkh%2C+Sajjad+Abdi&rft.au=Hariri%2C+Alireza&rft.date=2021-12-29&rft.pub=IEEE&rft.spage=1&rft.epage=5&rft_id=info:doi/10.1109%2FICSPIS54653.2021.9729392&rft.externalDocID=9729392