Research and development of sign language recognition system using neural network algorithm
Sign language is an important communication tool for the hearing-impaired community. Due to technological development, it is now possible to develop systems that can recognize, translate and process sign language into text or speech, according to the visual representation of gestures. This article e...
| Published in | 2024 IEEE 4th International Conference on Smart Information Systems and Technologies (SIST) pp. 321 - 327 |
|---|---|
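| Main Authors | Matveyas, Yegor; Mukasheva, Assel; Yedilkhan, Didar; Keneskanova, Arailym; Kambarov, Dastan; Mukhammejanova, Dinargul |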
| Format | Conference Proceeding |
| Language | English |
| Published | IEEE, 15.05.2024 |
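| Subjects | Accuracy; Classification algorithms; Keras; neural network; Python; recognition; Sign language; Software; Software algorithms; Speech recognition; Training |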
| DOI | 10.1109/SIST61555.2024.10629529 |
| Abstract | Sign language is an important communication tool for the hearing-impaired community. Thanks to technological advances, it is now possible to develop systems that recognize sign language from the visual representation of gestures and translate it into text or speech. This article explores the development of a real-time sign language recognition system using a neural network algorithm. The aim of the research is to develop a sign language recognition and translation system optimized for integration into web applications. The MediaPipe library was used to determine the key points and orientation of the user's hands and fingers. The software module then passes the collected data to a sequential neural network that includes Long Short-Term Memory (LSTM) layers. The open-source Keras library was used to build this network. The key feature of the presented model is the combination and interaction of convolutional and recurrent neural network (RNN) layers. This set of layers makes it possible to track dependencies in the data over time, which is achieved by alternating between layers of different types and reducing the number of neurons. The LSTM network is trained on a custom, editable dataset based on American Sign Language gestures. The dataset was built from recordings of signs. Each recorded sign was preprocessed to extract three-dimensional landmarks. The collected key points were then passed to the layers of the LSTM neural network, which allowed the model to learn the complex relationships between hand movements and the corresponding gestures. Each sign sample is represented by a sequence of 24 frames. The effectiveness of the neural network algorithm is evaluated using several metrics, including the accuracy of the model. 
The results of the experiment show that the developed software can achieve a high level of accuracy in recognizing sign language gestures. The relevance of this study is confirmed by its applicability in a wide range of areas. In particular, the software could be used as a service or tool for communication between people with disabilities and the general public, or as an assistive technology for people with hearing impairments. The authors note that the results demonstrate the feasibility and effectiveness of using a neural network that includes LSTM layers for sign language recognition. |
|---|---|
| Author | Kambarov, Dastan; Mukhammejanova, Dinargul; Mukasheva, Assel; Keneskanova, Arailym; Matveyas, Yegor; Yedilkhan, Didar |
| Author_xml | 1. Matveyas, Yegor (yogafeed@gmail.com), Almaty University of Power Engineering and Telecommunications, Almaty, Kazakhstan; 2. Mukasheva, Assel (mukashevascience@gmail.com), Kazakh-British Technical University, School of Information Technology and Engineering, Almaty, Kazakhstan; 3. Yedilkhan, Didar (d.yedilkhan@astanait.edu.kz), Astana IT University, Department of Computer Engineering, Astana, Kazakhstan; 4. Keneskanova, Arailym (a.keneskanova@aues.kz), Almaty University of Power Engineering and Telecommunications, Department of Information Systems and Cybersecurity, Almaty, Kazakhstan; 5. Kambarov, Dastan (d.kambarov@aues.kz), Almaty University of Power Engineering and Telecommunications, Almaty, Kazakhstan; 6. Mukhammejanova, Dinargul (m.dinargul.14@gmail.com), Al-Farabi Kazakh National University, Department of Artificial Intelligence and Big Data, Almaty, Kazakhstan |
| ContentType | Conference Proceeding |
| DBID | 6IE 6IL CBEJK RIE RIL |
| DOI | 10.1109/SIST61555.2024.10629529 |
| DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings; IEEE Xplore POP ALL; IEEE Xplore All Conference Proceedings; IEEE Electronic Library (IEL); IEEE Proceedings Order Plans (POP All) 1998-Present |
| EISBN | 9798350374865 |
| EndPage | 327 |
| ExternalDocumentID | 10629529 |
| Genre | orig-research |
| GroupedDBID | 6IE 6IL CBEJK RIE RIL |
| ID | FETCH-LOGICAL-i91t-6e78ea7d054fe77b414b8c576b7d3eb7200d9a734f17f10ca5a498b9a0f1e5e23 |
| IEDL.DBID | RIE |
| IngestDate | Wed Aug 21 05:36:52 EDT 2024 |
| IsPeerReviewed | false |
| IsScholarly | false |
| Language | English |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-i91t-6e78ea7d054fe77b414b8c576b7d3eb7200d9a734f17f10ca5a498b9a0f1e5e23 |
| PageCount | 7 |
| ParticipantIDs | ieee_primary_10629529 |
| PublicationCentury | 2000 |
| PublicationDate | 2024-May-15 |
| PublicationDateYYYYMMDD | 2024-05-15 |
| PublicationDate_xml | – month: 05 year: 2024 text: 2024-May-15 day: 15 |
| PublicationDecade | 2020 |
| PublicationTitle | 2024 IEEE 4th International Conference on Smart Information Systems and Technologies (SIST) |
| PublicationTitleAbbrev | SIST |
| PublicationYear | 2024 |
| Publisher | IEEE |
| Publisher_xml | – name: IEEE |
| Score | 1.8730013 |
| SourceID | ieee |
| SourceType | Publisher |
| StartPage | 321 |
| SubjectTerms | Accuracy; Classification algorithms; Keras; neural network; Python; recognition; Sign language; Software; Software algorithms; Speech recognition; Training |
| Title | Research and development of sign language recognition system using neural network algorithm |
| URI | https://ieeexplore.ieee.org/document/10629529 |
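
The abstract states that each sign sample is a sequence of 24 frames of three-dimensional hand landmarks, which is then passed to the LSTM layers. The sketch below illustrates only that data-shaping step and is not the authors' code. It assumes one hand per frame with 21 landmarks (the count that MediaPipe Hands reports per hand), zero-filled frames when no hand is detected, and zero padding for short recordings; the names `flatten_frame` and `build_sample` are hypothetical.

```python
from typing import List, Optional, Sequence

FRAMES_PER_SAMPLE = 24     # from the abstract: each sign sample spans 24 frames
LANDMARKS_PER_HAND = 21    # assumption: MediaPipe Hands reports 21 landmarks per hand
COORDS_PER_LANDMARK = 3    # x, y, z (the abstract mentions three-dimensional landmarks)
FEATURES_PER_FRAME = LANDMARKS_PER_HAND * COORDS_PER_LANDMARK  # 63 values for one hand

Landmark = Sequence[float]
Frame = Optional[Sequence[Landmark]]  # None means no hand was detected in that frame


def flatten_frame(landmarks: Frame) -> List[float]:
    """Flatten one frame's landmarks into a feature vector (zeros if no hand)."""
    if landmarks is None:
        return [0.0] * FEATURES_PER_FRAME
    if len(landmarks) != LANDMARKS_PER_HAND:
        raise ValueError(f"expected {LANDMARKS_PER_HAND} landmarks, got {len(landmarks)}")
    flat: List[float] = []
    for point in landmarks:
        if len(point) != COORDS_PER_LANDMARK:
            raise ValueError("each landmark needs x, y and z coordinates")
        flat.extend(float(c) for c in point)
    return flat


def build_sample(frames: Sequence[Frame]) -> List[List[float]]:
    """Truncate or zero-pad a recording to exactly FRAMES_PER_SAMPLE rows."""
    rows = [flatten_frame(f) for f in frames[:FRAMES_PER_SAMPLE]]
    while len(rows) < FRAMES_PER_SAMPLE:
        rows.append([0.0] * FEATURES_PER_FRAME)  # padding choice is an assumption
    return rows
```

Under these assumptions, every sample becomes a 24 x 63 grid of numbers, which matches the (timesteps, features) layout that recurrent layers such as a Keras LSTM expect as input. Tracking two hands would double the feature count per frame.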