Multi-Modal Instrument Recognition Method Based on Improved YOLOv5s and ESPNet
| Published in | IEEE International Conference on Power, Intelligent Computing and Systems (Online) pp. 549 - 555 |
|---|---|
| Main Authors | Shen, Shixian; Wang, Jianguo; Feng, Yong; Li, Yingna |
| Format | Conference Proceeding |
| Language | English |
| Published | IEEE, 26.07.2024 |
| Subjects | |
| ISSN | 2834-8567 |
| DOI | 10.1109/ICPICS62053.2024.10797062 |
| Abstract | To address the high cost, low accuracy, and poor real-time performance of anomaly detection in industrial production processes, this study proposes a multi-modal instrument recognition method based on improved YOLOv5s and ESPNet. First, a dynamic non-monotonic focusing mechanism is introduced into the YOLOv5s object detection model to improve its localization of pointer-type instruments, and affine and perspective transformations are used to rectify skewed and rotated images. Second, variable convolutions are integrated into the ESPNet segmentation network to adaptively capture the nonlinear deformations of pointers and dials in images, so that the key information of both is extracted. Third, a Hough transform fits the straight line representing the pointer, contour tracking extracts the valid range of the dial data, and the reading of the pointer-type instrument is computed from linear proportional relationships. Finally, a comprehensive model integrates the instrument data acquired from images with the corresponding sensor data, enabling anomaly detection from a multimodal data analysis perspective. Experimental results show a recognition accuracy of 96.21% for pointer-type instruments, an average detection time of 0.216 seconds, and a data anomaly detection rate of 99.43%. The method identifies data anomalies accurately and quickly, meeting the demands of key indicator monitoring in industrial production. |
| Author | Feng, Yong; Li, Yingna; Wang, Jianguo; Shen, Shixian |
| Author_xml | 1. Shen, Shixian (sherxian@163.com), Kunming University of Science and Technology, Yunnan Key Laboratory of Computer Technologies Application, Kunming, China; 2. Wang, Jianguo (wangjg@chncopper.com), China Copper Industry Co., Ltd, Kunming, China; 3. Feng, Yong (fybraver@163.com), Kunming University of Science and Technology, Yunnan Key Laboratory of Computer Technologies Application, Kunming, China; 4. Li, Yingna (84077971@qq.com), Kunming University of Science and Technology, Yunnan Key Laboratory of Computer Technologies Application, Kunming, China |
| ContentType | Conference Proceeding |
| DBID | 6IE 6IL CBEJK RIE RIL |
| DOI | 10.1109/ICPICS62053.2024.10797062 |
| DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings; IEEE Xplore POP ALL; IEEE Xplore All Conference Proceedings; IEEE Electronic Library (IEL); IEEE Proceedings Order Plans (POP All) 1998-Present |
| DatabaseTitleList | |
| Database_xml | – sequence: 1 dbid: RIE name: IEEE/IET Electronic Library url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher |
| DeliveryMethod | fulltext_linktorsrc |
| EISBN | 9798350374315 |
| EISSN | 2834-8567 |
| EndPage | 555 |
| ExternalDocumentID | 10797062 |
| Genre | orig-research |
| GroupedDBID | 6IE 6IL 6IN AAWTH ABLEC ADZIZ ALMA_UNASSIGNED_HOLDINGS BEFXN BFFAM BGNUA BKEBE BPEOZ CBEJK CHZPO IEGSK OCL RIE RIL |
| ID | FETCH-LOGICAL-i91t-769f995d34c2959b7980300a1e0bba9bee1add9a381617b94109f5319a4c8a983 |
| IEDL.DBID | RIE |
| IngestDate | Wed Jan 22 08:32:23 EST 2025 |
| IsPeerReviewed | false |
| IsScholarly | false |
| Language | English |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-i91t-769f995d34c2959b7980300a1e0bba9bee1add9a381617b94109f5319a4c8a983 |
| PageCount | 7 |
| ParticipantIDs | ieee_primary_10797062 |
| PublicationCentury | 2000 |
| PublicationDate | 2024-July-26 |
| PublicationDateYYYYMMDD | 2024-07-26 |
| PublicationDate_xml | – month: 07 year: 2024 text: 2024-July-26 day: 26 |
| PublicationDecade | 2020 |
| PublicationTitle | IEEE International Conference on Power, Intelligent Computing and Systems (Online) |
| PublicationTitleAbbrev | ICPICS |
| PublicationYear | 2024 |
| Publisher | IEEE |
| Publisher_xml | – name: IEEE |
| SSID | ssj0003211788 |
| Score | 1.8881699 |
| SourceID | ieee |
| SourceType | Publisher |
| StartPage | 549 |
| SubjectTerms | Accuracy; Analytical models; Anomaly detection; Data models; digital twin; dynamic non-monotonic focus mechanism; ESPNet; Focusing; Image segmentation; Industries; Instruments; Monitoring; pointer-type instrument; YOLO |
| Title | Multi-Modal Instrument Recognition Method Based on Improved YOLOv5s and ESPNet |
| URI | https://ieeexplore.ieee.org/document/10797062 |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=IEEE+International+Conference+on+Power%2C+Intelligent+Computing+and+Systems+%28Online%29&rft.atitle=Multi-Modal+Instrument+Recognition+Method+Based+on+Improved+YOLOv5s+and+ESPNet&rft.au=Shen%2C+Shixian&rft.au=Wang%2C+Jianguo&rft.au=Feng%2C+Yong&rft.au=Li%2C+Yingna&rft.date=2024-07-26&rft.pub=IEEE&rft.eissn=2834-8567&rft.spage=549&rft.epage=555&rft_id=info:doi/10.1109%2FICPICS62053.2024.10797062&rft.externalDocID=10797062 |
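
The abstract states that the instrument reading is computed from linear proportional relationships once the pointer line and the valid dial range are known, and that image readings are then compared with sensor data. The record does not give the formulas, so the following is only a minimal sketch under stated assumptions: a circular dial with a linear scale, angles measured clockwise in image coordinates, and a simple tolerance check standing in for the paper's anomaly model. All function names, parameters, and the 2% tolerance are hypothetical and are not taken from the paper.

```python
import math


def pointer_angle(pivot, tip):
    """Return the pointer direction in degrees, in the range [0, 360).

    Assumption: image coordinates (x to the right, y downward), so the
    angle grows clockwise as seen on screen. `pivot` and `tip` are (x, y)
    points on the fitted pointer line, e.g. from a Hough transform.
    """
    dx = tip[0] - pivot[0]
    dy = tip[1] - pivot[1]
    return math.degrees(math.atan2(dy, dx)) % 360.0


def dial_reading(angle, start_angle, end_angle, min_value, max_value):
    """Map a pointer angle to a reading by linear proportion.

    Assumption: the scale runs clockwise from `start_angle` (min_value)
    to `end_angle` (max_value) and is linear in between.
    """
    sweep = (end_angle - start_angle) % 360.0
    if sweep == 0.0:
        raise ValueError("start and end angles must differ")
    offset = (angle - start_angle) % 360.0
    if offset > sweep:
        raise ValueError("pointer lies outside the valid scale range")
    return min_value + (offset / sweep) * (max_value - min_value)


def is_anomalous(image_reading, sensor_reading, full_scale, tolerance=0.02):
    """Flag a mismatch between image and sensor readings.

    Assumption: a plain threshold on the absolute difference, expressed
    as a fraction of full scale. The paper's comprehensive model is not
    described in this record and may differ substantially.
    """
    return abs(image_reading - sensor_reading) > tolerance * full_scale


# Hypothetical gauge: scale from 0 to 1.6, starting at the lower left
# (135 degrees) and ending at the lower right (45 degrees), a 270 degree sweep.
angle = pointer_angle(pivot=(100, 100), tip=(100, 50))  # pointer straight up
reading = dial_reading(angle, 135.0, 45.0, 0.0, 1.6)
flag = is_anomalous(reading, sensor_reading=1.0, full_scale=1.6)
```

In this sketch the modulo arithmetic handles scales whose sweep crosses the 0 degree direction, which is the common case for gauges with the gap at the bottom. A real pipeline would also need to decide which end of the fitted line is the pointer tip, a step this example takes as given.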