Multi-Modal Instrument Recognition Method Based on Improved YOLOv5s and ESPNet

Addressing the challenges of high cost, low accuracy, and poor real-time performance in anomaly detection within industrial production processes, this study proposes a multi-modal instrument recognition method based on improved YOLOv5s and ESPNet. Specifically, a dynamic non-monotonic focusing mecha...

Full description

Saved in:
Bibliographic Details
Published inIEEE International Conference on Power, Intelligent Computing and Systems (Online) pp. 549 - 555
Main Authors Shen, Shixian, Wang, Jianguo, Feng, Yong, Li, Yingna
Format Conference Proceeding
LanguageEnglish
Published IEEE 26.07.2024
Subjects
Online AccessGet full text
ISSN2834-8567
DOI10.1109/ICPICS62053.2024.10797062

Cover

Abstract Addressing the challenges of high cost, low accuracy, and poor real-time performance in anomaly detection within industrial production processes, this study proposes a multi-modal instrument recognition method based on improved YOLOv5s and ESPNet. Specifically, a dynamic non-monotonic focusing mechanism is introduced into the YOLOv5s object detection model, enhancing its ability to accurately detect the position of pointer-type instruments. Simultaneously, affine and perspective transformations are employed to rectify skewed and rotated images. Secondly, variable convolutions are integrated into the ESPNet segmentation network to adaptively capture nonlinear deformation elements of pointers and dials in images, thereby extracting comprehensive key information from both. Subsequently, a Hough transform operation is applied to fit the straight line representing the pointer, while contour tracking methods are utilized to extract the valid range of dial data. Based on linear proportional relationships, the reading of pointer-type instruments is computed. Finally, a comprehensive model is built by integrating image-acquired instrument data with corresponding sensor data, enabling anomaly detection from a multimodal data analysis perspective. Experimental results demonstrate an identification accuracy of 96.21% for pointer-type instruments, with an average detection speed of 0.216 seconds and a data anomaly detection rate of 99.43%. This method accurately and rapidly identifies data anomalies, meeting the demands for key indicator monitoring in industrial production.
AbstractList Addressing the challenges of high cost, low accuracy, and poor real-time performance in anomaly detection within industrial production processes, this study proposes a multi-modal instrument recognition method based on improved YOLOv5s and ESPNet. Specifically, a dynamic non-monotonic focusing mechanism is introduced into the YOLOv5s object detection model, enhancing its ability to accurately detect the position of pointer-type instruments. Simultaneously, affine and perspective transformations are employed to rectify skewed and rotated images. Secondly, variable convolutions are integrated into the ESPNet segmentation network to adaptively capture nonlinear deformation elements of pointers and dials in images, thereby extracting comprehensive key information from both. Subsequently, a Hough transform operation is applied to fit the straight line representing the pointer, while contour tracking methods are utilized to extract the valid range of dial data. Based on linear proportional relationships, the reading of pointer-type instruments is computed. Finally, a comprehensive model is built by integrating image-acquired instrument data with corresponding sensor data, enabling anomaly detection from a multimodal data analysis perspective. Experimental results demonstrate an identification accuracy of 96.21% for pointer-type instruments, with an average detection speed of 0.216 seconds and a data anomaly detection rate of 99.43%. This method accurately and rapidly identifies data anomalies, meeting the demands for key indicator monitoring in industrial production.
Author Feng, Yong
Li, Yingna
Wang, Jianguo
Shen, Shixian
Author_xml – sequence: 1
  givenname: Shixian
  surname: Shen
  fullname: Shen, Shixian
  email: sherxian@163.com
  organization: Kunming University of Science and Technology,Yunnan Key Laboratory of Computer Technologies Application,Kunming,China
– sequence: 2
  givenname: Jianguo
  surname: Wang
  fullname: Wang, Jianguo
  email: wangjg@chncopper.com
  organization: China Copper Industry Co., Ltd,Kunming,China
– sequence: 3
  givenname: Yong
  surname: Feng
  fullname: Feng, Yong
  email: fybraver@163.com
  organization: Kunming University of Science and Technology,Yunnan Key Laboratory of Computer Technologies Application,Kunming,China
– sequence: 4
  givenname: Yingna
  surname: Li
  fullname: Li, Yingna
  email: 84077971@qq.com
  organization: Kunming University of Science and Technology,Yunnan Key Laboratory of Computer Technologies Application,Kunming,China
BookMark eNo1kM1OwzAQhA0CiVL6BhzMA6Ss7Tj2HiFqIVL6I9oLp8qptxDUOlXiVuLtiQScZuYymm9u2VVoAjH2IGAsBOBjkS-LfJVJ0GosQaZjAQYNZPKCjdCgVRqUSZXQl2wgrUoTqzNzw0Zd9wUASgphrB2w-ey0j3Uya7zb8yJ0sT0dKET-RtvmI9SxbgKfUfxsPH92HXne5-JwbJtz798X5eKsO-6C55PVck7xjl3v3L6j0Z8O2Xo6WeevSbl4KfKnMqlRxMRkuEPUXqVbiRqrfi4oACcIqsphRSSc9-iUFZkwFaY98E4rgS7dWtezDdn9b21NRJtjWx9c-735f0D9AMeBUTo
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/ICPICS62053.2024.10797062
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE/IET Electronic Library
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
EISBN 9798350374315
EISSN 2834-8567
EndPage 555
ExternalDocumentID 10797062
Genre orig-research
GroupedDBID 6IE
6IL
6IN
AAWTH
ABLEC
ADZIZ
ALMA_UNASSIGNED_HOLDINGS
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CBEJK
CHZPO
IEGSK
OCL
RIE
RIL
ID FETCH-LOGICAL-i91t-769f995d34c2959b7980300a1e0bba9bee1add9a381617b94109f5319a4c8a983
IEDL.DBID RIE
IngestDate Wed Jan 22 08:32:23 EST 2025
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i91t-769f995d34c2959b7980300a1e0bba9bee1add9a381617b94109f5319a4c8a983
PageCount 7
ParticipantIDs ieee_primary_10797062
PublicationCentury 2000
PublicationDate 2024-July-26
PublicationDateYYYYMMDD 2024-07-26
PublicationDate_xml – month: 07
  year: 2024
  text: 2024-July-26
  day: 26
PublicationDecade 2020
PublicationTitle IEEE International Conference on Power, Intelligent Computing and Systems (Online)
PublicationTitleAbbrev ICPICS
PublicationYear 2024
Publisher IEEE
Publisher_xml – name: IEEE
SSID ssj0003211788
Score 1.8881699
Snippet Addressing the challenges of high cost, low accuracy, and poor real-time performance in anomaly detection within industrial production processes, this study...
SourceID ieee
SourceType Publisher
StartPage 549
SubjectTerms Accuracy
Analytical models
Anomaly detection
Data models
digital twin
dynamic non-monotonic focus mechanism
ESPNet
Focusing
Image segmentation
Industries
Instruments
Monitoring
pointer-type instrument
YOLO
Title Multi-Modal Instrument Recognition Method Based on Improved YOLOv5s and ESPNet
URI https://ieeexplore.ieee.org/document/10797062
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LSwMxEA7ag3hSseKbCF6z7iPZTa6WllbsttgK9VTyWhBlV3DrwV_vJNutKAjesoFAMtnJZCbzfYPQdcS1ZGmSEWm5IDRWEZERMwSMs4XbQcGM5-ke5-nwkd4t2GINVvdYGGutTz6zgWv6t3xT6ZULlYGGZyIL3Ym7nfG0AWttAioJuDLgz-2gqzWP5s2oNx31ZmkM_xk4gjEN2vE_Kql4QzLYQ3k7hSZ_5CVY1SrQn7_YGf89x33U_cbs4enGGh2gLVseotzja8m4MvIVjzxZrBuOH9q0oarEY19DGt-COTMYvpswA7SfJveTD_aOZWlwfzbNbd1F80F_3huSdQUF8iyimmSpKIRgJqE6FkyoTHDQ6VBGNlRKCmVtBMebkO7xMMqUoCC1wimlpJpLwZMj1Cmr0h4jzBk1POVWwoUP1F6JRNNQOyaZmBaFFSeo62SxfGs4MpatGE7_6D9Du25LXJQ0Ts9RB9ZvL8C81-rSb-sXnN2iCA
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3dS8MwEA8yQX1SceK3EXxt7UfSJq-OjVXXbrgJ82kkTQqitIKdD_71XtJ1oiD4lgRawiWXy13u9zuErn2WCxqFsSM04w4JpO8InyoHjLOG20FBleXpTrNo-Eju5nS-AqtbLIzW2iafadc07Vu-qvKlCZWBhsc89syJu0kJIbSBa61DKiE4M-DRbaGrFZPmTdKbJL1pFMBOA1cwIG77hx-1VKwpGeyirJ1Ek0Hy4i5r6eafv_gZ_z3LPdT9Ru3hydoe7aMNXR6gzCJsnbRS4hUnli7WfI4f2sShqsSprSKNb8GgKQz9JtAA7afxaPxB37EoFe5PJ5muu2g26M96Q2dVQ8F55n7txBEvOKcqJHnAKZcxZ6DVnvC1J6XgUmsfDjguzPOhH0tOQGqFUUtBciY4Cw9Rp6xKfYQwo0SxiGkBVz5QfMnDnHi54ZIJSFFofoy6RhaLt4YlY9GK4eSP8Uu0PZylo8Uoye5P0Y5ZHhMzDaIz1AFZ6HMw9rW8sEv8BUzMpVU
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=IEEE+International+Conference+on+Power%2C+Intelligent+Computing+and+Systems+%28Online%29&rft.atitle=Multi-Modal+Instrument+Recognition+Method+Based+on+Improved+YOLOv5s+and+ESPNet&rft.au=Shen%2C+Shixian&rft.au=Wang%2C+Jianguo&rft.au=Feng%2C+Yong&rft.au=Li%2C+Yingna&rft.date=2024-07-26&rft.pub=IEEE&rft.eissn=2834-8567&rft.spage=549&rft.epage=555&rft_id=info:doi/10.1109%2FICPICS62053.2024.10797062&rft.externalDocID=10797062