Research on Vision-based Water Surface Garbage Detection and Localization Methods

In complex water surface environments, real-time detection and localization of water surface garbage targets are essential for the advancement of water surface cleaning robots. To meet the practical needs of high accuracy and low time consumption, this paper presents a method for identifying and loc...

Full description

Saved in:
Bibliographic Details
Published inInternational Conference on Automation, Control and Robotics Engineering (Online) pp. 137 - 144
Main Authors Bao, Han, Wang, Yuliang, Li, Yiping, Li, Liang, Wang, Hailin
Format Conference Proceeding
LanguageEnglish
Published IEEE 16.07.2025
Subjects
Online AccessGet full text
ISSN2997-6278
DOI10.1109/CACRE66141.2025.11119551

Cover

Abstract In complex water surface environments, real-time detection and localization of water surface garbage targets are essential for the advancement of water surface cleaning robots. To meet the practical needs of high accuracy and low time consumption, this paper presents a method for identifying and localizing water surface debris using YOLOv8-WSG, which integrates deep learning with vision-based detection and localization techniques. Firstly, relying on the FloW - Img dataset, the water surface garbage homemade dataset was constructed by means of web search, on-site collection, and manual labeling. Secondly, based on YOLOv8, To enhance the detection efficacy for diminutive, overlapping and obscured targets, a P2 sampling layer is incorporated into the neck network, and a small target detection head is introduced into the head network; SIoU is used as the loss function to accelerate the convergence speed and generalization ability of the model. The research results indicate that the method improves 2.2%, 1.7%, and 1.4% in precision, recall, and Map50 values, respectively, under the premise of guaranteeing a detection speed of 1.9ms per frame. And then, the SGBM algorithm is applied to match features between the target regions in the left and right images from the stereo camera, and to compute the disparity and positional coordinates. Finally, the water surface garbage detection and localization system is built on Jetson Xavier NX for validation, and the experimental results indicate that the proposed method achieves an error of less than 0.29m with a detection speed of up to 20 FPS, ensuring high-precision and time-efficient water surface target identification and localization.
AbstractList In complex water surface environments, real-time detection and localization of water surface garbage targets are essential for the advancement of water surface cleaning robots. To meet the practical needs of high accuracy and low time consumption, this paper presents a method for identifying and localizing water surface debris using YOLOv8-WSG, which integrates deep learning with vision-based detection and localization techniques. Firstly, relying on the FloW - Img dataset, the water surface garbage homemade dataset was constructed by means of web search, on-site collection, and manual labeling. Secondly, based on YOLOv8, To enhance the detection efficacy for diminutive, overlapping and obscured targets, a P2 sampling layer is incorporated into the neck network, and a small target detection head is introduced into the head network; SIoU is used as the loss function to accelerate the convergence speed and generalization ability of the model. The research results indicate that the method improves 2.2%, 1.7%, and 1.4% in precision, recall, and Map50 values, respectively, under the premise of guaranteeing a detection speed of 1.9ms per frame. And then, the SGBM algorithm is applied to match features between the target regions in the left and right images from the stereo camera, and to compute the disparity and positional coordinates. Finally, the water surface garbage detection and localization system is built on Jetson Xavier NX for validation, and the experimental results indicate that the proposed method achieves an error of less than 0.29m with a detection speed of up to 20 FPS, ensuring high-precision and time-efficient water surface target identification and localization.
Author Li, Liang
Li, Yiping
Wang, Hailin
Bao, Han
Wang, Yuliang
Author_xml – sequence: 1
  givenname: Han
  surname: Bao
  fullname: Bao, Han
  email: baohan@sia.cn
  organization: Shenyang Institute of Automation,State Key Laboratory of Robotics, Chinese Academy of Sciences,Shenyang,China,110016
– sequence: 2
  givenname: Yuliang
  surname: Wang
  fullname: Wang, Yuliang
  email: wangyuliang2@sia.cn
  organization: Shenyang Institute of Automation,State Key Laboratory of Robotics, Chinese Academy of Sciences,Shenyang,China,110016
– sequence: 3
  givenname: Yiping
  surname: Li
  fullname: Li, Yiping
  email: lyp@sia.cn
  organization: Shenyang Institute of Automation,State Key Laboratory of Robotics, Chinese Academy of Sciences,Shenyang,China,110016
– sequence: 4
  givenname: Liang
  surname: Li
  fullname: Li, Liang
  email: liliang@sia.cn
  organization: Shenyang Institute of Automation,State Key Laboratory of Robotics, Chinese Academy of Sciences,Shenyang,China,110016
– sequence: 5
  givenname: Hailin
  surname: Wang
  fullname: Wang, Hailin
  email: wanghailin@sia.cn
  organization: Shenyang Institute of Automation,State Key Laboratory of Robotics, Chinese Academy of Sciences,Shenyang,China,110016
BookMark eNo1kM1OwzAQhA0CCSh9Aw5-gZS1N07sYxVKQSpClAqO1dre0KCSoDgc4OmJ-DmN9M2nOcyZOGq7loWQCmZKgbus5tV6URQqVzMN2oxQKWeMOhBTVzqLqAxom-OhONXOlVmhS3sipim9AgBqwLE7FQ9rTkx92MmulU9Naro285Q4ymcauJePH31NgeWSek8vLK944DCMlqQ2ylUXaN980Q-442HXxXQujmvaJ57-5URsrheb6iZb3S9vq_kqaxwOWVDACurCMmOoQ64jBUIg7UpLETBQ9MVoBKVzizH3EHJjCK03ZfBe40Rc_M42zLx975s36j-3_yfgNx72U9s
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/CACRE66141.2025.11119551
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEL(IEEE/IET Electronic Library )
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
EISBN 9798331502843
EISSN 2997-6278
EndPage 144
ExternalDocumentID 11119551
Genre orig-research
GroupedDBID 6IE
6IF
6IL
6IN
ABLEC
ADZIZ
ALMA_UNASSIGNED_HOLDINGS
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CBEJK
CHZPO
IEGSK
OCL
RIE
RIL
ID FETCH-LOGICAL-i93t-c10e10f68ee3cfc42daca30a2978ad03cadb6e10c12483d4b0c455a38b57cbb23
IEDL.DBID RIE
IngestDate Wed Aug 27 07:37:54 EDT 2025
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i93t-c10e10f68ee3cfc42daca30a2978ad03cadb6e10c12483d4b0c455a38b57cbb23
PageCount 8
ParticipantIDs ieee_primary_11119551
PublicationCentury 2000
PublicationDate 2025-July-16
PublicationDateYYYYMMDD 2025-07-16
PublicationDate_xml – month: 07
  year: 2025
  text: 2025-July-16
  day: 16
PublicationDecade 2020
PublicationTitle International Conference on Automation, Control and Robotics Engineering (Online)
PublicationTitleAbbrev CACRE
PublicationYear 2025
Publisher IEEE
Publisher_xml – name: IEEE
SSID ssj0003203843
Score 1.9198576
Snippet In complex water surface environments, real-time detection and localization of water surface garbage targets are essential for the advancement of water surface...
SourceID ieee
SourceType Publisher
StartPage 137
SubjectTerms Accuracy
binocular localization
Deep learning
Feature extraction
Head
Location awareness
Neck
Object detection
Real-time systems
SGBM
Surface cleaning
target detection
Visualization
water surface garbage
YOLOv8
Title Research on Vision-based Water Surface Garbage Detection and Localization Methods
URI https://ieeexplore.ieee.org/document/11119551
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LSwMxEA7akycVK77JwWu2u3nsbo5SW4vY4qNqbyWZZEWEXam7F3-9SdqtKAjeQkggTEi-mcn3ZRA6h1zb3PnxxHnjmnBVuHtQioJkDgsNU1wkygeK40k6euTXMzFbidWDFsZaG8hnNvLN8JZvKmh8qqznj7cUXjC9mWVyKdZaJ1QYjVnOWcvWiWWvf9G_H3j88XEgFVE7_UchlYAjw200aVewpI-8RU2tI_j89Tnjv5e4g7rfkj18uwajXbRhyz101_LqcFXip6AiJx62DH52LuYCPzSLQrmJV_7R4cXiS1sHZlaJVWnwjYe5lUwTj0Ol6Y8umg4H0_6IrGookFfJagJJbJO4SHNrGRTAqVGgWKyoCx6ViRkoo1M3AhzM58xwHQMXQrFciwy0pmwfdcqqtAcIu1hZg8wME1rzRHJplLOtSIGmwLmih6jrzTF_X_6SMW8tcfRH_zHa8rvi86RJeoI69aKxpw7ga30WNvYLiIWk3Q
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1bS8MwGA0yH_RJxYl38-BrujSXtnmUuTl1G16m7m3k1iFCJ7N98debZOtEQfCtlLaEL6TnO1_OyQfAuc6UzVwej1w2rhCTufsPCp6j1GGhoZLxWHqiOBgmvSd2M-bjpVk9eGGstUF8ZiN_GfbyzUxXvlTW8stbcG-YXueOVqQLu9aqpEIJphmjtV4Hi1b7ov3Q8QjkmSDhUf2BH61UApJ0t8CwHsNCQPIWVaWK9Oev4xn_Pcht0Pw27cG7FRztgDVb7IL7WlkHZwV8Dj5y5IHLwBeXZM7hYzXPpXvxym87TC28tGXQZhVQFgb2PdAtjZpwEHpNfzTBqNsZtXto2UUBvQpaIh1jG-M8yaylOteMGKklxZI4-igNploalbgntAP6jBqmsGacS5opnmqlCN0DjWJW2H0AHVtWWqSGcqVYLJgw0sWWJ5okmjFJDkDTh2PyvjgnY1JH4vCP-2dgozca9Cf96-HtEdj0M-SrpnFyDBrlvLInDu5LdRom-QtjGagu
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=International+Conference+on+Automation%2C+Control+and+Robotics+Engineering+%28Online%29&rft.atitle=Research+on+Vision-based+Water+Surface+Garbage+Detection+and+Localization+Methods&rft.au=Bao%2C+Han&rft.au=Wang%2C+Yuliang&rft.au=Li%2C+Yiping&rft.au=Li%2C+Liang&rft.date=2025-07-16&rft.pub=IEEE&rft.eissn=2997-6278&rft.spage=137&rft.epage=144&rft_id=info:doi/10.1109%2FCACRE66141.2025.11119551&rft.externalDocID=11119551