超越单一感知的农田害虫检测算法MRA-YOLOX

TP391.4; 目标检测技术正逐步应用于农业,然而在农田害虫检测的运用中仍存在检测速度慢、检测准确率偏低的问题,且仅仅预测害虫的种类和位置信息不足以满足复杂的工程需求.提出一种可以额外预测害虫状态信息的融合MAE和 YOLOX算法的高速高精度农田害虫检测模型 MRA-YOLOX(masked autoencoders and rapid aim detection-exceeding YOLO).算法构建包含近4万张图片以及5万余标注的数据集TDBFP(target detection dataset be used for farmland pests),TDBFP数据集标注了 10种害虫...

Full description

Saved in:
Bibliographic Details
Published in计算机工程与应用 Vol. 60; no. 16; pp. 206 - 216
Main Authors 王中天, 邹颖波, 吴昌霖, 李新
Format Journal Article
LanguageChinese
Published 桂林理工大学 信息科学与工程学院,广西 桂林 541006 15.08.2024
广西嵌入式技术与智能系统重点实验室,广西 桂林 541006
Subjects
Online AccessGet full text
ISSN1002-8331
DOI10.3778/j.issn.1002-8331.2305-0318

Cover

Abstract TP391.4; 目标检测技术正逐步应用于农业,然而在农田害虫检测的运用中仍存在检测速度慢、检测准确率偏低的问题,且仅仅预测害虫的种类和位置信息不足以满足复杂的工程需求.提出一种可以额外预测害虫状态信息的融合MAE和 YOLOX算法的高速高精度农田害虫检测模型 MRA-YOLOX(masked autoencoders and rapid aim detection-exceeding YOLO).算法构建包含近4万张图片以及5万余标注的数据集TDBFP(target detection dataset be used for farmland pests),TDBFP数据集标注了 10种害虫的生长状态、物种类别以及位置,以便更好地把握害虫信息,从而更准确地制定对策.修改YOLOX模型的解耦头及loss,额外输出生长状态,以改进模型预测更多信息;将ECA(efficient channel attention)和SA(shuffle attention)注意力机制进行有机融合,并插入backbone与 FPN(feature pyr-amid networks)的连接过程以及FPN的通道堆叠过程,以便能够增强获得全局信息和丰富上下文信息的能力,从而取得比单一注意力机制更好的效果;将MAE中自监督解码器部分插入YOLOX的数据增强部分,扩大感受野,增强识别细粒度,获得超越mixup和mosaic的数据增强效果.实验结果表明,当需要同时感知目标的状态、分类和位置时,MRA-YOLOX相较于原始YOLOX模型对于TDBFP数据集的检测精度mAP@0.5由60.1%上升到88.2%,平均检测准确率提高了 18.8个百分点,且检测帧率达到145 FPS,可以用于更复杂的工程实践.
AbstractList TP391.4; 目标检测技术正逐步应用于农业,然而在农田害虫检测的运用中仍存在检测速度慢、检测准确率偏低的问题,且仅仅预测害虫的种类和位置信息不足以满足复杂的工程需求.提出一种可以额外预测害虫状态信息的融合MAE和 YOLOX算法的高速高精度农田害虫检测模型 MRA-YOLOX(masked autoencoders and rapid aim detection-exceeding YOLO).算法构建包含近4万张图片以及5万余标注的数据集TDBFP(target detection dataset be used for farmland pests),TDBFP数据集标注了 10种害虫的生长状态、物种类别以及位置,以便更好地把握害虫信息,从而更准确地制定对策.修改YOLOX模型的解耦头及loss,额外输出生长状态,以改进模型预测更多信息;将ECA(efficient channel attention)和SA(shuffle attention)注意力机制进行有机融合,并插入backbone与 FPN(feature pyr-amid networks)的连接过程以及FPN的通道堆叠过程,以便能够增强获得全局信息和丰富上下文信息的能力,从而取得比单一注意力机制更好的效果;将MAE中自监督解码器部分插入YOLOX的数据增强部分,扩大感受野,增强识别细粒度,获得超越mixup和mosaic的数据增强效果.实验结果表明,当需要同时感知目标的状态、分类和位置时,MRA-YOLOX相较于原始YOLOX模型对于TDBFP数据集的检测精度mAP@0.5由60.1%上升到88.2%,平均检测准确率提高了 18.8个百分点,且检测帧率达到145 FPS,可以用于更复杂的工程实践.
Abstract_FL At present,target detection technology is gradually applied in agriculture,but there are still problems in the application of farmland pest detection,such as slow detection speed and low detection accuracy,and only predicting the type and location information of pests is not enough to meet the complex engineering needs.In this paper,a high speed and high precision farmland pest detection model MRA-YOLOX(masked autoencoders and rapid aim detection-exceeding YOLO),which can be used to predict additional pest status information,is proposed by fusing MAE and YOLOX algo-rithm.By constructing nearly 40 000 pictures and more than 50 000 labels dataset TDBFP(target detection dataset be used for farmland pests),the TDBFP dataset labels the growth status,species category and location of 10 kinds of pests,so as to better grasp the information of pests and develop more accurate countermeasures.Firstly,the decoupling head and loss of YOLOX model are modified to output additional growth states to improve model prediction.Secondly,ECA and SA attention mechanisms are organically integrated,and the connection process between backbone and FPN and the channel stacking process of FPN are inserted,so as to enhance the ability to obtain global information and enrich context information and achieve better results than a single attention mechanism.Finally,the self-supervised decoder part of MAE is inserted into the data enhancement part of YOLOX in order to expand the receptive field,enhance the recognition granularity,and obtain the data enhancement effect beyond mixup and mosaic.Experimental results show that when it is necessary to perceive the state,classification and position of the target at the same time,compared with the original YOLOX model,the detection accuracy of MLA-YOLOX for TDBFP dataset mAP@0.5 increases from 60.1%to 88.2%,the average detection accuracy increases by 18.8 percentage points,and the detection frame rate reaches 145 FPS,MLA-YOLOX can be used for more complex engineering.
Author 王中天
李新
邹颖波
吴昌霖
AuthorAffiliation 桂林理工大学 信息科学与工程学院,广西 桂林 541006;广西嵌入式技术与智能系统重点实验室,广西 桂林 541006
AuthorAffiliation_xml – name: 桂林理工大学 信息科学与工程学院,广西 桂林 541006;广西嵌入式技术与智能系统重点实验室,广西 桂林 541006
Author_FL WU Changlin
ZOU Yingbo
WANG Zhongtian
LI Xin
Author_FL_xml – sequence: 1
  fullname: WANG Zhongtian
– sequence: 2
  fullname: ZOU Yingbo
– sequence: 3
  fullname: WU Changlin
– sequence: 4
  fullname: LI Xin
Author_xml – sequence: 1
  fullname: 王中天
– sequence: 2
  fullname: 邹颖波
– sequence: 3
  fullname: 吴昌霖
– sequence: 4
  fullname: 李新
BookMark eNo9jrtKA0EARaeIYIz5CTuLWec9s2UIvmBlQRS0Cju7syGLTMBBZLsU0UJJZxBFSCFWQhCCwQ9yH59hQLG6cIpz7gZo2KE1AGxh5FEp1U7mDZyzHkaIQEUp9ghFHCKKVQM0_-k6aDs30IhjKrmkfhOIenlTL--KyfT7a1SOZ9XsrXoeF7cv1cNHMV_UT-_l66j8vK_mj-VienTcgedhEJ5tgrU0unCm_bctcLq3e9I9gEG4f9jtBNBhJBCME8xSyX2ksTGaJTrxUyESIShThmrOleZGUiaZXL3jsZJpJPwkJoRxrBihLbD9672ObBrZfi8bXl3aVbGXuawf53lOEGFYIILoD988Wxg
ClassificationCodes TP391.4
ContentType Journal Article
Copyright Copyright © Wanfang Data Co. Ltd. All Rights Reserved.
Copyright_xml – notice: Copyright © Wanfang Data Co. Ltd. All Rights Reserved.
DBID 2B.
4A8
92I
93N
PSX
TCJ
DOI 10.3778/j.issn.1002-8331.2305-0318
DatabaseName Wanfang Data Journals - Hong Kong
WANFANG Data Centre
Wanfang Data Journals
万方数据期刊 - 香港版
China Online Journals (COJ)
China Online Journals (COJ)
DatabaseTitleList
DeliveryMethod fulltext_linktorsrc
Discipline Engineering
DocumentTitle_FL MRA-YOLOX for Pest Detection in Farmland Beyond Single Perception
EndPage 216
ExternalDocumentID jsjgcyyy202416020
GrantInformation_xml – fundername: (广西自然科学基金项目); (广西嵌入式技术与智能系统重点实验室开放基金)
  funderid: (广西自然科学基金项目); (广西嵌入式技术与智能系统重点实验室开放基金)
GroupedDBID -0Y
2B.
4A8
5XA
5XJ
92H
92I
93N
ABJNI
ACGFS
ALMA_UNASSIGNED_HOLDINGS
CCEZO
CUBFJ
CW9
PSX
TCJ
TGT
U1G
U5S
ID FETCH-LOGICAL-s1060-cd14f7590b1eeb4dbd9f66d66348e3b558b5e7347473315c87fa69dc224518423
ISSN 1002-8331
IngestDate Thu May 29 04:10:55 EDT 2025
IsPeerReviewed false
IsScholarly false
Issue 16
Keywords 掩码自编码器(MAE)
masked autoencoders(MAE)
注意力机制
state detection
YOLOX
害虫识别
attention mechanism
pest detection
状态检测
Language Chinese
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-s1060-cd14f7590b1eeb4dbd9f66d66348e3b558b5e7347473315c87fa69dc224518423
PageCount 11
ParticipantIDs wanfang_journals_jsjgcyyy202416020
PublicationCentury 2000
PublicationDate 2024-08-15
PublicationDateYYYYMMDD 2024-08-15
PublicationDate_xml – month: 08
  year: 2024
  text: 2024-08-15
  day: 15
PublicationDecade 2020
PublicationTitle 计算机工程与应用
PublicationTitle_FL Computer Engineering and Applications
PublicationYear 2024
Publisher 桂林理工大学 信息科学与工程学院,广西 桂林 541006
广西嵌入式技术与智能系统重点实验室,广西 桂林 541006
Publisher_xml – name: 广西嵌入式技术与智能系统重点实验室,广西 桂林 541006
– name: 桂林理工大学 信息科学与工程学院,广西 桂林 541006
SSID ssib051375739
ssib001102935
ssj0000561668
ssib023646291
ssib057620132
Score 1.995603
Snippet TP391.4; 目标检测技术正逐步应用于农业,然而在农田害虫检测的运用中仍存在检测速度慢、检测准确率偏低的问题,且仅仅预测害虫的种类和位置信息不足以满足复杂的工程需求.提...
SourceID wanfang
SourceType Aggregation Database
StartPage 206
Title 超越单一感知的农田害虫检测算法MRA-YOLOX
URI https://d.wanfangdata.com.cn/periodical/jsjgcyyy202416020
Volume 60
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVEBS
  databaseName: Inspec with Full Text
  issn: 1002-8331
  databaseCode: ADMLS
  dateStart: 20200501
  customDbUrl:
  isFulltext: true
  dateEnd: 99991231
  titleUrlDefault: https://www.ebsco.com/products/research-databases/inspec-full-text
  omitProxy: false
  ssIdentifier: ssib057620132
  providerName: EBSCOhost
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1LaxRBEG7yuOhBfOKbIPYpTJyemX4de7K7BEkMaALxFOYZyWEFkxySUw7Rg-LNIIqQg3gSgxAM_iCzm59hVU9nZmMiRC9DU_Nt1ddd21PVQ1cPIffzNC_9JMm9TKewQFEq9JKclR5LeBH6eYFbo3C3xSMxNR89XOALQ8PfBnYtra2mE9nGqXUl_-NVkIFfsUr2HzxbKwUBtMG_cAUPw_VMPqZtRWNBFT9qGNrmVLWoBklEY4X7GNogj6ju0LbEq-G2YVCIYEH1pJUA3keJadM4RIVaUxPjz03o9MSAjxEMGC2tJARbM4-N93R2enZhMNNFDQAz7BgebMWWZCwdE6Oszopt294ySKaiZOqdtiiA-zXWtCxZgOgGoqkKaKyxYQKqhaNoggbCqfZpHFkyYHESscBKiwYCd1qWi0AVsT_4ZiSI8FVvVRtq_8t2gBjaRXjVT4k2YGSbflZMpW20qBHjthcdOzzgH0aN9Q8gNKtBx0fljwGrMdo6KoC5YmEa1eLYQ7szfpLdOI-Y74uBYGSj1VFFm4tW1dcXjmblsdjjflu4SCROi5ChlMpGSLQwUVvAegDu4QO-yQvq3ZrLK8tL2fr6Oo4xE7C-GCajAQRRf4SMmtbM9JMm_4Z0VTf5N36cQATNYUychZLL5hhajkrc0aHuMH7BhCtfdcyqg4KR9oO_k7YVd90y6S4NJIdzF8kFt6obM9UUvUSGNp5dJucHzvq8QsTh_svD_dcHb7d__dzsbe30d770P24dvPrUf_f9YHfv8MPX3ufN3o83_d33vb3tekpdJfOd9tzklOc-WuKtMF_4XpazqJRc-ykrijSCZ6EuhcghsY9UEaacq5QXMoxgGQ9d4JmSZSJ0nkEqzZmCxc01MtJ93i2ukzFRFiKTpWZlmMOqJtdCFbIoeBmVScmS7Aa553q96B5KK4snfHXzLKBb5FwzgW6TkdUXa8UdSLZX07vOxb8Bv02ZXQ
linkProvider EBSCOhost
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=%E8%B6%85%E8%B6%8A%E5%8D%95%E4%B8%80%E6%84%9F%E7%9F%A5%E7%9A%84%E5%86%9C%E7%94%B0%E5%AE%B3%E8%99%AB%E6%A3%80%E6%B5%8B%E7%AE%97%E6%B3%95MRA-YOLOX&rft.jtitle=%E8%AE%A1%E7%AE%97%E6%9C%BA%E5%B7%A5%E7%A8%8B%E4%B8%8E%E5%BA%94%E7%94%A8&rft.au=%E7%8E%8B%E4%B8%AD%E5%A4%A9&rft.au=%E9%82%B9%E9%A2%96%E6%B3%A2&rft.au=%E5%90%B4%E6%98%8C%E9%9C%96&rft.au=%E6%9D%8E%E6%96%B0&rft.date=2024-08-15&rft.pub=%E6%A1%82%E6%9E%97%E7%90%86%E5%B7%A5%E5%A4%A7%E5%AD%A6+%E4%BF%A1%E6%81%AF%E7%A7%91%E5%AD%A6%E4%B8%8E%E5%B7%A5%E7%A8%8B%E5%AD%A6%E9%99%A2%2C%E5%B9%BF%E8%A5%BF+%E6%A1%82%E6%9E%97+541006&rft.issn=1002-8331&rft.volume=60&rft.issue=16&rft.spage=206&rft.epage=216&rft_id=info:doi/10.3778%2Fj.issn.1002-8331.2305-0318&rft.externalDocID=jsjgcyyy202416020
thumbnail_s http://utb.summon.serialssolutions.com/2.0.0/image/custom?url=http%3A%2F%2Fwww.wanfangdata.com.cn%2Fimages%2FPeriodicalImages%2Fjsjgcyyy%2Fjsjgcyyy.jpg