LMUAV-YOLOv8:低空无人机视觉目标检测轻量化网络
TP391.4; 针对低空无人机目标检测面临目标尺度变化大、小目标容易漏检和误检的挑战,发展了一种融合多尺度特征的目标检测轻量化网络(LMUAV-YOLOv8),通过开展消融和对比实验,验证了算法的有效性和先进性,并借助类激活图,对模型的决策过程进行了解释.设计了一种轻量化的特征融合网络(UAV_RepGFPN),提出新的特征融合路径以及特征融合模块DBB GELAN,降低参数量和计算量的同时,提高特征融合网络的性能.使用部分卷积(PConv)和三重注意力机制(Triplet Attention)构建特征提取模块(FTA_C2f),并引入ADown下采样模块,通过对输入特征图维度的重新排列和细...
        Saved in:
      
    
          | Published in | 计算机工程与应用 Vol. 61; no. 3; pp. 94 - 110 | 
|---|---|
| Main Authors | , , | 
| Format | Journal Article | 
| Language | Chinese | 
| Published | 
            河北经贸大学管理科学与信息工程学院,石家庄 050061
    
        01.02.2025
     | 
| Subjects | |
| Online Access | Get full text | 
| ISSN | 1002-8331 | 
| DOI | 10.3778/j.issn.1002-8331.2407-0127 | 
Cover
| Abstract | TP391.4; 针对低空无人机目标检测面临目标尺度变化大、小目标容易漏检和误检的挑战,发展了一种融合多尺度特征的目标检测轻量化网络(LMUAV-YOLOv8),通过开展消融和对比实验,验证了算法的有效性和先进性,并借助类激活图,对模型的决策过程进行了解释.设计了一种轻量化的特征融合网络(UAV_RepGFPN),提出新的特征融合路径以及特征融合模块DBB GELAN,降低参数量和计算量的同时,提高特征融合网络的性能.使用部分卷积(PConv)和三重注意力机制(Triplet Attention)构建特征提取模块(FTA_C2f),并引入ADown下采样模块,通过对输入特征图维度的重新排列和细粒度调整,以提升模型中深层网络对空间特征的捕捉能力,并进一步降低参数量和计算量.优化YOLOv9的可编程梯度信息(programmable gradient information,PGI)策略,设计基于上下文引导(Con-text_guided)的可逆架构,并额外生成三个辅助检测头,提出UAV_PGI可编程梯度方法,避免传统深度监督中多路径特征集成可能导致的语义信息损失.为了验证模型的有效性及泛化能力,在VisDrone 2019测试集上开展了对比实验,结果显示,与YOLOv8s相比,LMUAV-YOLOv8s的准确度、召回率、mAP@0.5和mAP@0.5:0.95等指标分别提升了 4.2、3.9、5.1和3.0个百分点,同时参数量减少了 63.9%,计算量仅增加0.4 GFLOPs,实现了检测性能与资源消耗的良好平衡.基于NVIDIA Jetson Xavier NX嵌入式平台的推理实验结果显示:与基线模型相比,该算法能够在满足实时检测要求的条件下,获得更高的检测精度,对于无人机实时目标检测场景具有较好的适用性.借助类激活图,对算法的决策过程进行了可视化分析,结果表明,该模型具备更优异的小尺度特征提取和高分辨率处理能力. | 
    
|---|---|
| AbstractList | TP391.4; 针对低空无人机目标检测面临目标尺度变化大、小目标容易漏检和误检的挑战,发展了一种融合多尺度特征的目标检测轻量化网络(LMUAV-YOLOv8),通过开展消融和对比实验,验证了算法的有效性和先进性,并借助类激活图,对模型的决策过程进行了解释.设计了一种轻量化的特征融合网络(UAV_RepGFPN),提出新的特征融合路径以及特征融合模块DBB GELAN,降低参数量和计算量的同时,提高特征融合网络的性能.使用部分卷积(PConv)和三重注意力机制(Triplet Attention)构建特征提取模块(FTA_C2f),并引入ADown下采样模块,通过对输入特征图维度的重新排列和细粒度调整,以提升模型中深层网络对空间特征的捕捉能力,并进一步降低参数量和计算量.优化YOLOv9的可编程梯度信息(programmable gradient information,PGI)策略,设计基于上下文引导(Con-text_guided)的可逆架构,并额外生成三个辅助检测头,提出UAV_PGI可编程梯度方法,避免传统深度监督中多路径特征集成可能导致的语义信息损失.为了验证模型的有效性及泛化能力,在VisDrone 2019测试集上开展了对比实验,结果显示,与YOLOv8s相比,LMUAV-YOLOv8s的准确度、召回率、mAP@0.5和mAP@0.5:0.95等指标分别提升了 4.2、3.9、5.1和3.0个百分点,同时参数量减少了 63.9%,计算量仅增加0.4 GFLOPs,实现了检测性能与资源消耗的良好平衡.基于NVIDIA Jetson Xavier NX嵌入式平台的推理实验结果显示:与基线模型相比,该算法能够在满足实时检测要求的条件下,获得更高的检测精度,对于无人机实时目标检测场景具有较好的适用性.借助类激活图,对算法的决策过程进行了可视化分析,结果表明,该模型具备更优异的小尺度特征提取和高分辨率处理能力. | 
    
| Abstract_FL | To tackle the challenges of weak sensing capacity and high missed detection rates for small-scale objects using low-altitude UAV in complex traffic scenarios,the LMUAV-YOLOv8 algorithm is proposed.Its efficiency and advantage are verified through ablation and comparative experiments.The internal mechanisms is visualized by using the method of class activation mapping.In this dissertation,a lightweight feature fusion network(UAV_RepGFPN)is introduced firstly,proposing new feature fusion paths and a feature fusion module DBB_GELAN,which reduces the number of parameters and computation while improving the performance of the feature fusion network.Secondly,the feature extraction module(FTA_C2f)is constructed using partial convolution(PConv)and triplet attention mechanism(Triplet Attention),and the ADown down-sampling module is introduced.By rearranging the dimensions of the input feature maps and making fine-grained adjustments,the ability of the deep network to capture spatial features is enhanced,further reducing the num-ber of parameters and computation.Then,concerning large amount of information loss during in layer-by-layer feature extraction and spatial transformation,a new context-guided programmable gradient information(UAV_PGI)strategy is proposed.By designing a context-guided reversible architecture and three additional auxiliary detection heads,UAV_PGI significantly enhance detection capabilities for aerial objects.In order to verify the validity and generalization ability of the model,comparative experiments are carried out on the VisDrone 2019 test set,and the results show that:compared with YOLOv8s,LMUAV-YOLOv8s on the VisDrone 2019 test set improves precision,recall,mAP@0.5,and mAP@0.5:0.95 by 4.2,3.9,5.1,and 3.0 percentage points,separately,with the computational cost increased by only 0.4 GFLOPs and the parameter count reduced by 63.9%,meaning a good balance between performance and cost.The inference experi-mental results based on NVIDIA Jetson Xavier NX embedded platform show that compared with the baseline model,the proposed algorithm can obtain higher detection accuracy under the condition of meeting the requirements of real-time detection,rendering it more suitable for real-time target detection scenarios in drones.Finally,the decision making pro-cess is visualized by using the method of class activation mapping,which provides a intuitive way to understand the internal mechanisms of the networ.And the results show that the proposed model has superior small-scale feature extraction and high-resolution processing capabilities. | 
    
| Author | 侯少杰 董一兵 曾辉  | 
    
| AuthorAffiliation | 河北经贸大学管理科学与信息工程学院,石家庄 050061 | 
    
| AuthorAffiliation_xml | – name: 河北经贸大学管理科学与信息工程学院,石家庄 050061 | 
    
| Author_FL | ZENG Hui HOU Shaojie DONG Yibing  | 
    
| Author_FL_xml | – sequence: 1 fullname: DONG Yibing – sequence: 2 fullname: ZENG Hui – sequence: 3 fullname: HOU Shaojie  | 
    
| Author_xml | – sequence: 1 fullname: 董一兵 – sequence: 2 fullname: 曾辉 – sequence: 3 fullname: 侯少杰  | 
    
| BookMark | eNo9jT9Lw0AcQG-oYK39Em4Oib_LL5e7ExxK8R9EsljBqVyTXGmQFDxUujlZpKK46GCHgkNxcXNJybepSfwWCorTgze8t0Zq6TCNCdmgYCPnYiuxB8akNgVwLIFIbccFbgF1eI3U_-0qaRoz6AGjyBlHWSc7_lGndWKdBn5wKbaX-X35lhXPs2WWFdOsmt9U89vy5b2YjYvX6-JjUuWLr_HD591TmT-Wi-k6WdHqzMTNPzZIZ2_3uH1g-cH-YbvlW4aCJ6wIeOxG4EWgQAsAyQSPqFQaNOOx5KHu6Rg9B6hkGAnlohdKGmolJSBHhg2y-du9UqlWab-bDC_O059jNzFJPxyNRg44DBBA4DfEHF9N | 
    
| ClassificationCodes | TP391.4 | 
    
| ContentType | Journal Article | 
    
| Copyright | Copyright © Wanfang Data Co. Ltd. All Rights Reserved. | 
    
| Copyright_xml | – notice: Copyright © Wanfang Data Co. Ltd. All Rights Reserved. | 
    
| DBID | 2B. 4A8 92I 93N PSX TCJ  | 
    
| DOI | 10.3778/j.issn.1002-8331.2407-0127 | 
    
| DatabaseName | Wanfang Data Journals - Hong Kong WANFANG Data Centre Wanfang Data Journals 万方数据期刊 - 香港版 China Online Journals (COJ) China Online Journals (COJ)  | 
    
| DatabaseTitleList | |
| DeliveryMethod | fulltext_linktorsrc | 
    
| Discipline | Engineering | 
    
| DocumentTitle_FL | LMUAV-YOLOv8:Lightweight Network for Object Detection in Low-Altitude UAV Vision | 
    
| EndPage | 110 | 
    
| ExternalDocumentID | jsjgcyyy202503008 | 
    
| GrantInformation_xml | – fundername: (河北省高等教育教学改革研究与实践项目); 河北省引进国外智力项目; (河北经贸大学科学研究与发展计划项目); (河北经贸大学研究生创新资助项目) funderid: (河北省高等教育教学改革研究与实践项目); 河北省引进国外智力项目; (河北经贸大学科学研究与发展计划项目); (河北经贸大学研究生创新资助项目)  | 
    
| GroupedDBID | -0Y 2B. 4A8 5XA 5XJ 92H 92I 93N ABJNI ACGFS ALMA_UNASSIGNED_HOLDINGS CCEZO CUBFJ CW9 PSX TCJ TGT U1G U5S  | 
    
| ID | FETCH-LOGICAL-s1068-d07e4d06d0a0f8009587d19af0f57e97cfbfe36201953d8a436c91cfa99037353 | 
    
| ISSN | 1002-8331 | 
    
| IngestDate | Thu May 29 04:10:55 EDT 2025 | 
    
| IsPeerReviewed | false | 
    
| IsScholarly | false | 
    
| Issue | 3 | 
    
| Keywords | small object detection 可编程梯度信息 multi-scale programable gradient information YOLOv8 小目标检测 多尺度 轻量化 lightweight  | 
    
| Language | Chinese | 
    
| LinkModel | OpenURL | 
    
| MergedId | FETCHMERGED-LOGICAL-s1068-d07e4d06d0a0f8009587d19af0f57e97cfbfe36201953d8a436c91cfa99037353 | 
    
| PageCount | 17 | 
    
| ParticipantIDs | wanfang_journals_jsjgcyyy202503008 | 
    
| PublicationCentury | 2000 | 
    
| PublicationDate | 2025-02-01 | 
    
| PublicationDateYYYYMMDD | 2025-02-01 | 
    
| PublicationDate_xml | – month: 02 year: 2025 text: 2025-02-01 day: 01  | 
    
| PublicationDecade | 2020 | 
    
| PublicationTitle | 计算机工程与应用 | 
    
| PublicationTitle_FL | Computer Engineering and Applications | 
    
| PublicationYear | 2025 | 
    
| Publisher | 河北经贸大学管理科学与信息工程学院,石家庄 050061 | 
    
| Publisher_xml | – name: 河北经贸大学管理科学与信息工程学院,石家庄 050061 | 
    
| SSID | ssib051375739 ssib001102935 ssj0000561668 ssib023646291 ssib057620132  | 
    
| Score | 2.0062392 | 
    
| Snippet | TP391.4; 针对低空无人机目标检测面临目标尺度变化大、小目标容易漏检和误检的挑战,发展了一种融合多尺度特征的目标检测轻量化网络(LMUAV-YOLOv8),通过开展消融和对比实验,... | 
    
| SourceID | wanfang | 
    
| SourceType | Aggregation Database | 
    
| StartPage | 94 | 
    
| Title | LMUAV-YOLOv8:低空无人机视觉目标检测轻量化网络 | 
    
| URI | https://d.wanfangdata.com.cn/periodical/jsjgcyyy202503008 | 
    
| Volume | 61 | 
    
| hasFullText | 1 | 
    
| inHoldings | 1 | 
    
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVEBS databaseName: Inspec with Full Text issn: 1002-8331 databaseCode: ADMLS dateStart: 20200501 customDbUrl: isFulltext: true dateEnd: 99991231 titleUrlDefault: https://www.ebsco.com/products/research-databases/inspec-full-text omitProxy: false ssIdentifier: ssib057620132 providerName: EBSCOhost  | 
    
| link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1LTxRBEO4gXPRgfMZ3iLFPZrBn--1tBmZDDMhBMHgiszszGA5rImACJ08So9F40YPEkHggXrx5WcK_wV38F1b19uyMqAl62VS6qru-qpreqp509xByy-StVOcFDxrCZIEwrAgMz2wQFmkqsiLLZYHvO2bvq-kFcW9RLo6c-FTbtbS-1ppob_7xXMn_RBXaIK54SvYfIjscFBqAhvjCL0QYfo8V45nZhehh8GhuZu6ZgT40ETSeoiahiaaRpTG0KGqBZo4VlS2TjjA00tSokrDYy8Y0SlAGuhjtCI6bIYCIJTUxCoOKGAiLAqZJE2ifpFZhd2DZ0BExaKkXvk5LQiPHBcLqGhJJY2iUjmWcFkBrnCESBaxw2ARwywcExwNVgM7LMgcExGUlotCeOHGgE2psxRHYEDnwMXOYQRbsYvW3IA1Zbpz2z61zQ4PGvDRae1vRDaDDIQFWJNCjSEzRSHmLB6Zb5lyuUQDVDmWGFgPRdMIKqvwS4xH3DHtZai2NGjBR3OBNjw3Uxco7z4jbTGINWcs_LkGVh9h8gqoEqvcXLtsMPg_t65ZwsD34aErkWhuXEnH8ieH4E7iSD3DfQVUIDLdnrqyuLLc3NjbQ0ZAE8Cz9WAOyJhslY9HU7MyDquCG-tRWBTd-jUA1qtuXZMi11NW9sxIH8XeF-tv3Vaj8eVWPbHAzMMK-83fQ7ohdp0g7y7VqcP4MOe2XcePRYE6eJSObj8-RU7XLPc-TX2bn3YP9N_0v3d6HnYNut7fdPdx9cbj7sv_xa29nq_f5ee_bq8P9vR9bb7-_ft_ff9ff275AFprJ_OR04D9WEqyGTJkgYzoXGVMZS1lhcOVidBbatGCF1LnV7aJV5BzNt5JnJhVctW3YLlIoB7nmkl8ko50nnfwSGS_AZy1YTqo0ZyLlKs2EtUqkYQbFNlQvl8lNb_yS_zNaXfotZFeOI3SVnKwm0zUyuvZ0Pb8ORfZa64aP9E8LLZly | 
    
| linkProvider | EBSCOhost | 
    
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=LMUAV-YOLOv8%3A%E4%BD%8E%E7%A9%BA%E6%97%A0%E4%BA%BA%E6%9C%BA%E8%A7%86%E8%A7%89%E7%9B%AE%E6%A0%87%E6%A3%80%E6%B5%8B%E8%BD%BB%E9%87%8F%E5%8C%96%E7%BD%91%E7%BB%9C&rft.jtitle=%E8%AE%A1%E7%AE%97%E6%9C%BA%E5%B7%A5%E7%A8%8B%E4%B8%8E%E5%BA%94%E7%94%A8&rft.au=%E8%91%A3%E4%B8%80%E5%85%B5&rft.au=%E6%9B%BE%E8%BE%89&rft.au=%E4%BE%AF%E5%B0%91%E6%9D%B0&rft.date=2025-02-01&rft.pub=%E6%B2%B3%E5%8C%97%E7%BB%8F%E8%B4%B8%E5%A4%A7%E5%AD%A6%E7%AE%A1%E7%90%86%E7%A7%91%E5%AD%A6%E4%B8%8E%E4%BF%A1%E6%81%AF%E5%B7%A5%E7%A8%8B%E5%AD%A6%E9%99%A2%2C%E7%9F%B3%E5%AE%B6%E5%BA%84+050061&rft.issn=1002-8331&rft.volume=61&rft.issue=3&rft.spage=94&rft.epage=110&rft_id=info:doi/10.3778%2Fj.issn.1002-8331.2407-0127&rft.externalDocID=jsjgcyyy202503008 | 
    
| thumbnail_s | http://utb.summon.serialssolutions.com/2.0.0/image/custom?url=http%3A%2F%2Fwww.wanfangdata.com.cn%2Fimages%2FPeriodicalImages%2Fjsjgcyyy%2Fjsjgcyyy.jpg |