Efficient small target detection system with composite multi-scale detection head

In real-world complex scenarios, challenges are faced by object detection due to factors such as significant variations in target scale, similarity with the background, dense and overlapping instances, and small-sized targets. To address these challenges, we optimized the detector head part through...

Full description

Saved in:
Bibliographic Details
Published inJournal of electronic imaging Vol. 34; no. 3; p. 033036
Main Authors Ma, Yue, Yang, Ye, Zhu, Yongchang, Yang, Sen, Wang, Zenghui, Tong, Jigang
Format Journal Article
LanguageEnglish
Published Society of Photo-Optical Instrumentation Engineers 01.05.2025
SPIE
Subjects
Online AccessGet full text
ISSN1017-9909
1560-229X
DOI10.1117/1.JEI.34.3.033036

Cover

Abstract In real-world complex scenarios, challenges are faced by object detection due to factors such as significant variations in target scale, similarity with the background, dense and overlapping instances, and small-sized targets. To address these challenges, we optimized the detector head part through the employment of decoupled detector head and auxiliary detector head algorithms. The classification and localization tasks are partitioned by these two algorithms, and the sample quality is enhanced by the auxiliary detector head to improve accuracy. To solve the information loss of small targets in the model downsampling, the residual structure and loss function are optimized so that the feature information of small targets is retained. Moreover, a multi-scale path aggregation network and a multi-scale detection head are adopted to retain local feature information while focusing on global features. Furthermore, dense scenes and small targets are adapted to by the integration of the anchor-based algorithm and the introduction of a loss algorithm, and more accurate localization is facilitated. Experimental results show a 1.7% increase in mAP on the MS COCO dataset, indicating improved performance in complex real-life scenes. A 7.2% enhancement on the TCOD dataset is shown, indicating better performance in densely clustered or overlapping scenes and small target detection. Challenges such as variations in target size, background similarity, dense and overlapping instances, and small-sized targets are tackled, and the accuracy and robustness of object detection are improved. The practical applications of the proposed approach are expanded.
AbstractList In real-world complex scenarios, challenges are faced by object detection due to factors such as significant variations in target scale, similarity with the background, dense and overlapping instances, and small-sized targets. To address these challenges, we optimized the detector head part through the employment of decoupled detector head and auxiliary detector head algorithms. The classification and localization tasks are partitioned by these two algorithms, and the sample quality is enhanced by the auxiliary detector head to improve accuracy. To solve the information loss of small targets in the model downsampling, the residual structure and loss function are optimized so that the feature information of small targets is retained. Moreover, a multi-scale path aggregation network and a multi-scale detection head are adopted to retain local feature information while focusing on global features. Furthermore, dense scenes and small targets are adapted to by the integration of the anchor-based algorithm and the introduction of a loss algorithm, and more accurate localization is facilitated. Experimental results show a 1.7% increase in mAP on the MS COCO dataset, indicating improved performance in complex real-life scenes. A 7.2% enhancement on the TCOD dataset is shown, indicating better performance in densely clustered or overlapping scenes and small target detection. Challenges such as variations in target size, background similarity, dense and overlapping instances, and small-sized targets are tackled, and the accuracy and robustness of object detection are improved. The practical applications of the proposed approach are expanded.
Audience Academic
Author Ma, Yue
Wang, Zenghui
Zhu, Yongchang
Yang, Ye
Yang, Sen
Tong, Jigang
Author_xml – sequence: 1
  givenname: Yue
  surname: Ma
  fullname: Ma, Yue
  email: 2710467773@qq.com
  organization: Tianjin University of Technology, School of Electrical Engineering and Automation, Department, Xiqing, China
– sequence: 2
  givenname: Ye
  orcidid: 0009-0000-2439-6570
  surname: Yang
  fullname: Yang, Ye
  email: yangye20220222@126.com
  organization: Tianjin University of Technology, School of Electrical Engineering and Automation, Department, Xiqing, China
– sequence: 3
  givenname: Yongchang
  surname: Zhu
  fullname: Zhu, Yongchang
  email: zyc24yo@163.com
  organization: Tianjin University of Technology, School of Electrical Engineering and Automation, Department, Xiqing, China
– sequence: 4
  givenname: Sen
  orcidid: 0000-0002-6481-2134
  surname: Yang
  fullname: Yang, Sen
  email: s_yang@email.tjut.edu.cn
  organization: Tianjin University of Technology, School of Electrical Engineering and Automation, Department, Xiqing, China
– sequence: 5
  givenname: Zenghui
  orcidid: 0000-0003-3025-336X
  surname: Wang
  fullname: Wang, Zenghui
  email: wangzengh@gmail.com
  organization: University of South Africa, Florida, South Africa
– sequence: 6
  givenname: Jigang
  orcidid: 0000-0002-6740-4179
  surname: Tong
  fullname: Tong, Jigang
  email: tjgtjut@163.com
  organization: Tianjin University of Technology, School of Electrical Engineering and Automation, Department, Xiqing, China
BookMark eNp9kNtKAzEQhoMo2FYfwLt9gV0zyW6ye1lK1UpBBAXvQppM2pQ9lE2K9O2NVNArmYsZhv-fwzcll_3QIyF3QAsAkPdQPC9XBS8LXlDOKRcXZAKVoDljzcdlqinIvGloc02mIewpBahLmJDXpXPeeOxjFjrdtlnU4xZjZjGiiX7os3AKEbvs08ddZobuMAQfMeuObfR5MLrFP9odantDrpxuA97-5Bl5f1i-LZ7y9cvjajFf54ZJGXNZVihSoK03ruGy1qzWvKpMZSllCDVsnOAIXJSbsuJWbppSWxBScJvUlM9IcZ67TTco37shjtqksNh5k-g4n_rzuhKMMSFlMsDZYMYhhBGdOoy-0-NJAVXfEBWoBFHxUnF1hvi7JBw8qv1wHPv00z-GLzuldVM
ContentType Journal Article
Copyright 2025 SPIE and IS&T
COPYRIGHT 2025 SPIE
Copyright_xml – notice: 2025 SPIE and IS&T
– notice: COPYRIGHT 2025 SPIE
DBID AAYXX
CITATION
DOI 10.1117/1.JEI.34.3.033036
DatabaseName CrossRef
DatabaseTitle CrossRef
DatabaseTitleList

DeliveryMethod fulltext_linktorsrc
Discipline Applied Sciences
Visual Arts
Engineering
EISSN 1560-229X
EndPage 033036
ExternalDocumentID A856222677
10_1117_1_JEI_34_3_033036
GeographicLocations China
GeographicLocations_xml – name: China
GrantInformation_xml – fundername: South African National Research Foundation
  grantid: RA22112976288; AJCR230704126719
– fundername: National Natural Science Foundation of China
  grantid: 62103298
GroupedDBID .DC
0R~
29K
4.4
5GY
AAJMC
ABDPE
ABJNI
ACGFO
ACGFS
ADMLS
AENEX
AKROS
ALMA_UNASSIGNED_HOLDINGS
CS3
D-I
DU5
EBS
F5P
FQ0
HZ~
ITE
M4X
O9-
P2P
RNS
SJN
SPBNH
TAE
AAYXX
CITATION
IAO
ID FETCH-LOGICAL-c277t-745e6e6eed8bf9378a28a355c5d002e181bf63e1364b453d7b94ad16763d37803
ISSN 1017-9909
IngestDate Mon Oct 20 16:54:46 EDT 2025
Wed Oct 01 06:05:20 EDT 2025
Thu Jul 03 03:11:43 EDT 2025
IsPeerReviewed true
IsScholarly true
Issue 3
Keywords spatial pyramid pooling fast
object detection
multi-scale detectors
decoupled detectors
small target detection
YOLOv7
Language English
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-c277t-745e6e6eed8bf9378a28a355c5d002e181bf63e1364b453d7b94ad16763d37803
ORCID 0000-0003-3025-336X
0000-0002-6481-2134
0000-0002-6740-4179
0009-0000-2439-6570
PageCount 1
ParticipantIDs spie_journals_10_1117_1_JEI_34_3_033036
crossref_primary_10_1117_1_JEI_34_3_033036
gale_infotracacademiconefile_A856222677
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate 2025-05-01
PublicationDateYYYYMMDD 2025-05-01
PublicationDate_xml – month: 05
  year: 2025
  text: 2025-05-01
  day: 01
PublicationDecade 2020
PublicationTitle Journal of electronic imaging
PublicationTitleAlternate J. Electron. Imaging
PublicationYear 2025
Publisher Society of Photo-Optical Instrumentation Engineers
SPIE
Publisher_xml – name: Society of Photo-Optical Instrumentation Engineers
– name: SPIE
SSID ssj0011841
Score 2.3986228
Snippet In real-world complex scenarios, challenges are faced by object detection due to factors such as significant variations in target scale, similarity with the...
SourceID gale
crossref
spie
SourceType Aggregation Database
Index Database
Publisher
StartPage 033036
SubjectTerms Algorithms
Detectors
Title Efficient small target detection system with composite multi-scale detection head
URI http://www.dx.doi.org/10.1117/1.JEI.34.3.033036
Volume 34
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVEBS
  databaseName: Inspec with Full Text
  customDbUrl:
  eissn: 1560-229X
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0011841
  issn: 1017-9909
  databaseCode: ADMLS
  dateStart: 19920101
  isFulltext: true
  titleUrlDefault: https://www.ebsco.com/products/research-databases/inspec-full-text
  providerName: EBSCOhost
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3NT9swFLdGuYzDPtgQhW3yYdKkTQlJbMfpsYJOgMakCZhgFyuJX1kPLahJL_z1PH80SRlIMEWKWst9sfL79X3Y79mEfEYnFa84DlKAccBFOQ7yTGcYpQy0LJOBZrZc7ORnenjOjy_ERXsYp60uqYuwvH2wruR_UMU2xNVUyT4D2UYoNuBnxBfviDDen4TxyO7_YFbzq6lZYnZp3d801OBOAHf7NC_zy6c2QwtcEmFQITrQ6YtaeeXQzo6r2jkrZzK1xxq1E9lWhy8aelz6-efLpuXP34VtuJ5d2Srj-z1PfTGan3pIRJvo57WlMXFozgZddernJidttP2AlrZ1_uHx6ChkPGRhxIwlbU3Schn-nqVq8gdd5CJVrFCEYlwx5USskfUE1XvUI-vDg5Mfp82CEgayNvZejtgvcKOQvX_GseKieEPdq24m0PE8zt6QVx4HOnT4vyUvYLZJXvvwgXrlXG2Sjc7ekvjt96RauJ9V78ivhirUUoU6qtAGfuqoQg1VaEMV2qFKp6-hynty_n10tn8Y-NM0gjKRsg4kF5DiBTorxuiUZnmS5ehtlkKjVQT09IpxyiBmKS-4YFoWA57rOEUDpLF3xLZIb3Y9g21CIRcgZFxkZjohQjmgRRExrWVe8iSJ-uTr8gWqG7dpinoUsj75Yl6xMgyp53mZ-7oQfJTZmkwNM3TRMUiQEnsaFJT_B1aPy9x5zgB2ycuW3B9Ir54v4CN6mnXxybPoDr8AeYE
linkProvider EBSCOhost
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Efficient+small+target+detection+system+with+composite+multi-scale+detection+head&rft.jtitle=Journal+of+electronic+imaging&rft.au=Ma%2C+Yue&rft.au=Yang%2C+Ye&rft.au=Zhu%2C+Yongchang&rft.au=Yang%2C+Sen&rft.date=2025-05-01&rft.issn=1017-9909&rft.volume=34&rft.issue=3&rft_id=info:doi/10.1117%2F1.JEI.34.3.033036&rft.externalDBID=n%2Fa&rft.externalDocID=10_1117_1_JEI_34_3_033036
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1017-9909&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1017-9909&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1017-9909&client=summon