Efficient small target detection system with composite multi-scale detection head
In real-world complex scenarios, challenges are faced by object detection due to factors such as significant variations in target scale, similarity with the background, dense and overlapping instances, and small-sized targets. To address these challenges, we optimized the detector head part through...
| Published in | Journal of electronic imaging Vol. 34; no. 3; p. 033036 |
|---|---|
| Main Authors | Ma, Yue; Yang, Ye; Zhu, Yongchang; Yang, Sen; Wang, Zenghui; Tong, Jigang |
| Format | Journal Article | 
| Language | English | 
| Published | Society of Photo-Optical Instrumentation Engineers (SPIE), 01.05.2025 |
| Subjects | |
| ISSN | 1017-9909 1560-229X  | 
| DOI | 10.1117/1.JEI.34.3.033036 | 
| Abstract | In complex real-world scenes, object detection is hampered by large variations in target scale, targets that resemble the background, dense and overlapping instances, and small targets. To address these challenges, we optimize the detection head by combining a decoupled detection head with an auxiliary detection head. Together, these two heads separate the classification and localization tasks, and the auxiliary head improves sample quality to raise accuracy. To limit the loss of small-target information during downsampling, we optimize the residual structure and the loss function so that small-target features are retained. In addition, a multi-scale path aggregation network and a multi-scale detection head retain local feature information while attending to global features. Integrating the anchor-based algorithm and introducing a loss algorithm adapts the model to dense scenes and small targets and yields more accurate localization. Experiments show a 1.7% increase in mAP on the MS COCO dataset, indicating improved performance in complex real-life scenes, and a 7.2% improvement on the TCOD dataset, indicating better performance in densely clustered or overlapping scenes and in small-target detection. The approach thus improves the accuracy and robustness of object detection and broadens its practical applications. |
    
|---|---|
| Audience | Academic | 
    
| Author | Ma, Yue; Wang, Zenghui; Zhu, Yongchang; Yang, Ye; Yang, Sen; Tong, Jigang |
    
| Author_xml | – sequence: 1 givenname: Yue surname: Ma fullname: Ma, Yue email: 2710467773@qq.com organization: Tianjin University of Technology, School of Electrical Engineering and Automation, Department, Xiqing, China – sequence: 2 givenname: Ye orcidid: 0009-0000-2439-6570 surname: Yang fullname: Yang, Ye email: yangye20220222@126.com organization: Tianjin University of Technology, School of Electrical Engineering and Automation, Department, Xiqing, China – sequence: 3 givenname: Yongchang surname: Zhu fullname: Zhu, Yongchang email: zyc24yo@163.com organization: Tianjin University of Technology, School of Electrical Engineering and Automation, Department, Xiqing, China – sequence: 4 givenname: Sen orcidid: 0000-0002-6481-2134 surname: Yang fullname: Yang, Sen email: s_yang@email.tjut.edu.cn organization: Tianjin University of Technology, School of Electrical Engineering and Automation, Department, Xiqing, China – sequence: 5 givenname: Zenghui orcidid: 0000-0003-3025-336X surname: Wang fullname: Wang, Zenghui email: wangzengh@gmail.com organization: University of South Africa, Florida, South Africa – sequence: 6 givenname: Jigang orcidid: 0000-0002-6740-4179 surname: Tong fullname: Tong, Jigang email: tjgtjut@163.com organization: Tianjin University of Technology, School of Electrical Engineering and Automation, Department, Xiqing, China  | 
    
| ContentType | Journal Article | 
    
| Copyright | 2025 SPIE and IS&T COPYRIGHT 2025 SPIE  | 
    
| Copyright_xml | – notice: 2025 SPIE and IS&T – notice: COPYRIGHT 2025 SPIE  | 
    
| DBID | AAYXX CITATION  | 
    
| DOI | 10.1117/1.JEI.34.3.033036 | 
    
| DatabaseName | CrossRef | 
    
| DatabaseTitle | CrossRef | 
    
| DatabaseTitleList | |
| DeliveryMethod | fulltext_linktorsrc | 
    
| Discipline | Applied Sciences; Visual Arts; Engineering |
    
| EISSN | 1560-229X | 
    
| EndPage | 033036 | 
    
| ExternalDocumentID | A856222677 10_1117_1_JEI_34_3_033036  | 
    
| GeographicLocations | China | 
    
| GeographicLocations_xml | – name: China | 
    
| GrantInformation_xml | – fundername: South African National Research Foundation grantid: RA22112976288; AJCR230704126719 – fundername: National Natural Science Foundation of China grantid: 62103298  | 
    
| GroupedDBID | .DC 0R~ 29K 4.4 5GY AAJMC ABDPE ABJNI ACGFO ACGFS ADMLS AENEX AKROS ALMA_UNASSIGNED_HOLDINGS CS3 D-I DU5 EBS F5P FQ0 HZ~ ITE M4X O9- P2P RNS SJN SPBNH TAE AAYXX CITATION IAO  | 
    
| ID | FETCH-LOGICAL-c277t-745e6e6eed8bf9378a28a355c5d002e181bf63e1364b453d7b94ad16763d37803 | 
    
| ISSN | 1017-9909 | 
    
| IngestDate | Mon Oct 20 16:54:46 EDT 2025 Wed Oct 01 06:05:20 EDT 2025 Thu Jul 03 03:11:43 EDT 2025  | 
    
| IsPeerReviewed | true | 
    
| IsScholarly | true | 
    
| Issue | 3 | 
    
| Keywords | spatial pyramid pooling fast; object detection; multi-scale detectors; decoupled detectors; small target detection; YOLOv7 |
    
| Language | English | 
    
| LinkModel | OpenURL | 
    
| MergedId | FETCHMERGED-LOGICAL-c277t-745e6e6eed8bf9378a28a355c5d002e181bf63e1364b453d7b94ad16763d37803 | 
    
| ORCID | 0000-0003-3025-336X 0000-0002-6481-2134 0000-0002-6740-4179 0009-0000-2439-6570  | 
    
| PageCount | 1 | 
    
| ParticipantIDs | spie_journals_10_1117_1_JEI_34_3_033036 crossref_primary_10_1117_1_JEI_34_3_033036 gale_infotracacademiconefile_A856222677  | 
    
| ProviderPackageCode | CITATION AAYXX  | 
    
| PublicationCentury | 2000 | 
    
| PublicationDate | 2025-05-01 | 
    
| PublicationDateYYYYMMDD | 2025-05-01 | 
    
| PublicationDate_xml | – month: 05 year: 2025 text: 2025-05-01 day: 01  | 
    
| PublicationDecade | 2020 | 
    
| PublicationTitle | Journal of electronic imaging | 
    
| PublicationTitleAlternate | J. Electron. Imaging | 
    
| PublicationYear | 2025 | 
    
| Publisher | Society of Photo-Optical Instrumentation Engineers SPIE  | 
    
| Publisher_xml | – name: Society of Photo-Optical Instrumentation Engineers – name: SPIE  | 
    
| SSID | ssj0011841 | 
    
| Score | 2.3986228 | 
    
| Snippet | In real-world complex scenarios, challenges are faced by object detection due to factors such as significant variations in target scale, similarity with the... | 
    
| SourceID | gale crossref spie  | 
    
| SourceType | Aggregation Database Index Database Publisher  | 
    
| StartPage | 033036 | 
    
| SubjectTerms | Algorithms; Detectors |
    
| Title | Efficient small target detection system with composite multi-scale detection head | 
    
| URI | http://www.dx.doi.org/10.1117/1.JEI.34.3.033036 | 
    
| Volume | 34 | 
    
| hasFullText | 1 | 
    
| inHoldings | 1 | 
    
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVEBS databaseName: Inspec with Full Text customDbUrl: eissn: 1560-229X dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0011841 issn: 1017-9909 databaseCode: ADMLS dateStart: 19920101 isFulltext: true titleUrlDefault: https://www.ebsco.com/products/research-databases/inspec-full-text providerName: EBSCOhost  | 
    
| link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3NT9swFLdGuYzDPtgQhW3yYdKkTQlJbMfpsYJOgMakCZhgFyuJX1kPLahJL_z1PH80SRlIMEWKWst9sfL79X3Y79mEfEYnFa84DlKAccBFOQ7yTGcYpQy0LJOBZrZc7ORnenjOjy_ERXsYp60uqYuwvH2wruR_UMU2xNVUyT4D2UYoNuBnxBfviDDen4TxyO7_YFbzq6lZYnZp3d801OBOAHf7NC_zy6c2QwtcEmFQITrQ6YtaeeXQzo6r2jkrZzK1xxq1E9lWhy8aelz6-efLpuXP34VtuJ5d2Srj-z1PfTGan3pIRJvo57WlMXFozgZddernJidttP2AlrZ1_uHx6ChkPGRhxIwlbU3Schn-nqVq8gdd5CJVrFCEYlwx5USskfUE1XvUI-vDg5Mfp82CEgayNvZejtgvcKOQvX_GseKieEPdq24m0PE8zt6QVx4HOnT4vyUvYLZJXvvwgXrlXG2Sjc7ekvjt96RauJ9V78ivhirUUoU6qtAGfuqoQg1VaEMV2qFKp6-hynty_n10tn8Y-NM0gjKRsg4kF5DiBTorxuiUZnmS5ehtlkKjVQT09IpxyiBmKS-4YFoWA57rOEUDpLF3xLZIb3Y9g21CIRcgZFxkZjohQjmgRRExrWVe8iSJ-uTr8gWqG7dpinoUsj75Yl6xMgyp53mZ-7oQfJTZmkwNM3TRMUiQEnsaFJT_B1aPy9x5zgB2ycuW3B9Ir54v4CN6mnXxybPoDr8AeYE | 
    
| linkProvider | EBSCOhost | 
    
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Efficient+small+target+detection+system+with+composite+multi-scale+detection+head&rft.jtitle=Journal+of+electronic+imaging&rft.au=Ma%2C+Yue&rft.au=Yang%2C+Ye&rft.au=Zhu%2C+Yongchang&rft.au=Yang%2C+Sen&rft.date=2025-05-01&rft.issn=1017-9909&rft.volume=34&rft.issue=3&rft_id=info:doi/10.1117%2F1.JEI.34.3.033036&rft.externalDBID=n%2Fa&rft.externalDocID=10_1117_1_JEI_34_3_033036 | 
    
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1017-9909&client=summon |